DavidAU
/

L3-Grand-Story-Darkness-MOE-4X8-24.9B-e32-GGUF

Text Generation

mixture of experts

Mixture of Experts

32 bit enhanced

float 32 quants

creative writing

fiction writing

plot generation

sub-plot generation

story generation

science fiction

Inference Endpoints

Model card Files Files and versions Community

DavidAU commited on Jan 3

Commit

29f35fb

·

verified ·

1 Parent(s): cf1462a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -76,7 +76,7 @@ Several prompts and outputs below.
 <B>QUANTS From Float 32 (32-bit) Source:</B>
-- All quants have been "refreshed", quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
 - All quants have also been upgraded with "more bits" for output tensor (all set at Q8_0) and embed for better performance (this is in addition to the "refresh")
 - New specialized quants (in addition to the new refresh/upgrades): "max, max-cpu" (will include this in the file name) for quants "Q2K", "IQ4_XS", "Q6_K" and "Q8_0"
 - "MAX": output tensor / embed at float 32. You get better instruction following/output generation than standard/upgraded quants.

 <B>QUANTS From Float 32 (32-bit) Source:</B>
+- All quants have been quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
 - All quants have also been upgraded with "more bits" for output tensor (all set at Q8_0) and embed for better performance (this is in addition to the "refresh")
 - New specialized quants (in addition to the new refresh/upgrades): "max, max-cpu" (will include this in the file name) for quants "Q2K", "IQ4_XS", "Q6_K" and "Q8_0"
 - "MAX": output tensor / embed at float 32. You get better instruction following/output generation than standard/upgraded quants.