Commit 29f35fb by DavidAU (verified) · 1 Parent(s): cf1462a

Update README.md

Files changed (1): README.md +1 -1
@@ -76,7 +76,7 @@ Several prompts and outputs below.

  <B>QUANTS From Float 32 (32-bit) Source:</B>

- - All quants have been "refreshed", quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
+ - All quants have been quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
  - All quants have also been upgraded with "more bits" for output tensor (all set at Q8_0) and embed for better performance (this is in addition to the "refresh")
  - New specialized quants (in addition to the new refresh/upgrades): "max, max-cpu" (will include this in the file name) for quants "Q2K", "IQ4_XS", "Q6_K" and "Q8_0"
  - "MAX": output tensor / embed at float 32. You get better instruction following/output generation than standard/upgraded quants.
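
The README does not say how the output tensor and embed overrides were applied. As a rough sketch only, assuming the quants were produced with llama.cpp's `llama-quantize` tool and its `--output-tensor-type` and `--token-embedding-type` options, the "upgraded" (Q8_0) and "MAX" (float 32) variants could be requested as below. The helper `build_quantize_command` and all file names are hypothetical and are not taken from the repository.

```python
from typing import List


def build_quantize_command(
    src_gguf: str,
    dst_gguf: str,
    quant_type: str,
    tensor_type: str = "q8_0",
) -> List[str]:
    """Build an argv list for llama.cpp's llama-quantize (assumed tool).

    tensor_type is applied to both the output tensor and the token
    embedding tensor: "q8_0" mirrors the "upgraded" quants described in
    the README, "f32" mirrors the "MAX" quants.
    """
    return [
        "llama-quantize",
        "--output-tensor-type", tensor_type,
        "--token-embedding-type", tensor_type,
        src_gguf,    # float 32 source model (hypothetical file name)
        dst_gguf,    # quantized output file (hypothetical file name)
        quant_type,  # e.g. "Q6_K", "IQ4_XS", "Q8_0"
    ]


# "Upgraded" quant: output tensor and embed at Q8_0.
upgraded = build_quantize_command("model-f32.gguf", "model-Q6_K.gguf", "Q6_K")

# "MAX" quant: output tensor and embed kept at float 32.
maxed = build_quantize_command(
    "model-f32.gguf", "model-Q6_K-max.gguf", "Q6_K", tensor_type="f32"
)

print(" ".join(upgraded))
print(" ".join(maxed))
```

The list form can be passed to `subprocess.run` unchanged if `llama-quantize` is on the `PATH`; the commands are only printed here so the sketch runs without llama.cpp installed.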