DavidAU/Llama-3.2-1B-Instruct-NEO-SI-FI-GGUF


DavidAU committed on Oct 26, 2024

Commit 66fa526 · verified · 1 Parent(s): 98dd720

Update README.md

Files changed (1)

README.md +40 -0


@@ -95,6 +95,46 @@ ARM QUANTS:
 3 "ARM" quants have been added to the repo (23/10/2024). These quants are machine specific.
+<B>Settings, Quants and Critical Operations Notes:</b>
+Changes in temp (e.g. 0.4, 0.8, 1.5, 2, 3) will drastically alter output.
+Rep pen settings will also alter output.
+This model needs "rep pen" of 1.02 or higher.
+For role play: Rep pen of 1.05 to 1.08 is suggested.
+Raise/lower rep pen SLOWLY, e.g. 1.011, 1.012 ...
+Rep pen will alter prose, word choice (a lower rep pen sometimes means smaller words, or more of them) and creativity.
+To really push the model:
+Rep pen 1.05 or lower / Temp 3+ ... be ready to stop the output, because it may run on and on at these strong settings.
+Longer prompts vastly increase the quality of the model's output.
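
The settings notes above can be captured in a small helper. This is only a sketch, not part of the README: `SamplerSettings`, `make_settings`, `rep_pen_sweep` and the two presets are hypothetical names, and the preset values are illustrative picks from the ranges quoted above. The field names `temperature` and `repeat_penalty` are chosen to resemble the keyword names used by common llama.cpp front ends, but check your own tool's documentation before passing them through.

```python
from dataclasses import dataclass

# The notes say this model needs a rep pen of 1.02 or higher.
MIN_REP_PEN = 1.02


@dataclass(frozen=True)
class SamplerSettings:
    """Hypothetical container for the two settings the notes discuss."""
    temperature: float
    repeat_penalty: float


def make_settings(temperature: float, repeat_penalty: float) -> SamplerSettings:
    """Build settings, rejecting a rep pen below the stated minimum."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if repeat_penalty < MIN_REP_PEN:
        raise ValueError(f"repeat_penalty should be at least {MIN_REP_PEN}")
    return SamplerSettings(temperature, repeat_penalty)


def rep_pen_sweep(start: float, stop: float, step: float = 0.001) -> list[float]:
    """Rep pen values from start to stop (inclusive) in small increments,
    mirroring the advice to raise or lower rep pen slowly."""
    count = round((stop - start) / step)
    return [round(start + i * step, 3) for i in range(count + 1)]


# Illustrative presets (assumed values inside the quoted ranges).
ROLE_PLAY = make_settings(temperature=0.8, repeat_penalty=1.06)
PUSHED = make_settings(temperature=3.0, repeat_penalty=1.05)
```

A sweep such as `rep_pen_sweep(1.02, 1.03)` gives a list of values to try one at a time while comparing the output at each step.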
+QUANT CHOICE(S):
+Higher quants will have more detail, nuance and in some cases stronger "emotional" levels. Characters will also be
+more "fleshed out". Sense of "there" will also increase.
+Q4KM/Q4KS are good, strong quants; however, if you can run Q5, Q6 or Q8, go for the highest quant you can.
+This repo also has 3 "ARM" quants for computers that support them. If you use these on a "non arm" machine, tokens per second will be very low.
+IQ4XS: Due to the unusual nature of this quant (mixture/processing), generations from it will be different than those from other quants.
+You may want to try it and compare its output to that of other quants.
+Special note on Q2k/Q3 quants:
+You may need to use temp 2 or lower with these quants (1 or lower for q2k). There is simply too much compression at this level, which damages the model. I will see if Imatrix versions
+of these quants function better.
+Rep pen adjustments may also be required to get the most out of this model at this/these quant level(s).
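
The temperature ceilings for the low quants can be applied as a simple clamp. The sketch below is an assumption-laden illustration: `TEMP_CAPS` and `cap_temperature` are hypothetical names, and the prefix matching assumes quant labels such as `Q2_K` or `Q3_K_M`. The notes only say you "may need" these limits, so treat the caps as starting points, not hard rules.

```python
# Temperature ceilings quoted in the notes for heavily compressed quants.
TEMP_CAPS = {"q2k": 1.0, "q3": 2.0}


def cap_temperature(quant: str, temperature: float) -> float:
    """Clamp temperature for Q2K/Q3 quants; leave other quants untouched."""
    # Normalise labels like "Q2_K" or "Q3_K_M" to "q2k" / "q3km".
    name = quant.lower().replace("_", "")
    for prefix, cap in TEMP_CAPS.items():
        if name.startswith(prefix):
            return min(temperature, cap)
    return temperature
```

For example, requesting temp 3 with a Q3 quant would be lowered to 2, while the same request with Q4KM or higher passes through unchanged.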
 <b>Optional Enhancement:</B>
 The following can be used in place of the "system prompt" or "system role" to further enhance the model.