Can you make a 2.25bpw quantization for this model?
#4 opened 20 days ago by xldistance
Reason for high performance may be an error in evaluation
#3 opened 3 months ago by ChuckMcSneed, 4 comments
What is your "continuous finetuning"?
#2 opened 3 months ago by MaziyarPanahi, 7 comments