DevQuasar/deepseek-ai.DeepSeek-R1-Zero-bf16
Text Generation
•
Updated
I've rerun hellaswag with the suggested config, the results haven't improved:
Tasks | Version | Filter | n-shot | Metric | Value | Stderr | ||
---|---|---|---|---|---|---|---|---|
hellaswag | 1 | none | 0 | acc | ↑ | 0.5559 | ± | 0.0050 |
none | 0 | acc_norm | ↑ | 0.7436 | ± | 0.0044 |
command:accelerate launch -m lm_eval --model hf --model_args pretrained=deepseek-ai/DeepSeek-R1-Distill-Llama-8B,parallelize=True,dtype="float16" --tasks hellaswag --batch_size auto:4 --log_samples --output_path eval_results --gen_kwargs temperature=0.6,top_p=0.95,generate_until=64,do_sample=True
Thx, will try
Thx, will try