Models fine-tuned for multiple choice question answering (mc) and mathematical reasoning (gsm8k). https://arxiv.org/abs/2407.07890
Ricardo
ricdomolm
AI & ML interests
LLMs
Recent Activity
updated
a dataset
about 3 hours ago
ricdomolm/MATH-500
published
a dataset
about 3 hours ago
ricdomolm/MATH-500
updated
a model
8 days ago
ricdomolm/pythia-1.4b-sft-gsm8k-3e
Organizations
None yet
Collections
1
models
67
ricdomolm/pythia-1.4b-sft-gsm8k-3e
Text Generation
•
Updated
•
336
ricdomolm/pythia-1.4b-sft-gsm8k-1e
Text Generation
•
Updated
•
20
ricdomolm/ml4331-reward-model
Text Generation
•
Updated
•
270
ricdomolm/ml4331-reward-model2
Text Generation
•
Updated
•
6
ricdomolm/ml4331-dpo-model
Text Generation
•
Updated
•
222
ricdomolm/ml4331-instruction-model
Text Generation
•
Updated
•
378
ricdomolm/test-model
Updated
ricdomolm/SmolLM2-135M-SFT-Alpaca
Updated
ricdomolm/reward-model-exercise
Updated
ricdomolm/lawma-8b
Text Generation
•
Updated
•
2.38k
•
6
datasets
16
ricdomolm/MATH-500
Viewer
•
Updated
•
12.5k
ricdomolm/caselawqa_leaderboard_results
Updated
•
1.27k
ricdomolm/caselawqa_leaderboard_requests
Viewer
•
Updated
•
29
•
1.23k
ricdomolm/lawma-instructions_gemma2_8k
Viewer
•
Updated
•
554k
•
71
ricdomolm/lawma-instructions_llama3_16k
Viewer
•
Updated
•
554k
•
32
ricdomolm/lawma-instructions_llama3_8k
Viewer
•
Updated
•
554k
•
54
ricdomolm/lawma-instructions
Viewer
•
Updated
•
554k
•
38
ricdomolm/lawma-tasks
Viewer
•
Updated
•
692k
•
989
•
2
ricdomolm/lawma-task-files
Updated
•
29
ricdomolm/caselawqa-8k
Viewer
•
Updated
•
16.1k
•
37
•
2