Orpo finetuned models
Muhammad Bin Usman
Muhammad2003
AI & ML interests
- Model Alignment (SFT / DPO / ORPO )
- Model Merging / Pruning / MoE + latest tecniques
- Instruction tuning and Preference datasets curation
- Evaluation
Recent Activity
updated
a model
about 2 hours ago
Muhammad2003/Llama3-LegalLM
published
a model
about 2 hours ago
Muhammad2003/Llama3-LegalLM
upvoted
a
collection
7 months ago
haiku
Organizations
models
21

Muhammad2003/Llama3-LegalLM
Updated

Muhammad2003/router-classifier
Text Classification
•
Updated
•
114

Muhammad2003/router-embedding
Sentence Similarity
•
Updated
•
9
•
1

Muhammad2003/TriMistral-7B-TIES
Text Generation
•
Updated
•
36

Muhammad2003/TriMistral-7B-SLERP
Text Generation
•
Updated
•
129

Muhammad2003/TriMistral-7B-MODELSTOCK
Text Generation
•
Updated
•
60

Muhammad2003/TriMistral-7B-DARETIES
Text Generation
•
Updated
•
11

Muhammad2003/Llama-3-8B-DPO-500
Text Generation
•
Updated
•
9

Muhammad2003/Llama-3-8B-DPO-1500
Text Generation
•
Updated
•
7

Muhammad2003/Llama-3-8B-DPO-1000
Text Generation
•
Updated
•
11
datasets
7
Muhammad2003/routing-dataset
Viewer
•
Updated
•
14.3k
•
70
Muhammad2003/OpenMed_11k_train
Viewer
•
Updated
•
11.3k
•
95
Muhammad2003/OpenMed_11k
Viewer
•
Updated
•
11.7k
•
70
Muhammad2003/GrandMed_364k
Viewer
•
Updated
•
364k
•
64
Muhammad2003/Nectar-DPO-50k
Viewer
•
Updated
•
50k
•
72
Muhammad2003/Big_Pretrain_11K
Viewer
•
Updated
•
11.7k
•
62
Muhammad2003/Toxic_PreTrain_8k
Viewer
•
Updated
•
8.41k
•
70