Hugging Face
Models: 4,002
Sort: Trending
Active filters: dpo
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3 • Text Generation • Updated Jun 8 • 82 • 5
QuantFactory/NeuralDaredevil-8B-abliterated-GGUF • Text Generation • Updated May 29 • 3.94k • 51
mradermacher/Phoenix-GGUF • Updated 10 days ago • 63 • 1
mradermacher/Phoenix-i1-GGUF • Updated 10 days ago • 322 • 1
nvidia/Llama3-70B-DPO-Chat • Updated Jun 14 • 8 • 3
mlabonne/TwinLlama-3.1-8B-DPO • Text Generation • Updated Oct 6 • 125 • 3
v000000/L3.1-Niitorm-8B-DPO-t0.0001 • Text Generation • Updated Oct 3 • 2.57k • 7
v000000/L3.1-Niitorm-8B-DPO-t0.0001-GGUFs-IMATRIX • Updated Oct 6 • 163 • 2
tanliboy/lambda-qwen2.5-14b-dpo-test • Text Generation • Updated Sep 20 • 2.63k • 7
v000000/Qwen2.5-Lumen-14B • Text Generation • Updated Oct 3 • 2.94k • 18
v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno • Text Generation • Updated Sep 30 • 2.78k • 5
QuantFactory/Qwen2.5-Lumen-14B-GGUF • Text Generation • Updated Sep 21 • 207 • 3
mradermacher/Qwen2.5-Lumen-14B-GGUF • Updated Sep 22 • 122 • 4
tanliboy/lambda-qwen2.5-32b-dpo-test • Text Generation • Updated Sep 22 • 2.63k • 4
mradermacher/Qwen2.5-Lumen-14B-i1-GGUF • Updated Sep 22 • 461 • 8
trl-lib/Qwen2-0.5B-DPO • Text Generation • Updated Sep 27 • 82 • 4
HumanLLMs/Human-Like-LLama3-8B-Instruct • Updated Oct 7 • 72 • 2
pbevan11/Mistral-Nemo-MCAI-SFT-DPO-revision-only • Text Generation • Updated Oct 5 • 31 • 1
HumanLLMs/Human-Like-Qwen2.5-7B-Instruct • Updated Oct 7 • 59 • 3
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-GGUF • Updated 10 days ago • 43 • 1
mradermacher/Mistral-Nemo-Instruct-MCAI-SFT-DPO-revision-only-i1-GGUF • Updated 10 days ago • 102 • 1
mradermacher/Qwen2.5-14B-Wernicke-DPO-GGUF • Updated Oct 25 • 75 • 1
mradermacher/Qwen2.5-14B-Wernicke-DPO-i1-GGUF • Updated Oct 25 • 577 • 3
mradermacher/mistral-7b-dpo-constitutional-ai-GGUF • Updated Oct 31 • 158 • 1
VAGOsolutions/SauerkrautLM-v2-14b-DPO • Updated Nov 7 • 485 • 18
andito/SmolLM2-1.7B-Instruct-F16-GGUF • Updated Oct 31 • 479 • 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-GGUF • Updated Nov 15 • 49 • 1
mradermacher/CapybaraHermes-2.5-Mistral-7B-i1-GGUF • Updated Nov 15 • 525 • 1
mradermacher/Humanish-Qwen2.5-7B-Instruct-GGUF • Updated 10 days ago • 217 • 1
mradermacher/Humanish-Qwen2.5-7B-Instruct-i1-GGUF • Updated 10 days ago • 642 • 1