Triangle104
/

Dumpling-Qwen2.5-1.5B-Q4_K_S-GGUF

Inference Endpoints

Model card Files Files and versions Community

Triangle104 commited on 15 days ago

Commit

fe17ab7

·

verified ·

1 Parent(s): 4f7a1e5

Update README.md

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -24,6 +24,30 @@ tags:
 This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
+---
+Dumpling-Qwen2.5-32B
+nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
+    nbeerbower/GreatFirewall-DPO
+    nbeerbower/Schule-DPO
+    nbeerbower/Purpura-DPO
+    nbeerbower/Arkhaios-DPO
+    jondurbin/truthy-dpo-v0.1
+    antiven0m/physical-reasoning-dpo
+    flammenai/Date-DPO-NoAsterisks
+    flammenai/Prude-Phi3-DPO
+    Atsunori/HelpSteer2-DPO (1,000 samples)
+    jondurbin/gutenberg-dpo-v0.1
+    nbeerbower/gutenberg2-dpo
+    nbeerbower/gutenberg-moderne-dpo.
+Method
+QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)