Triangle104 commited on
Commit
fe17ab7
·
verified ·
1 Parent(s): 4f7a1e5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md CHANGED
@@ -24,6 +24,30 @@ tags:
24
  This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
25
  Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Use with llama.cpp
28
  Install llama.cpp through brew (works on Mac and Linux)
29
 
 
24
  This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
25
  Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
26
 
27
+ ---
28
+
29
+ Dumpling-Qwen2.5-32B
30
+
31
+ nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
32
+
33
+ nbeerbower/GreatFirewall-DPO
34
+ nbeerbower/Schule-DPO
35
+ nbeerbower/Purpura-DPO
36
+ nbeerbower/Arkhaios-DPO
37
+ jondurbin/truthy-dpo-v0.1
38
+ antiven0m/physical-reasoning-dpo
39
+ flammenai/Date-DPO-NoAsterisks
40
+ flammenai/Prude-Phi3-DPO
41
+ Atsunori/HelpSteer2-DPO (1,000 samples)
42
+ jondurbin/gutenberg-dpo-v0.1
43
+ nbeerbower/gutenberg2-dpo
44
+ nbeerbower/gutenberg-moderne-dpo.
45
+
46
+ Method
47
+
48
+ QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
49
+
50
+ ---
51
  ## Use with llama.cpp
52
  Install llama.cpp through brew (works on Mac and Linux)
53