Triangle104 commited on
Commit
9ae4e0b
·
verified ·
1 Parent(s): 41a67c8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -0
README.md CHANGED
@@ -24,6 +24,29 @@ tags:
24
  This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
25
  Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
26
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
  ## Use with llama.cpp
28
  Install llama.cpp through brew (works on Mac and Linux)
29
 
 
24
  This model was converted to GGUF format from [`nbeerbower/Dumpling-Qwen2.5-1.5B`](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
25
  Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Qwen2.5-1.5B) for more details on the model.
26
 
27
+ ---
28
+ Dumpling-Qwen2.5-32B
29
+
30
+ nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
31
+
32
+ nbeerbower/GreatFirewall-DPO
33
+ nbeerbower/Schule-DPO
34
+ nbeerbower/Purpura-DPO
35
+ nbeerbower/Arkhaios-DPO
36
+ jondurbin/truthy-dpo-v0.1
37
+ antiven0m/physical-reasoning-dpo
38
+ flammenai/Date-DPO-NoAsterisks
39
+ flammenai/Prude-Phi3-DPO
40
+ Atsunori/HelpSteer2-DPO (1,000 samples)
41
+ jondurbin/gutenberg-dpo-v0.1
42
+ nbeerbower/gutenberg2-dpo
43
+ nbeerbower/gutenberg-moderne-dpo.
44
+
45
+ Method
46
+
47
+ QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
48
+
49
+ ---
50
  ## Use with llama.cpp
51
  Install llama.cpp through brew (works on Mac and Linux)
52