Commit 619e0ef (verified) · committed by EryriLabs · 1 parent: b026be4

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ deepseek-r1-distill-llama-thinking-farmer-8b.bf16.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,46 @@
+ ---
+ base_model:
+ - CopyleftCultivars/llama-3.1-natural-farmer-16bit
+ - deepseek-ai/DeepSeek-R1-Distill-Llama-8B
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ - autoquant
+ - gguf
+ ---
+ # DeepSeek-R1-Distill-Llama-Thinking-Farmer-8B
+
+
+ <figure>
+ <img src="farm.png" alt=" DeepSeek-R1-Distill-Llama-Thinking-Farmer-8B" width="300">
+ </figure>
+
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the SLERP merge method.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [CopyleftCultivars/llama-3.1-natural-farmer-16bit](https://huggingface.co/CopyleftCultivars/llama-3.1-natural-farmer-16bit)
+ * [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+   - model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
+   - model: CopyleftCultivars/llama-3.1-natural-farmer-16bit
+ merge_method: slerp
+ base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-8B
+ dtype: bfloat16
+ parameters:
+   t: [0, 0.5, 0.25]
+ ```
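
Reviewer note on the README's merge configuration: the sketch below illustrates what a SLERP merge with `t: [0, 0.5, 0.25]` plausibly computes. It is the textbook spherical interpolation formula in plain Python, not mergekit's actual code. Two assumptions are made here and are not stated in the commit: that `t = 0` returns the `base_model` weights and `t = 1` the other model's, and that a list-valued `t` is treated as a gradient whose anchor values are linearly interpolated across layer depth. The names `slerp` and `layer_t`, and the layer count of 32, are illustrative only.

```python
import math


def slerp(t, v0, v1, eps=1e-6):
    """Spherically interpolate between two flattened weight vectors.

    t = 0 returns v0 (assumed here to be the base model), t = 1 returns v1.
    """
    n0 = math.sqrt(sum(x * x for x in v0))
    n1 = math.sqrt(sum(y * y for y in v1))
    if n0 == 0.0 or n1 == 0.0:
        # No direction to interpolate along: fall back to linear interpolation.
        return [(1 - t) * x + t * y for x, y in zip(v0, v1)]

    # Angle between the two vectors, clamped against rounding error.
    cos_theta = sum(x * y for x, y in zip(v0, v1)) / (n0 * n1)
    theta = math.acos(max(-1.0, min(1.0, cos_theta)))
    sin_theta = math.sin(theta)
    if sin_theta < eps:
        # Nearly parallel vectors: SLERP degenerates to linear interpolation.
        return [(1 - t) * x + t * y for x, y in zip(v0, v1)]

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * x + w1 * y for x, y in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Assumed gradient semantics: interpolate anchor values over layer depth."""
    if len(anchors) == 1 or num_layers <= 1:
        return float(anchors[0])
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = min(int(pos), len(anchors) - 2)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[lo + 1]


if __name__ == "__main__":
    # Hypothetical 32-layer model: print the per-layer blend factor.
    for idx in (0, 8, 16, 24, 31):
        print(idx, round(layer_t([0, 0.5, 0.25], idx, 32), 3))
```

Under these assumptions the first layers stay close to DeepSeek-R1-Distill-Llama-8B, the middle layers blend the two models about equally, and the last layers lean three quarters toward the base model.
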
deepseek-r1-distill-llama-thinking-farmer-8b.bf16.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:40fedc139e64bf2317821f4f40820cbcc4becd519ce84085aa9ec57e7bc044ed
+ size 16068894048
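
Reviewer note on the pointer file: the three added lines are a Git LFS pointer, not the model weights themselves. The `size` field is 16068894048 bytes, about 16.07 GB, which is in line with an 8B-parameter model stored as bf16 at two bytes per weight. A minimal sketch for checking a downloaded copy against the pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any library, and the local file path in the usage comment is an assumption.

```python
import hashlib

# Pointer contents copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:40fedc139e64bf2317821f4f40820cbcc4becd519ce84085aa9ec57e7bc044ed
size 16068894048
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Stream the file once, comparing byte count and SHA-256 to the pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    ptr = parse_lfs_pointer(POINTER)
    print(ptr["algo"], ptr["size"], round(ptr["size"] / 1e9, 2), "GB")
    # Usage, assuming the GGUF file was downloaded to the working directory:
    # matches_pointer("deepseek-r1-distill-llama-thinking-farmer-8b.bf16.gguf", ptr)
```

Streaming in chunks keeps memory use flat, which matters for a file of this size.
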