Steelskull commited on
Commit
9ebf166
·
verified ·
1 Parent(s): 445a551

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -33
README.md CHANGED
@@ -1,38 +1,6 @@
1
  ---
2
- base_model:
3
- - deepseek-ai/DeepSeek-R1-Distill-Llama-70B
4
- - nbeerbower/Llama-3.1-Nemotron-lorablated-70B
5
- - abacusai/Dracarys2-Llama-3.1-70B-Instruct
6
  library_name: transformers
7
  tags:
8
  - mergekit
9
  - merge
10
-
11
- ---
12
- # merge
13
-
14
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
15
-
16
- ## Merge Details
17
- ### Merge Method
18
-
19
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [deepseek-ai/DeepSeek-R1-Distill-Llama-70B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) as a base.
20
-
21
- ### Models Merged
22
-
23
- The following models were included in the merge:
24
- * [nbeerbower/Llama-3.1-Nemotron-lorablated-70B](https://huggingface.co/nbeerbower/Llama-3.1-Nemotron-lorablated-70B)
25
- * [abacusai/Dracarys2-Llama-3.1-70B-Instruct](https://huggingface.co/abacusai/Dracarys2-Llama-3.1-70B-Instruct)
26
-
27
- ### Configuration
28
-
29
- The following YAML configuration was used to produce this model:
30
-
31
- ```yaml
32
- base_model: deepseek-ai/DeepSeek-R1-Distill-Llama-70B
33
- merge_method: model_stock
34
- dtype: bfloat16
35
- models:
36
- - model: nbeerbower/Llama-3.1-Nemotron-lorablated-70B
37
- - model: abacusai/Dracarys2-Llama-3.1-70B-Instruct
38
- ```
 
1
  ---
 
 
 
 
2
  library_name: transformers
3
  tags:
4
  - mergekit
5
  - merge
6
+ ---