Triangle104
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -7,13 +7,12 @@ library_name: transformers
|
|
7 |
tags:
|
8 |
- mergekit
|
9 |
- merge
|
10 |
-
|
11 |
---
|
12 |
# merge
|
13 |
|
14 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
15 |
|
16 |
-
## Merge Details
|
17 |
### Merge Method
|
18 |
|
19 |
This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) as a base.
|
@@ -47,4 +46,4 @@ parameters:
|
|
47 |
normalize: false
|
48 |
int8_mask: true
|
49 |
dtype: float16
|
50 |
-
```
|
|
|
7 |
tags:
|
8 |
- mergekit
|
9 |
- merge
|
10 |
+
license: apache-2.0
|
11 |
---
|
12 |
# merge
|
13 |
|
14 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
15 |
|
|
|
16 |
### Merge Method
|
17 |
|
18 |
This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using [deepseek-ai/DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) as a base.
|
|
|
46 |
normalize: false
|
47 |
int8_mask: true
|
48 |
dtype: float16
|
49 |
+
```
|