Update README.md
README.md CHANGED
@@ -3,46 +3,16 @@ license: apache-2.0
 tags:
 - moe
 - merge
-- mergekit
-- lazymergekit
-- cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser
-- berkeley-nest/Starling-LM-7B-alpha
 ---

 # megatron_v1

-megatron_v1 is a Mixture of Experts (MoE) made …
-
-* [berkeley-nest/Starling-LM-7B-alpha](https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha)
-
-## 🧩 Configuration
-
-```yaml
-base_model: openchat/openchat-3.5-0106
-gate_mode: hidden
-dtype: bfloat16
-experts:
-  - source_model: cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser
-    positive_prompts:
-    - "Mathematics"
-    - "Physics"
-    negative_prompts:
-    - "History"
-    - "Philosophy"
-  - source_model: berkeley-nest/Starling-LM-7B-alpha
-    positive_prompts:
-    - "retrieval"
-    - "life science"
-    negative_prompts:
-    - "Education"
-    - "Law"
-```
+megatron_v1 is a Mixture of Experts (MoE) made of Mistral models.
+

 ## 💻 Usage

 ```python
-!pip install -qU transformers bitsandbytes accelerate
-
 from transformers import AutoTokenizer
 import transformers
 import torch
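
For reference on the removed 🧩 Configuration block: that YAML is a mergekit MoE recipe, naming the shared base model (openchat/openchat-3.5-0106), the hidden-state gating mode, the dtype, and the positive/negative routing prompts for each expert. A minimal sketch of how such a recipe is typically turned into a merged model with mergekit follows; the file name, output directory, and exact command line are assumptions, not taken from this card, and may vary across mergekit versions.

```python
# Hypothetical sketch, not part of the original card: build the MoE from a
# mergekit recipe like the removed Configuration YAML. Assumes that YAML has
# been saved as config.yaml; the output directory name is arbitrary.
!pip install -qU mergekit

# mergekit's MoE entry point reads the recipe and writes the merged model.
!mergekit-moe config.yaml megatron_v1
```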
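
The 💻 Usage snippet is cut off at the hunk boundary in both versions, ending at the imports. A hedged continuation in the usual transformers text-generation style is sketched below; the Hub repo id is a placeholder and the prompt and generation settings are illustrative, not taken from the card.

```python
# Hedged continuation of the truncated Usage snippet. "<username>/megatron_v1"
# is a placeholder repo id, not the card's actual Hub path.
from transformers import AutoTokenizer
import transformers
import torch

model_id = "<username>/megatron_v1"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Standard text-generation pipeline; bfloat16 matches the dtype in the
# removed merge recipe.
pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
)

# Build a chat-formatted prompt and generate a reply.
messages = [{"role": "user", "content": "What is a Mixture of Experts model?"}]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7)
print(outputs[0]["generated_text"])
```

If the removed `bitsandbytes`/`accelerate` install line is kept, the same pipeline can instead load the weights in 4-bit by passing `{"load_in_4bit": True, "device_map": "auto"}` in `model_kwargs`.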