Update README.md
README.md CHANGED
@@ -137,9 +137,9 @@ This example demonstrates how to load the model and tokenizer, prepare input, ge

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

-[Teuken-7B-
+[Teuken-7B-instruct-research-v0.4](https://huggingface.co/openGPT-X/Teuken-7B-instruct-research-v0.4) was pre-trained on 4 trillion tokens of data from publicly available sources.
 The pretraining data has a cutoff of September 2023.
-More information
+More information is available in our [preprint "Data Processing for the OpenGPT-X Model Family"](http://arxiv.org/abs/2410.08800).

### Instruction-Tuning Data
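For context, the hunk header above references the README's usage example ("load the model and tokenizer, prepare input, generate..."). That example is not part of this diff, but a minimal sketch of what such loading code typically looks like is given below, assuming the standard Hugging Face `transformers` API. The `trust_remote_code` flag and the generation parameters are assumptions based on common practice for custom Hub models, not details confirmed by this commit; consult the model card for the exact snippet.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model referenced by this commit's diff.
model_name = "openGPT-X/Teuken-7B-instruct-research-v0.4"

# trust_remote_code is an assumption: models with custom architectures on the
# Hub require it, but this diff does not show the README's actual arguments.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)

# Prepare input and generate, as the README example referenced above describes.
prompt = "How does a large language model work?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```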