NeoChen1024
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
license: llama3.1
|
3 |
tags:
|
4 |
- int8
|
5 |
-
-
|
6 |
datasets:
|
7 |
- OpenCoder-LLM/opc-sft-stage1
|
8 |
- OpenCoder-LLM/opc-sft-stage2
|
@@ -23,7 +23,7 @@ base_model:
|
|
23 |
- cognitivecomputations/Dolphin3.0-Llama3.1-8B
|
24 |
---
|
25 |
|
26 |
-
#
|
27 |
Quantization script: <https://github.com/NeoChen1024/scripts/blob/master/llm-compressor-quantize.py>
|
28 |
|
29 |
Curated and trained by [Eric Hartford](https://huggingface.co/ehartford), [Ben Gitter](https://huggingface.co/bigstorm), [BlouseJury](https://huggingface.co/BlouseJury) and [Cognitive Computations](https://huggingface.co/cognitivecomputations)
|
|
|
2 |
license: llama3.1
|
3 |
tags:
|
4 |
- int8
|
5 |
+
- fp8
|
6 |
datasets:
|
7 |
- OpenCoder-LLM/opc-sft-stage1
|
8 |
- OpenCoder-LLM/opc-sft-stage2
|
|
|
23 |
- cognitivecomputations/Dolphin3.0-Llama3.1-8B
|
24 |
---
|
25 |
|
26 |
+
# FP8 Quant of Dolphin 3.0 Llama 3.1 8B 🐬
|
27 |
Quantization script: <https://github.com/NeoChen1024/scripts/blob/master/llm-compressor-quantize.py>
|
28 |
|
29 |
Curated and trained by [Eric Hartford](https://huggingface.co/ehartford), [Ben Gitter](https://huggingface.co/bigstorm), [BlouseJury](https://huggingface.co/BlouseJury) and [Cognitive Computations](https://huggingface.co/cognitivecomputations)
|