NeoChen1024/Dolphin3.0-Llama3.1-8B-FP8_DYNAMIC

compressed-tensors

NeoChen1024 committed 6 days ago

Commit b8a26ce · verified · 1 Parent(s): e6148a1

Update README.md

Files changed (1)

README.md +2 -2

@@ -2,7 +2,7 @@
 license: llama3.1
 tags:
 - int8
-- w8a8
+- fp8
 datasets:
 - OpenCoder-LLM/opc-sft-stage1
 - OpenCoder-LLM/opc-sft-stage2
@@ -23,7 +23,7 @@ base_model:
 - cognitivecomputations/Dolphin3.0-Llama3.1-8B
 ---
-# W8A8 Quant of Dolphin 3.0 Llama 3.1 8B 🐬
+# FP8 Quant of Dolphin 3.0 Llama 3.1 8B 🐬
 Quantization script: <https://github.com/NeoChen1024/scripts/blob/master/llm-compressor-quantize.py>
 Curated and trained by [Eric Hartford](https://huggingface.co/ehartford), [Ben Gitter](https://huggingface.co/bigstorm), [BlouseJury](https://huggingface.co/BlouseJury) and [Cognitive Computations](https://huggingface.co/cognitivecomputations)