NeoChen1024 committed on
Commit b8a26ce · verified · 1 Parent(s): e6148a1

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -2,7 +2,7 @@
  license: llama3.1
  tags:
  - int8
- - w8a8
+ - fp8
  datasets:
  - OpenCoder-LLM/opc-sft-stage1
  - OpenCoder-LLM/opc-sft-stage2
@@ -23,7 +23,7 @@ base_model:
  - cognitivecomputations/Dolphin3.0-Llama3.1-8B
  ---
 
- # W8A8 Quant of Dolphin 3.0 Llama 3.1 8B 🐬
+ # FP8 Quant of Dolphin 3.0 Llama 3.1 8B 🐬
  Quantization script: <https://github.com/NeoChen1024/scripts/blob/master/llm-compressor-quantize.py>
 
  Curated and trained by [Eric Hartford](https://huggingface.co/ehartford), [Ben Gitter](https://huggingface.co/bigstorm), [BlouseJury](https://huggingface.co/BlouseJury) and [Cognitive Computations](https://huggingface.co/cognitivecomputations)