Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -18,6 +18,7 @@ base_model:
 - AWQ 4bit version of [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat)
 - [Quantization code](https://docs.vllm.ai/en/latest/quantization/auto_awq.html)
 ## Eval AWQ version

 - AWQ 4bit version of [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat)
 - [Quantization code](https://docs.vllm.ai/en/latest/quantization/auto_awq.html)
+- This model [only fits to 1 gpu](https://huggingface.co/radm/Athene-V2-Chat-AWQ/discussions/2). Use [kosbu/Athene-V2-Chat-AWQ](kosbu/Athene-V2-Chat-AWQ) for multi-gpu support
 ## Eval AWQ version