neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • Updated Oct 17 • 11.8k • 14
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated 6 days ago • 34 • 1
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated 6 days ago • 2.09k • 3