FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17 • 60
tomaarsen/span-marker-roberta-large-fewnerd-fine-super Token Classification • Updated Mar 22 • 1.81k • 10