Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 29 days ago • 49
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 71
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 138
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset Paper • 2309.04662 • Published Sep 9, 2023 • 23