Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 69
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 28
Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub Aug 2, 2023 • 1
knowledgator/modern-gliner-bi-large-v1.0 Token Classification • Updated about 23 hours ago • 22 • 11
knowledgator/modern-gliner-bi-base-v1.0 Token Classification • Updated about 23 hours ago • 13 • 11
data-is-better-together/fineweb-c-progress Viewer • Updated about 5 hours ago • 668 • 556 • 2