Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation Jun 20, 2024 • 12
Synthetic dataset generation techniques: generating custom sentence similarity data May 23, 2024 • 16
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20, 2024 • 74
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Language Model Aug 22, 2023 • 29
Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub Aug 2, 2023 • 1
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B Viewer • Updated 3 days ago • 250k • 57 • 5