DolphinLabeled Datasets Collection Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated 29 days ago • 11
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 9 days ago • 313
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 9 days ago • 97
Helium-1 Collection Kyutai's Helium-1 2B model, outperforming other state-of-the-art small models. • 4 items • Updated 17 days ago • 1
J.O.S.I.E. v6.0 Collection Trained on open-source and private custom DPO/ORPO datasets • 8 items • Updated 24 days ago • 2
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 7 items • Updated about 1 month ago • 60
Llama 3.2 Collection Meta goes small with Llama 3.2: both text-only 1B and 3B, and the 11B Vision models. • 15 items • Updated Dec 17, 2024 • 11
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination Paper • 2411.03823 • Published Nov 6, 2024 • 45
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 65
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated Dec 22, 2024 • 212
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper • 2404.16710 • Published Apr 25, 2024 • 77