ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 7 days ago β’ 93
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 7 days ago β’ 71
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper β’ 2408.03314 β’ Published Aug 6 β’ 51
Gradio WebRTC Cookbook β‘οΈ Collection Collection of real-time voice and video demos built with gradio-webrtc custom component β’ 8 items β’ Updated 15 days ago β’ 9
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published 21 days ago β’ 118
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 23 items β’ Updated 12 days ago β’ 119
Flux.1 Tools Collection FLUX.1 Tools, a suite of models designed to add control and steerability to base text-to-image models FLUX.1 β’ 6 items β’ Updated Nov 22 β’ 13
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct β’ 5 items β’ Updated 3 days ago β’ 30
view article Article Letβs make a generation of amazing image generation models By burtenshaw β’ 30 days ago β’ 33
MagicQuill: An Intelligent Interactive Image Editing System Paper β’ 2411.09703 β’ Published Nov 14 β’ 57
OpenCulture Collection A multilingual dataset of public domain books and newspapers. β’ 27 items β’ Updated Nov 6 β’ 121
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ Nov 13 β’ 98
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 15 items β’ Updated 3 days ago β’ 195
CLEAR: Character Unlearning in Textual and Visual Modalities Paper β’ 2410.18057 β’ Published Oct 23 β’ 200
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions Paper β’ 2410.17655 β’ Published Oct 23 β’ 5