Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 6 days ago • 70
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 6 items • Updated 12 days ago • 8
🧪 FineWeb v1 data experiments Collection Ablation models trained for our data experiments. • 22 items • Updated Jun 12 • 3
📀 Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 34
AraDICE Collection AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs • 12 items • Updated 12 days ago • 4
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 12 days ago • 119
view article Article Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation By Jaward • 24 days ago • 5
view article Article Comparing Open-source and Proprietary LLMs in Medical AI By mpimentel • Oct 3 • 16
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20 • 39
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 • Nov 21 • 34
Hermes: A Large Language Model Framework on the Journey to Autonomous Networks Paper • 2411.06490 • Published Nov 10 • 6
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13 • 98
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion Paper • 2411.04928 • Published Nov 7 • 48