Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK Nov 21 • 34
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 3 days ago • 195
Synthetic Data Generator Collection A collection of tools and datasets related to no-code synthetic data generation. • 13 items • Updated 8 days ago • 4
Smol but mighty Collection A collection of smol but mighty models • 10 items • Updated 6 days ago • 2
Gradio WebRTC Cookbook ⚡️ Collection Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated 15 days ago • 9
Lora Land - 27 High-Quality LoRA Adapters Collection 27 Fine-tuned LoRA Adapters using Mistral-7B. Try them here: https://predibase.com/lora-land • 27 items • Updated Apr 26 • 4
Self-Instruct: Aligning Language Model with Self Generated Instructions Paper • 2212.10560 • Published Dec 20, 2022 • 9
Article 🐺🐦⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • 21 days ago • 70
Open Image Preferences Collection Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated 7 days ago • 5
Article Let’s make a generation of amazing image generation models By burtenshaw • 29 days ago • 33
Article Zero to Hero with the TRL learning link bomb 💣 By burtenshaw • about 1 month ago • 4
Datasets built with ⚗️ distilabel Collection This collection contains some datasets generated and/or labelled using https://github.com/argilla-io/distilabel • 8 items • Updated 14 days ago • 11
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub • 3 items • Updated Nov 10 • 10
Synthetic Dataset Creation Collection Spaces focused on generating synthetic datasets • 6 items • Updated 27 days ago • 9