view article Article Private Synthetic Data Generation Made Easy: Out-of-the-Box with Docker, Argilla & Ollama By daqc and 1 other • 4 days ago • 4
view article Article Agentic RAG Stack (3/5) - Generate responses using a SmolLM By davidberenstein1957 • Feb 6 • 6
view article Article Agentic RAG Stack (2/5) - Augment retrieval results by reranking using Sentence Transformers By davidberenstein1957 • Feb 5 • 9
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • Jan 27 • 18
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • Jan 20 • 37
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • Jan 3 • 36
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • Dec 30, 2024 • 32
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 115
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community Dec 9, 2024 • 54
view article Article Let’s make a generation of amazing image generation models By burtenshaw and 4 others • Nov 26, 2024 • 33
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 and 1 other • Nov 21, 2024 • 35
view article Article How to build a custom text classifier without days of human labeling By sdiazlor and 4 others • Oct 17, 2024 • 55
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw and 9 others • Oct 16, 2024 • 18
view article Article To what extent are we responsible for our content and how to create safer Spaces? By davidberenstein1957 • Aug 30, 2024 • 5