view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb β’ 27 days ago β’ 127
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published 7 days ago β’ 103
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 6 days ago β’ 92
view article Article Building a Local Vector Database Index with Annoy and Sentence Transformers By theeseus-ai β’ 20 days ago β’ 3
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ 21 days ago β’ 70
view article Article Accelerating Embedding & Reranking Models on AMD Using Infinity By michaelfeil β’ 22 days ago β’ 4
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper β’ 2411.12946 β’ Published Nov 20 β’ 20
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 β’ Nov 21 β’ 34
Drowning in Documents: Consequences of Scaling Reranker Inference Paper β’ 2411.11767 β’ Published Nov 18 β’ 17
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka β’ Nov 19 β’ 96
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ Nov 13 β’ 98
Training with Prompts Collection See the Training with Prompts documentation for more details: https://sbert.net/examples/training/prompts/README.html β’ 5 items β’ Updated Nov 7 β’ 3
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs By Pclanglais β’ Mar 20 β’ 18
Model2Vec base models Collection These are the Minishlab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers β’ 7 items β’ Updated 11 days ago β’ 8
POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers β’ 3 items β’ Updated Oct 30 β’ 6