GPT-Generated Unified Format (GGUF) Collection ease of reading β’ 13 items β’ Updated about 10 hours ago β’ 7
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. β’ 4 items β’ Updated 5 days ago β’ 54
Llama3-8B-1.58 Collection A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! β’ 3 items β’ Updated Sep 14 β’ 12
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M β’ 8 items β’ Updated 9 days ago β’ 163
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper β’ 2402.14905 β’ Published Feb 22 β’ 126
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 β’ 8 items β’ Updated 7 days ago β’ 93
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper β’ 2410.02884 β’ Published Oct 3 β’ 48
D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. β’ 56 items β’ Updated 3 days ago β’ 3
GGUF Image Model Quants Collection List of GGUF quants for text to image base models. β’ 9 items β’ Updated 15 days ago β’ 9
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs Paper β’ 2404.05719 β’ Published Apr 8 β’ 80
view article Article MedEmbed: Fine-Tuned Embedding Models for Medical / ClinicalΒ IR By abhinand β’ 24 days ago β’ 30
view article Article Advanced Flux Dreambooth LoRA Training with 𧨠diffusers By linoyts ⒠23 days ago ⒠27
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 8 items β’ Updated 9 days ago β’ 86