Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 4 days ago • 37
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 2 days ago • 36
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding Paper • 2412.10302 • Published about 1 month ago • 11
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 26 days ago • 96
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 26 days ago • 48
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 25 days ago • 123
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 5 days ago • 78
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated about 1 month ago • 50
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated about 1 month ago • 82
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated about 15 hours ago • 30