Jina Reader-LM Collection Convert HTML content to LLM-friendly Markdown/JSON content • 3 items • Updated Jan 16 • 10
jina-embeddings-v3: Multilingual Embeddings With Task LoRA Paper • 2409.10173 • Published Sep 16, 2024 • 31
jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated Sep 19, 2024 • 22
LettuceDetect: A Hallucination Detection Framework for RAG Applications Paper • 2502.17125 • Published 13 days ago • 7
Hallucination detection Collection Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated 4 days ago • 14
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 6 days ago • 57
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • Jan 15 • 43
view article Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 20 days ago • 93
NeoBERT Collection NeoBERT is a next-generation encoder model for English text representation, pre-trained from scratch on the RefinedWeb dataset. • 1 item • Updated 10 days ago • 2
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated 17 days ago • 10
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Paper • 2502.02481 • Published Feb 4 • 10
GemmaX2 Collection GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated about 1 month ago • 20
MMTEB Collection A collection of items telated the the MMTEB release • 2 items • Updated 16 days ago • 1