lightonai/large-3-shot-seqlen-3x-max-lt-8k-all-better-hparam-instruct-100k-mcqa-heldout-zs-unused-tokens Updated 4 days ago • 13
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published 12 days ago • 25
knowledgator/gliclass-modern-large-v2.0-init Zero-Shot Classification • Updated 15 days ago • 671 • 8
knowledgator/gliclass-modern-base-v2.0-init Zero-Shot Classification • Updated 3 days ago • 2.48k • 19
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 14 days ago • 389