Collections
Collections including paper arxiv:2502.12524
-
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems
Paper • 2408.09332 • Published
YOLOv10: Real-Time End-to-End Object Detection
Paper • 2405.14458 • Published • 6
End-to-End Object Detection with Transformers
Paper • 2005.12872 • Published • 5
YOLOv12: Attention-Centric Real-Time Object Detectors
Paper • 2502.12524 • Published • 10
-
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages
Paper • 2410.16153 • Published • 44
AutoTrain: No-code training for state-of-the-art models
Paper • 2410.15735 • Published • 59
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
Paper • 2410.12787 • Published • 31
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks
Paper • 2410.01744 • Published • 26
-
FAN: Fourier Analysis Networks
Paper • 2410.02675 • Published • 26
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 84
Scalable-Softmax Is Superior for Attention
Paper • 2501.19399 • Published • 21
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
Paper • 2502.09509 • Published • 7