Multi-subject Open-set Personalization in Video Generation Paper • 2501.06187 • Published 3 days ago • 4
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Paper • 2501.04698 • Published 5 days ago • 4
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Paper • 2501.05452 • Published 4 days ago • 4
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Paper • 2501.05510 • Published 4 days ago • 20
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published 3 days ago • 3
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 3 days ago • 17
OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints Paper • 2501.03841 • Published 6 days ago • 33
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published 3 days ago • 33
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Paper • 2501.05040 • Published 4 days ago • 11
Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model Paper • 2501.05122 • Published 4 days ago • 15
Enhancing Human-Like Responses in Large Language Models Paper • 2501.05032 • Published 4 days ago • 36
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper • 2501.04003 • Published 6 days ago • 20
On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis Paper • 2501.04377 • Published 5 days ago • 9
An Empirical Study of Autoregressive Pre-training from Videos Paper • 2501.05453 • Published 4 days ago • 30
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 4 days ago • 64
Multi-task retriever fine-tuning for domain-specific and efficient RAG Paper • 2501.04652 • Published 5 days ago • 8
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 5 days ago • 9