Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 3 days ago • 61 • 2
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Paper • 2503.04606 • Published 3 days ago • 7 • 1
Dedicated Feedback and Edit Models Empower Inference-Time Scaling for Open-Ended General-Domain Tasks Paper • 2503.04378 • Published 3 days ago • 6 • 3
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 4 days ago • 18 • 4
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids Paper • 2502.20396 • Published 10 days ago • 12 • 2
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Paper • 2502.20811 • Published 9 days ago • 2 • 2
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers Paper • 2502.20545 • Published 10 days ago • 20 • 2
Mobius: Text to Seamless Looping Video Generation via Latent Shift Paper • 2502.20307 • Published 10 days ago • 16 • 2
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper • 2502.20126 • Published 10 days ago • 19 • 2
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Paper • 2502.19735 • Published 10 days ago • 7 • 2
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper • 2502.19400 • Published 11 days ago • 42 • 2
An Overview of Large Language Models for Statisticians Paper • 2502.17814 • Published 12 days ago • 4 • 2
WebGames: Challenging General-Purpose Web-Browsing AI Agents Paper • 2502.18356 • Published 12 days ago • 11 • 2
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 12 days ago • 67 • 5
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published 13 days ago • 11 • 3
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published 13 days ago • 72 • 4