Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 3 days ago • 271
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 7 days ago • 261
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 7 days ago • 62
Diffusion Adversarial Post-Training for One-Step Video Generation Paper • 2501.08316 • Published 15 days ago • 32
VideoAuteur: Towards Long Narrative Video Generation Paper • 2501.06173 • Published 19 days ago • 31 • 3
Masked Autoencoders Enable Efficient Knowledge Distillers Paper • 2208.12256 • Published Aug 25, 2022
Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification Paper • 2210.12843 • Published Oct 23, 2022
CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection Paper • 2301.00785 • Published Jan 2, 2023