CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom Paper • 2503.01836 • Published 6 days ago • 10
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 4 days ago • 18
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 7 days ago • 55
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation Paper • 2503.01370 • Published 6 days ago • 8
CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale Paper • 2502.16645 • Published 14 days ago • 21
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published 12 days ago • 25
Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 27 days ago • 46
Article Wanx AI: AlibabaCloud Best Video Generation Model By LLMhacker • 13 days ago • 6
GHOST 2.0: generative high-fidelity one shot transfer of heads Paper • 2502.18417 • Published 12 days ago • 62
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published 13 days ago • 72
K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs Paper • 2502.18461 • Published 12 days ago • 15
KV-Edit: Training-Free Image Editing for Precise Background Preservation Paper • 2502.17363 • Published 13 days ago • 32
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 17 days ago • 91
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published 13 days ago • 11
YOLOv12: Attention-Centric Real-Time Object Detectors Paper • 2502.12524 • Published 19 days ago • 10
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 17 days ago • 128
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data Paper • 2502.14397 • Published 17 days ago • 38
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Paper • 2502.14846 • Published 17 days ago • 13