Iterative Value Function Optimization for Guided Decoding Paper • 2503.02368 • Published 6 days ago • 14
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs Paper • 2503.02846 • Published 5 days ago • 18
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids Paper • 2502.20396 • Published 10 days ago • 12
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping Paper • 2502.20900 • Published 10 days ago • 7
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published 15 days ago • 24
DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks Paper • 2502.17157 • Published 14 days ago • 51
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 17 days ago • 94
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published 19 days ago • 37
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published 23 days ago • 51
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 22 days ago • 52
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 20 days ago • 76
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published 19 days ago • 36