Submitted by Hush-cd 77 xVerify: Efficient Answer Verifier for Reasoning Model Evaluations · 9 authors 2
Submitted by xufangzhi 50 Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning · 9 authors 2
Submitted by zhoutianyi 36 How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients · 4 authors 2
Submitted by wbhu-tc 14 NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors · 5 authors 2
Submitted by IanMagnusson 13 DataDecide: How to Predict Best Pretraining Data with Small Experiments · 13 authors 2
Submitted by LXT 13 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer · 7 authors 3
Submitted by davanstrien 10 DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning · 15 authors 6
Submitted by Daniel0724 10 SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL · 7 authors 1
Submitted by pierlj 10 RealHarm: A Collection of Real-World Language Model Application Failures · 4 authors 3
Submitted by SempraETY 10 Efficient Generative Model Training via Embedded Representation Warmup · 4 authors 2
Submitted by CoreloneH 10 D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation · 5 authors 2
Submitted by jrd971000 9 Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning · 18 authors 2
Submitted by weqweasdas 9 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce · 11 authors 3
Submitted by yueqis 9 VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge · 6 authors 2
Submitted by simocimolato 6 AI-University: An LLM-based platform for instructional alignment to scientific classrooms · 8 authors 2
Submitted by SYZhang0805 4 Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion · 8 authors 2
Submitted by HenghuiDing 4 PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild · 36 authors 2
Submitted by Hoar012 3 Multimodal Long Video Modeling Based on Temporal Dynamic Context · 4 authors 2
Submitted by sukannya 2 LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews · 5 authors 2
Submitted by gigant 2 Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure · 3 authors 2
Submitted by ziqipang 1 Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception · 3 authors 2
Submitted by ElmanGhazaei - Change State Space Models for Remote Sensing Change Detection · 2 authors 2