OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models • Paper • 2502.01061 • Published 1 day ago • 76 • 4
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning • Paper • 2502.01100 • Published 1 day ago • 8 • 1
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles • Paper • 2502.01081 • Published 1 day ago • 7 • 1
Improving Transformer World Models for Data-Efficient RL • Paper • 2502.01591 • Published 1 day ago • 7 • 1
MatAnyone: Stable Video Matting with Consistent Memory Propagation • Paper • 2501.14677 • Published 11 days ago • 19 • 2
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming • Paper • 2501.18837 • Published 5 days ago • 7 • 5
Trading Inference-Time Compute for Adversarial Robustness • Paper • 2501.18841 • Published 5 days ago • 3 • 2
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs • Paper • 2501.18585 • Published 5 days ago • 46 • 7
Large Language Models Think Too Fast To Explore Effectively • Paper • 2501.18009 • Published 6 days ago • 22 • 3
Atla Selene Mini: A General Purpose Evaluation Model • Paper • 2501.17195 • Published 8 days ago • 30 • 4
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation • Paper • 2501.17749 • Published 6 days ago • 12 • 2
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models • Paper • 2501.16937 • Published 7 days ago • 4 • 2
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training • Paper • 2501.17161 • Published 7 days ago • 95 • 6
Optimizing Large Language Model Training Using FP4 Quantization • Paper • 2501.17116 • Published 7 days ago • 31 • 2
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling • Paper • 2501.16975 • Published 7 days ago • 21 • 4
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression • Paper • 2501.16372 • Published 13 days ago • 7 • 2
iFormer: Integrating ConvNet and Transformer for Mobile Application • Paper • 2501.15369 • Published 10 days ago • 10 • 2