Personalized Graph-Based Retrieval for Large Language Models Paper • 2501.02157 • Published 9 days ago • 26
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Paper • 2501.03931 • Published 6 days ago • 13
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Paper • 2501.04001 • Published 6 days ago • 38
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 4 days ago • 56
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 68
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper • 2501.04682 • Published 5 days ago • 74
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 5 days ago • 197
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design Paper • 2412.14590 • Published 25 days ago • 13
SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation Paper • 2412.13649 • Published 26 days ago • 20
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 24 days ago • 38
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning Paper • 2412.16849 • Published 22 days ago • 8
NILE: Internal Consistency Alignment in Large Language Models Paper • 2412.16686 • Published 23 days ago • 8
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published 25 days ago • 12
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published 21 days ago • 21
Revisiting In-Context Learning with Long Context Language Models Paper • 2412.16926 • Published 22 days ago • 29
Large Motion Video Autoencoding with Cross-modal Video VAE Paper • 2412.17805 • Published 21 days ago • 24
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Paper • 2412.15797 • Published 24 days ago • 17
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published 20 days ago • 35