DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning • Paper • 2501.12948 • Published 5 days ago • 225
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published 6 days ago • 45
PaSa: An LLM Agent for Comprehensive Academic Paper Search • Paper • 2501.10120 • Published 11 days ago • 38
From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning • Paper • 2411.03817 • Published Nov 6, 2024 • 1
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning • Paper • 2406.11896 • Published Jun 14, 2024 • 20