Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 9 days ago • 22
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper • 2501.11425 • Published 7 days ago • 77
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Paper • 2501.11733 • Published 7 days ago • 25
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Paper • 2501.12273 • Published 6 days ago • 14
The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Paper • 2501.09653 • Published 11 days ago • 12
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 11 days ago • 46
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published 11 days ago • 35
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper • 2501.10120 • Published 10 days ago • 38
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 13 days ago • 49
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Paper • 2501.08292 • Published 13 days ago • 16
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Paper • 2501.08197 • Published 13 days ago • 7
Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Paper • 2501.08248 • Published 13 days ago • 1
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 20 days ago • 48
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 24 days ago • 87
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 21 days ago • 66
Multi-task retriever fine-tuning for domain-specific and efficient RAG Paper • 2501.04652 • Published 19 days ago • 10
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 19 days ago • 13