IterPref: Focal Preference Learning for Code Generation via Iterative Debugging Paper • 2503.02783 • Published 5 days ago • 5
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published 12 days ago • 32
Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization Paper • 2502.14496 • Published 17 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 341
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Paper • 2501.08197 • Published Jan 14 • 8
Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance Paper • 2406.15330 • Published Jun 21, 2024
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training Paper • 2411.14318 • Published Nov 21, 2024
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published Jan 8 • 15
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Paper • 2412.21059 • Published Dec 30, 2024 • 18
Efficiently Serving LLM Reasoning Programs with Certaindex Paper • 2412.20993 • Published Dec 30, 2024 • 36
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search Paper • 2412.18319 • Published Dec 24, 2024 • 37
LLMs Do Not Think Step-by-step In Implicit Reasoning Paper • 2411.15862 • Published Nov 24, 2024 • 10
AIGS: Generating Science from AI-Powered Automated Falsification Paper • 2411.11910 • Published Nov 17, 2024
Patience Is The Key to Large Language Model Reasoning Paper • 2411.13082 • Published Nov 20, 2024 • 7
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks Paper • 2410.04422 • Published Oct 6, 2024 • 7
"Paraphrasing The Original Text" Makes High Accuracy Long-Context QA Paper • 2312.11193 • Published Dec 18, 2023
An Intelligent Remote Sensing Image Quality Inspection System Paper • 2307.11965 • Published Jul 22, 2023