ChengpengLi's picture

2 8 2

ChengpengLi

ChengpengLi

·

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

authored a paper 3 days ago

START: Self-taught Reasoner with Tools

commented on a paper 3 days ago

START: Self-taught Reasoner with Tools

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

View all activity

Organizations

None yet

ChengpengLi's activity

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 3 days ago • 77

upvoted 2 papers about 2 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 349

upvoted 2 collections 6 months ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated Jan 14 • 77

Qwen2-Math

Math-specific model series based on Qwen2 • 8 items • Updated Nov 28, 2024 • 51

upvoted 2 papers 8 months ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 16

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21