- Reasoning Language Models: A Blueprint
  Paper • 2501.11223 • Published • 25
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 80
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 204
Peng Zhang
irvingfish
AI & ML interests
None yet
Recent Activity
Updated a collection 4 days ago: Reasoning
Liked a model 5 months ago: Qwen/Qwen2-VL-7B-Instruct
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet