arxiv:2501.12948
zhuqihao
zqh11
AI & ML interests
None yet
Recent Activity
Authored a paper (4 days ago)
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Liked a model (8 days ago)
deepseek-ai/DeepSeek-R1-Zero
Authored a paper (5 months ago)
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Organizations
models
None public yet
datasets
None public yet