arxiv:2501.12948
DejianYang
DejianYang
AI & ML interests
None yet
Recent Activity
authored
a paper
4 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
authored
a paper
5 months ago
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for
Reinforcement Learning and Monte-Carlo Tree Search
authored
a paper
7 months ago
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Organizations
models
None public yet
datasets
None public yet