arxiv:2501.06173
Junfei Xiao
lambertxiao
AI & ML interests
None yet
Recent Activity
upvoted a collection about 21 hours ago
Qwen2.5-VL
upvoted a paper 5 days ago
Kimi k1.5: Scaling Reinforcement Learning with LLMs
upvoted a paper 5 days ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Organizations
None yet
models
None public yet
datasets
None public yet