arxiv:2501.01257
Bofei Gao
KbsdJames
AI & ML interests
LLM
Recent Activity
authored
a paper
6 days ago
CodeElo: Benchmarking Competition-level Code Generation of LLMs with
Human-comparable Elo Ratings
upvoted
a
paper
about 1 month ago
ProcessBench: Identifying Process Errors in Mathematical Reasoning
Organizations
None yet