Shanghaoran Quan

quanshr

AI & ML interests

Large Language Model

quanshr's activity

updated a Space 2 days ago

Long Context Icl

New activity in Yyk040316/long-context-icl 2 days ago

add results

#1 opened 2 days ago by

upvoted a paper 5 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 7 days ago • 59

liked a model 6 days ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 581k • 1.77k

upvoted a paper 14 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 14 days ago • 88

updated a dataset 22 days ago

Qwen/CodeElo

Viewer • Updated 22 days ago • 408 • 264 • 15

New activity in Qwen/CodeElo 22 days ago

Update README.md

#5 opened 22 days ago by

Delete test.json

#3 opened 22 days ago by

Create test.json

#4 opened 22 days ago by

commented a paper 22 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

commented a paper 23 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

authored a paper 24 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

liked a dataset 24 days ago

Qwen/CodeElo

Viewer • Updated 22 days ago • 408 • 264 • 15

upvoted a paper 25 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

upvoted a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

liked a model 3 months ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated 16 days ago • 173k • 1.52k

upvoted a collection 3 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 264

updated a dataset 3 months ago

quanshr/LonGen

Viewer • Updated Nov 7, 2024 • 240 • 46 • 1

liked a dataset 3 months ago

quanshr/LonGen

Viewer • Updated Nov 7, 2024 • 240 • 46 • 1