64 18 17

Binyuan Hui

huybery

https://huybery.github.io

AI & ML interests

Large Language Models, Code Generation, Semantic Parsing

Recent Activity

upvoted a paper 5 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

new activity 22 days ago

Qwen/CodeElo:Update README.md

new activity 22 days ago

Qwen/CodeElo:Create test.json

View all activity

Articles

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

Jun 18, 2024

• 43

Organizations

huybery's activity

upvoted a paper 5 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 7 days ago • 59

upvoted a paper 25 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 25 days ago • 48

upvoted a paper about 1 month ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 343

upvoted 2 papers about 2 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 47

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

upvoted a paper 4 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 61

upvoted a collection 4 months ago

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 264

upvoted 2 papers 4 months ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 76

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

upvoted a paper 7 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

upvoted a collection 8 months ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 355

upvoted a collection 10 months ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Nov 28, 2024 • 205

upvoted 6 papers over 1 year ago

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs

Paper • 2305.03111 • Published May 4, 2023 • 9