60 26 91

Chujie Zheng

chujiezheng

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

liked a model 4 days ago

Qwen/QwQ-32B-GGUF

liked a model 4 days ago

Qwen/QwQ-32B

View all activity

Organizations

chujiezheng's activity

upvoted a paper 3 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 3 days ago • 73

liked 3 models 4 days ago

liked a Space 4 days ago

263

QwQ 32B Demo

🌖

Generate text responses based on user input

liked a model 10 days ago

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • Updated 2 days ago • 146k • 36

upvoted a paper 10 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

authored 2 papers 16 days ago

Aligning Instruction Tuning with Pre-training

Paper • 2501.09368 • Published Jan 16

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 17 days ago • 94

commented a paper 17 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 17 days ago • 94 •

upvoted a paper 17 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 17 days ago • 94

liked 5 models about 1 month ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 4 days ago • 3.46M • 634

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 2 days ago • 270k • 360

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated Jan 29 • 54.7k • 276

Qwen/Qwen2.5-7B-Instruct-1M

Text Generation • Updated Jan 29 • 333k • 259

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 22 days ago • 907k • 252

liked 3 Spaces about 1 month ago

125

Qwen2.5 VL 72B Instruct

💻

Interact with Qwen2.5-VL-Chat model using text and files

Qwen2.5-1M Demo

💻

Upload documents to answer questions

554

Qwen2.5 Max Demo

🐢

Chat with an AI language model

upvoted a paper about 2 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63