77 21 203

Junyang Lin

JustinLin610

https://justinlin610.github.io

AI & ML interests

Pretraining, NLP, CV, etc.

Recent Activity

authored a paper 3 days ago

START: Self-taught Reasoner with Tools

new activity 3 days ago

Qwen/QwQ-32B:复杂推理进入死循环

liked a model 4 days ago

Qwen/QwQ-32B-GGUF

View all activity

Organizations

JustinLin610's activity

authored a paper 3 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 3 days ago • 73

New activity in Qwen/QwQ-32B 3 days ago

复杂推理进入死循环

#21 opened 4 days ago by

frankgxy

liked 3 models 4 days ago

liked a Space 4 days ago

263

QwQ 32B Demo

🌖

Generate text responses based on user input

authored a paper 9 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 10 days ago • 26

updated 2 collections 12 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 12 days ago • 552

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 12 days ago • 105

upvoted a collection 12 days ago

Qwen2.5-1M

Collection

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 12 days ago • 105

authored a paper 17 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 18 days ago • 157

liked a Space about 1 month ago

554

Qwen2.5 Max Demo

🐢

Chat with an AI language model

authored a paper about 1 month ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 63

liked a model about 1 month ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 4 days ago • 3.46M • 634

published a model about 1 month ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 23 days ago • 907k • 252

authored a paper about 1 month ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 31

liked a model about 1 month ago

Qwen/Qwen2.5-14B-Instruct-1M

Text Generation • Updated Jan 29 • 54.7k • 276

authored 2 papers about 2 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

upvoted a paper about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92