Junyang Lin

JustinLin610

https://justinlin610.github.io

AI & ML interests

Pretraining, NLP, CV, etc.

Recent Activity

authored a paper 3 days ago

START: Self-taught Reasoner with Tools

new activity 3 days ago

Qwen/QwQ-32B: Complex reasoning gets stuck in an infinite loop

liked a model 4 days ago

Qwen/QwQ-32B-GGUF

JustinLin610's activity

New activity in Qwen/QwQ-32B 3 days ago

Complex reasoning gets stuck in an infinite loop

#21 opened 4 days ago by

New activity in Qwen/Qwen2.5-Math-7B-Instruct 5 months ago

Independent evaluation results

#1 opened 5 months ago by

New activity in Qwen/Qwen2-VL-7B-Instruct 6 months ago

Have you deleted your GitHub page?

#10 opened 6 months ago by

New activity in Qwen/Qwen2-72B-Instruct 8 months ago

32B

#13 opened 9 months ago by

The sample code could not run...

#16 opened 8 months ago by

New activity in Qwen/CodeQwen1.5-7B-Chat 10 months ago

fine-tuning

#16 opened 11 months ago by

Maybe a silly question...

#18 opened 10 months ago by

This model is Awesome

#20 opened 10 months ago by

areumtecnologia

New activity in Qwen/Qwen1.5-110B-Chat-AWQ 11 months ago

Update tokenizer_config.json

#3 opened 11 months ago by

New activity in Qwen/Qwen1.5-MoE-A2.7B-Chat 11 months ago

How does this version's 28 GB GPU memory usage compare with the 14B model?

#7 opened 11 months ago by

New activity in Qwen/CodeQwen1.5-7B-Chat 11 months ago

Fine tuning this model with Proprietary Code

#6 opened 11 months ago by

What are the differences between this and Qwen/CodeQwen1.5-7B

#5 opened 11 months ago by

New activity in Qwen/Qwen1.5-7B-Chat 11 months ago

Adding Evaluation Results

#14 opened 11 months ago by

leaderboard-pr-bot

Is qwen1.5-7b-chat much faster at inference than qwen1.5-7b?

#9 opened 12 months ago by

New activity in Qwen/Qwen1.5-0.5B 11 months ago

tie_word_embeddings=true ?

#6 opened 11 months ago by

New activity in Qwen/Qwen1.5-72B-Chat 11 months ago

Why does the 72B model have a different vocab size compared with other models?

#1 opened about 1 year ago by

New activity in Qwen/CodeQwen1.5-7B-Chat-GGUF 11 months ago

Using llama.cpp server, responses always end with <|im_end|>

#2 opened 11 months ago by

New activity in Qwen/CodeQwen1.5-7B-Chat 11 months ago

The llm output is incomplete

#11 opened 11 months ago by

GGUF models

#1 opened 11 months ago by

is 14b coming?

#3 opened 11 months ago by