Shirong Ma
msr2000
Recent Activity
updated a model 1 day ago: deepseek-ai/DeepSeek-R1-Zero
updated a model 1 day ago: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
updated a model 1 day ago: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
msr2000's activity
Qwen 32B distilled model: when the length exceeds 8k, a certain proportion of predictions are garbled, with <think><think><think><think><think><think> appearing
5
#44 opened 3 days ago by daniellibin
Upload 1735146950945.jpg
3
#11 opened about 1 month ago by NZEEMSZY
Create Dondasse
#15 opened 28 days ago by Dondasse
Water and forests
2
#16 opened 28 days ago by Dondasse
Update README.md with vLLM Support
1
#8 opened about 1 month ago by simon-mo
Update README.md with vLLM Support
#28 opened about 1 month ago by simon-mo
fail to run the example
8
#4 opened 9 months ago by Leymore
keyError: 'sdpa'
1
#3 opened 9 months ago by fengzi258
vllm support
7
#2 opened 9 months ago by Sihangli
KV Cache for compress_kv or key-value states
6
#1 opened 9 months ago by House-99