Beijing Qihoo Technology Co., Ltd. (北京奇虎科技有限公司)

company

qihoo360's activity

yuhanwuuu

authored a paper about 21 hours ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 5 days ago • 11

lincharliesun

authored 2 papers about 23 hours ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published 21 days ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published 5 days ago • 11

lincharliesun

authored 3 papers about 24 hours ago

Expand VSR Benchmark for VLLM to Expertize in Spatial Rules

Paper • 2412.18224 • Published Dec 24, 2024

LongAttn: Selecting Long-context Training Data via Token-level Attention

Paper • 2502.16860 • Published 15 days ago

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Paper • 2502.20790 • Published 11 days ago

lincharliesun

updated a model 1 day ago

qihoo360/TinyR1-32B-Preview

Text Generation • Updated 1 day ago • 5.11k • 316

zhs12

in qihoo360/Light-R1-32B 4 days ago

missing train scripts

#2 opened 5 days ago by

zhs12

updated 2 datasets 6 days ago

qihoo360/Light-R1-DPOData

Viewer • Updated 6 days ago • 2.97k • 113 • 10

qihoo360/Light-R1-SFTData

Updated 6 days ago • 232 • 12

zhs12

updated a model 6 days ago

qihoo360/Light-R1-32B

Updated 6 days ago • 397 • 52

zhs12

in qihoo360/Light-R1-DPOData 6 days ago

Request for Wechat Group

#2 opened 6 days ago by

lincharliesun

updated a collection 6 days ago

TinyR1

1 item • Updated 6 days ago • 2

zhs12

updated a collection 7 days ago

Light-R1

Surpassing R1-Distill from Scratch* with 70k Math Data through Curriculum SFT & DPO • 3 items • Updated 7 days ago • 9

zhs12

published 2 datasets 7 days ago

qihoo360/Light-R1-DPOData

Viewer • Updated 6 days ago • 2.97k • 113 • 10

qihoo360/Light-R1-SFTData

Updated 6 days ago • 232 • 12

zhs12

published a model 7 days ago

qihoo360/Light-R1-32B

Updated 6 days ago • 397 • 52

yuhanwuuu

updated a model 8 days ago

qihoo360/TinyR1-32B-Preview

Text Generation • Updated 1 day ago • 5.11k • 316