1 10 37

Honglin Guo

KYLN24

KYLN24

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

authored a paper 7 days ago

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

upvoted an article 7 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

View all activity

Organizations

KYLN24's activity

authored a paper 5 days ago

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Paper • 2503.00784 • Published 7 days ago • 9

authored a paper 7 days ago

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024

upvoted an article 7 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

Jan 31

• 42

liked 2 models 9 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 231k • 1.03k

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 6 days ago • 4.09k • 126

authored a paper 10 days ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published 11 days ago • 9

upvoted a paper 10 days ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published 11 days ago • 9

commented a paper 10 days ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published 11 days ago • 9 •

upvoted a paper 12 days ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 13 days ago • 67

liked a model 3 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 258k • • 1.7k

upvoted a collection 3 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 11 days ago • 552

upvoted an article 4 months ago

Article

Saving Memory Using Padding-Free Transformer Layers during Finetuning

•

Jun 11, 2024

• 16

authored a paper 9 months ago

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Paper • 2406.04151 • Published Jun 6, 2024 • 20

updated a dataset 9 months ago

AgentGym/AgentTraj-L

Viewer • Updated Jun 6, 2024 • 14.5k • 152 • 6

upvoted a collection 9 months ago

InternLM2

Collection

7 items • Updated 26 days ago • 9

liked 2 models 10 months ago

JackFram/llama-68m

Text Generation • Updated May 23, 2024 • 552k • 26

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8, 2024 • 10.2k • 310

liked a dataset 11 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 317k • 2.02k

liked 2 models 11 months ago

openbmb/MiniCPM-V-2

Visual Question Answering • Updated Jan 15 • 5.96k • 449

microsoft/Phi-3-mini-128k-instruct

Text Generation • Updated 7 days ago • 119k • • 1.64k