57 16 119

chansung park PRO

chansung

AI & ML interests

None yet

Recent Activity

updated a model about 7 hours ago

llama-duo/llama3-3b-summarize-gpt4o-128k

published a model about 8 hours ago

llama-duo/llama3-3b-summarize-gpt4o-128k

updated a model about 8 hours ago

llama-duo/llama3-3b-coding-gpt4o-100k2

View all activity

Organizations

Posts 20

Post

3291

simple guide on the recipe for GRPO on Open-R1 which is built on top of TRL

I think FastAPI wrapper of vLLM with WeightSyncWorker is pretty cool feature. Also, we have many predefined reward functions out of the box!

View all Posts

Articles 5

Article

Explore papers with auto generated Q&As

pinned

Runtime error

142

Llama2 With Gradio Chat

Zero2Story

Co Write With Llama2

✍

pinned

Runtime error

LLMs As Chatbot

🦙

No application file

Adaptsum

📊

models 89

chansung/Qwen2.5-1.5B-CRL-Code-GRPO-exp1

Updated 1 day ago

chansung/Qwen2.5-1.5B-Coder-CRL-GRPO-exp1

Updated 1 day ago

chansung/Qwen2.5-1.5B-Instruct-CRL-Open-R1-Code-GRPO-exp1

Updated 1 day ago

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO-exp1

Text Generation • Updated 1 day ago • 2

chansung/Qwen2.5-1.5B-CRL-Open-R1-Code-GRPO

Updated 1 day ago • 1

chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated 2 days ago

chansung/Qwen2.5-1.5B-Open-R1-GRPO

Updated 7 days ago

chansung/Qwen-2.5-7B-Simple-RL

Text Generation • Updated 14 days ago • 3

chansung/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated 14 days ago

chansung/llama3-8b-lora-thinkcoder-sft

Text Generation • Updated 27 days ago • 18

datasets 58

chansung/verifiable-coding-problems-python

Viewer • Updated 3 days ago • 949 • 37

chansung/openthoughts-coding-llama-factory

Viewer • Updated 20 days ago • 19.9k • 63

chansung/cqa_synth_ds

Viewer • Updated Jun 3, 2024 • 111k • 138

chansung/coding_synth_ds

Viewer • Updated Jun 3, 2024 • 116k • 142 • 1

chansung/classification_synth_ds

Viewer • Updated Jun 2, 2024 • 92.3k • 100

chansung/classification_synth_ds2

Viewer • Updated Jun 1, 2024 • 424 • 50

chansung/aaa3

Updated Jun 1, 2024 • 4

chansung/aaa2

Updated Jun 1, 2024 • 5

chansung/synth_summarize_dataset

Viewer • Updated May 31, 2024 • 880k • 140

chansung/new_summarize_synth_ds3

Viewer • Updated May 31, 2024 • 301k • 198

chansung park PRO

AI & ML interests

Recent Activity

Organizations

Posts 20

Articles 5

Distilling from Dialogues: Finding Meaning in LLM Interactions

Papers 2

spaces 43 Sort: Recently updated

Paper Q&A

Llama2 With Gradio Chat

Zero2Story

Co Write With Llama2

LLMs As Chatbot

Adaptsum

models 89 Sort: Recently updated

datasets 58 Sort: Recently updated

spaces 43

models 89

datasets 58