chansung park PRO

chansung

Recent Activity

updated a model about 7 hours ago
llama-duo/llama3-3b-summarize-gpt4o-128k
published a model about 8 hours ago
llama-duo/llama3-3b-summarize-gpt4o-128k
updated a model about 8 hours ago
llama-duo/llama3-3b-coding-gpt4o-100k2
Organizations

Notebooks-explorers
various keras sd deployment
LLMs
Gradio-Themes-Party
Hugging Face Fellows
Alpaca LoRA
Webhooks Explorers (BETA)
Deploy HF TF ViTs
Blog-explorers
Personal Coding Assistant
ZeroGPU Explorers
Social Post Explorers
Top Contributors: Dataset Downloads
llama-duo
klcsp
ExpanLLM
Adaptive Summarization
ThinkCoder

Posts 20

A simple guide to the GRPO recipe in Open-R1, which is built on top of TRL.

I think the FastAPI wrapper around vLLM with WeightSyncWorker is a pretty cool feature. Also, there are many predefined reward functions available out of the box!

Articles 5

Distilling from Dialogues: Finding Meaning in LLM Interactions