Korek Rybens's picture

Korek Rybens

Rybens

·

AI & ML interests

None yet

Recent Activity

reacted to burtenshaw's post with 🤗 3 days ago

I’m super excited to work with @mlabonne to build the first practical example in the reasoning course. 🔗 https://huggingface.co./reasoning-course Here's a quick walk through of the first drop of material that works toward the use case: - a fundamental introduction to reinforcement learning. Answering questions like, ‘what is a reward?’ and ‘how do we create an environment for a language model?’ - Then it focuses on Deepseek R1 by walking through the paper and highlighting key aspects. This is an old school way to learn ML topics, but it always works. - Next, it takes to you Transformers Reinforcement Learning and demonstrates potential reward functions you could use. This is cool because it uses Marimo notebooks to visualise the reward. - Finally, Maxime walks us through a real training notebook that uses GRPO to reduce generation length. I’m really into this because it works and Maxime took the time to validate it share assets and logging from his own runs for you to compare with. Maxime’s work and notebooks have been a major part of the open source community over the last few years. I, like everyone, have learnt so much from them.

liked a Space 6 days ago

fffiloni/consistent-character

liked a model 14 days ago

featherless-ai/Qwerky-72B-Preview

View all activity

Organizations

models 8

Rybens/gemma-2-9b-it-Q4_0-GGUF

Text Generation • Updated Sep 13, 2024 • 11

Rybens/Hermes-3-Llama-3.1-8B-Q3_K_S-GGUF

Updated Aug 15, 2024 • 8 • 1

Rybens/RYS-gemma-2-2b-it-Q8_0-GGUF

Updated Aug 14, 2024 • 5

Rybens/Hermes-2-Pro-Mistral-7B-Imatrix-GGUF

Updated Apr 17, 2024 • 158

Rybens/Monarch-7B-dpo-mix-7k

Text Generation • Updated Mar 8, 2024 • 16 • 1

Rybens/finetuning-sn6-1-GGUF

Updated Feb 11, 2024 • 36

Rybens/truthful_dpo_tomgrc_fusionnet_7bx2_moe_13b_GGUF

Updated Jan 23, 2024 • 431 • 6

Rybens/FusionNet_7Bx2_MoE_14B_gguf

Updated Jan 20, 2024 • 30

datasets

None public yet