Hugging Face
17.5 TFLOPS
Korek Rybens
Rybens
7 followers · 24 following
AI & ML interests
None yet
Recent Activity
reacted to burtenshaw's post with 🤗 3 days ago
I’m super excited to work with @mlabonne to build the first practical example in the reasoning course. 🔗 https://huggingface.co./reasoning-course

Here's a quick walkthrough of the first drop of material that works toward the use case:
- A fundamental introduction to reinforcement learning, answering questions like ‘what is a reward?’ and ‘how do we create an environment for a language model?’
- Then it focuses on DeepSeek R1 by walking through the paper and highlighting key aspects. This is an old-school way to learn ML topics, but it always works.
- Next, it takes you to Transformers Reinforcement Learning and demonstrates potential reward functions you could use. This is cool because it uses Marimo notebooks to visualise the reward.
- Finally, Maxime walks us through a real training notebook that uses GRPO to reduce generation length. I’m really into this because it works, and Maxime took the time to validate it and share assets and logging from his own runs for you to compare with.

Maxime’s work and notebooks have been a major part of the open source community over the last few years. I, like everyone, have learnt so much from them.
liked a Space 6 days ago: fffiloni/consistent-character
liked a model 14 days ago: featherless-ai/Qwerky-72B-Preview
Organizations
models 8
Rybens/gemma-2-9b-it-Q4_0-GGUF • Text Generation • Updated Sep 13, 2024 • 11
Rybens/Hermes-3-Llama-3.1-8B-Q3_K_S-GGUF • Updated Aug 15, 2024 • 8 • 1
Rybens/RYS-gemma-2-2b-it-Q8_0-GGUF • Updated Aug 14, 2024 • 5
Rybens/Hermes-2-Pro-Mistral-7B-Imatrix-GGUF • Updated Apr 17, 2024 • 158
Rybens/Monarch-7B-dpo-mix-7k • Text Generation • Updated Mar 8, 2024 • 16 • 1
Rybens/finetuning-sn6-1-GGUF • Updated Feb 11, 2024 • 36
Rybens/truthful_dpo_tomgrc_fusionnet_7bx2_moe_13b_GGUF • Updated Jan 23, 2024 • 431 • 6
Rybens/FusionNet_7Bx2_MoE_14B_gguf • Updated Jan 20, 2024 • 30
datasets
None public yet