Elie Bakouch's picture

Elie Bakouch

eliebak

·

AI & ML interests

Training LLM's @ 🤗

Recent Activity

updated a Space about 3 hours ago

open-r1/README

upvoted an article about 4 hours ago

Open-R1: a fully open reproduction of DeepSeek-R1

published a Space about 5 hours ago

open-r1/README

View all activity

Articles

Open-R1: a fully open reproduction of DeepSeek-R1

about 3 hours ago

Diving into MiniMax01 405B MoE

SmolVLM - small yet mighty Vision Language Model

SmolLM - blazingly fast and remarkably powerful

Organizations

eliebak's activity

updated a Space about 3 hours ago

README

upvoted an article about 4 hours ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

about 3 hours ago

• 21

published a Space about 5 hours ago

README

updated a Space 1 day ago

README

updated 7 models 2 days ago

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-6e5-32k

Text Generation • Updated 2 days ago • 15

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-6e5-32k

Text Generation • Updated 2 days ago • 15

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-linear-6e-5-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 9

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-3e5-32k-500k-rope

Text Generation • Updated 2 days ago • 12

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-cosine-1e-4-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 12

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-linear-1e-4-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 7

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-cosine-6e-5-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 15

published 6 models 2 days ago

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-linear-6e-5-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 9

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-linear-1e-4-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 7

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-cosine-1e-4-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 12

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-cosine-6e-5-optim-adamw_torch-4k

Text Generation • Updated 2 days ago • 15

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-6e5-32k

Text Generation • Updated 2 days ago • 15

HuggingFaceTB/Qwen-Math-1.5B-Bespoke-sys-ep3-3e5-32k-500k-rope

Text Generation • Updated 2 days ago • 12