Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a Space about 13 hours ago

open-r1/open-r1-eval-leaderboard

updated a Space about 13 hours ago

open-r1/open-r1-eval-leaderboard

updated a Space about 13 hours ago

open-r1/open-r1-eval-leaderboard

View all activity

Organizations

Posts 8

Post

2457

Introducing OlympicCoder: a series of open reasoning models that can solve olympiad-level programming problems 🧑‍💻

- 7B open-r1/OlympicCoder-7B
- 32B open-r1/OlympicCoder-32B

We find that OlympicCoder models outperform Claude 3.7 Sonnet, as well as others over 100x larger 💪

Together with the models, we are releasing:

📊CodeForces-CoTs: new dataset of code problems from the most popular competitive coding platform, with R1 traces in C++ and Python open-r1/codeforces-cots

🏆 IOI'2024: a new benchmark of VERY hard programming problems where even frontier models struggle to match human performance open-r1/ioi

For links to the models and datasets, check out our latest progress report from Open R1: https://huggingface.co./blog/open-r1/update-3

Articles 31

Article

85

The NLP Course is becoming the LLM Course!

View all Articles

Collections 4

Papers 10

arxiv:2504.05299

arxiv:2503.07572

arxiv:2502.02737

arxiv:2310.16944

spaces 21

Python Interpreter

Chuck Norris Jokes

Fetch a random Chuck Norris joke

OpenGPT

Explain physics concepts like Feynman

Donut Docvqa

Argilla Space Template

No application file

Chip

models 275

lewtun/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated 10 days ago • 13

lewtun/does-deepspeed-still-work-sft

Text Generation • Updated 10 days ago • 3

lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-Llama

Text Generation • Updated 10 days ago • 17

lewtun/Qwen2.5-1.5B-SFT-Capybara-No-Packing

Text Generation • Updated 10 days ago • 4

lewtun/Llama-3.2-1B-SFT-Capybara-No-Packing-ChatML

Text Generation • Updated 10 days ago • 10

lewtun/Qwen2.5-7B-Instruct-GRPO

lewtun/Qwen2.5-Math-1.5B-Instruct-GRPO

lewtun/dummy-config-test

Text Generation • Updated Feb 20 • 3

lewtun/Qwen2.5-1.5B-Open-R1-Code-GRPO

lewtun/smollm2-distill-default-chat-template

Text Generation • Updated Feb 17 • 1

datasets 71

lewtun/details_Qwen__Qwen2.5-Coder-3B-Instruct

Viewer • Updated Feb 20 • 33 • 46

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 6 • 1k • 60

lewtun/details_open-thoughts__OpenThinker-7B

Viewer • Updated Feb 5 • 597 • 122

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Qwen-7B

Viewer • Updated Feb 5 • 597 • 122

lewtun/details_meta-llama__Llama-3.2-3B-Instruct

Viewer • Updated Feb 5 • 1.74k • 76

lewtun/details_deepseek-ai__DeepSeek-R1-Distill-Llama-8B

Viewer • Updated Feb 5 • 598 • 165

lewtun/details_meta-llama__Llama-3.1-8B-Instruct

Viewer • Updated Feb 5 • 597 • 73

lewtun/details_Qwen__Qwen2.5-1.5B-Instruct

Viewer • Updated Feb 5 • 2.25k • 67

lewtun/details_Qwen__Qwen2.5-0.5B-Instruct

Viewer • Updated Feb 5 • 898 • 58

lewtun/details_meta-llama__Llama-3.2-1B-Instruct

Viewer • Updated Feb 5 • 898 • 76