Lewis Tunstall PRO

lewtun

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

updated a collection about 12 hours ago

🧠 Reasoning datasets

liked a dataset about 12 hours ago

GeneralReasoning/GeneralThought-323K

updated a dataset about 12 hours ago

open-r1/verifiable-coding-problems-python_decontaminated

lewtun's activity

upvoted a paper 13 days ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 19 days ago • 28

upvoted a collection 22 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated about 12 hours ago • 90

upvoted a paper 24 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 67

upvoted a collection 26 days ago

OpenR1-Math

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 22 days ago • 7

upvoted an article 26 days ago

Article

Open R1: Update #2

By

and 6 others •

26 days ago

• 197

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 199

upvoted 5 articles about 1 month ago

Article

Smol but Mighty: Can Small Models Reason well? 🤔

Feb 4

• 9

Article

Open-R1: Update #1

By

and 7 others •

Feb 2

• 293

Article

Replicating DeepSeek R1 for Information Extraction

Jan 31

• 38

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 415

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 795

upvoted 3 articles about 2 months ago

Article

Gradio spaces are the perfect agent tools!

Jan 17

• 14

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

• 839

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Jan 16

• 71

upvoted a paper about 2 months ago

A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models

Paper • 2411.19477 • Published Nov 29, 2024 • 6

upvoted 5 papers 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Evaluating Large Language Models Trained on Code

Paper • 2107.03374 • Published Jul 7, 2021 • 8

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models

Paper • 1610.02424 • Published Oct 7, 2016 • 1

Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 7

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 54