EleutherAI

non-profit

Verified

https://eleuther.ai

AIEleuther

EleutherAI

Activity Feed Request to join this org

AI & ML interests

Large language models, scaling laws, AI Alignment, democratization of DL

Recent Activity

davidoj01 published a model 2 days ago

EleutherAI/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-encouraged

luciaquirke updated a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-100B

davidoj01 published a model 2 days ago

EleutherAI/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-vuln

View all activity

EleutherAI's activity

davidoj01

published a model 2 days ago

EleutherAI/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-encouraged

Updated 2 days ago

luciaquirke

updated a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-100B

Viewer • Updated 2 days ago • 88M • 78

davidoj01

published 2 models 2 days ago

EleutherAI/unsloth-phi-4-Instruct-LORA-Open-R1-Code-GRPO-b2-as4-lr2en5-vuln

Updated 2 days ago

EleutherAI/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated 2 days ago

luciaquirke

published a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-100B

Viewer • Updated 2 days ago • 88M • 78

luciaquirke

updated a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-20B

Viewer • Updated 2 days ago • 17.6M • 46

luciaquirke

published a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-20B

Viewer • Updated 2 days ago • 17.6M • 46

luciaquirke

updated a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-10B

Viewer • Updated 2 days ago • 8.8M • 76

luciaquirke

published a dataset 2 days ago

EleutherAI/SmolLM2-1.7B-stage-4-10B

Viewer • Updated 2 days ago • 8.8M • 76

Kyle1668

updated a dataset 3 days ago

EleutherAI/filtering-pretraining-mix

Preview • Updated 3 days ago • 71 • 1

oskarvanderwal

authored 4 papers 10 days ago

Inseq: An Interpretability Toolkit for Sequence Generation Models

Paper • 2302.13942 • Published Feb 27, 2023 • 1

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Paper • 2304.01373 • Published Apr 3, 2023 • 9

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model

Paper • 2310.12611 • Published Oct 19, 2023

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 31

pietrolesci

authored a paper 10 days ago

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs

Paper • 2503.09543 • Published Mar 12

Skylion007

authored a paper about 1 month ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12 • 68

hyunwoongko

authored a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

pietrolesci

authored a paper about 2 months ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9

bzantium

authored a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

avi-skowron

authored a paper about 2 months ago

Beyond Release: Access Considerations for Generative AI Systems

Paper • 2502.16701 • Published Feb 23 • 16