PolyPythias
EleutherAI/pile-preshuffled-seeds
Note: Training data information for each seed.
EleutherAI/pythia-14m
EleutherAI/pythia-14m-seed1
EleutherAI/pythia-14m-seed2
EleutherAI/pythia-14m-seed3
EleutherAI/pythia-14m-seed4
EleutherAI/pythia-14m-seed5
EleutherAI/pythia-14m-seed6
EleutherAI/pythia-14m-seed7
EleutherAI/pythia-14m-seed8
EleutherAI/pythia-14m-seed9
EleutherAI/pythia-31m
EleutherAI/pythia-31m-seed1
EleutherAI/pythia-31m-seed2
EleutherAI/pythia-31m-seed3
EleutherAI/pythia-31m-seed4
EleutherAI/pythia-31m-seed5
EleutherAI/pythia-31m-seed6
EleutherAI/pythia-31m-seed7
EleutherAI/pythia-31m-seed8
EleutherAI/pythia-31m-seed9
EleutherAI/pythia-70m
EleutherAI/pythia-70m-seed1
EleutherAI/pythia-70m-seed2
EleutherAI/pythia-70m-seed3
EleutherAI/pythia-70m-seed4
EleutherAI/pythia-70m-seed5
EleutherAI/pythia-70m-seed6
EleutherAI/pythia-70m-seed7
EleutherAI/pythia-70m-seed8
EleutherAI/pythia-70m-seed9
EleutherAI/pythia-160m
EleutherAI/pythia-160m-seed1
EleutherAI/pythia-160m-seed2
EleutherAI/pythia-160m-seed3
EleutherAI/pythia-160m-seed4
EleutherAI/pythia-160m-seed5
EleutherAI/pythia-160m-seed6
EleutherAI/pythia-160m-seed7
EleutherAI/pythia-160m-seed8
EleutherAI/pythia-160m-seed9
EleutherAI/pythia-410m
EleutherAI/pythia-410m-seed1
EleutherAI/pythia-410m-seed2
EleutherAI/pythia-410m-seed3
EleutherAI/pythia-410m-seed4
EleutherAI/pythia-410m-seed5
EleutherAI/pythia-410m-seed6
EleutherAI/pythia-410m-seed7
EleutherAI/pythia-410m-seed8
EleutherAI/pythia-410m-seed9
EleutherAI/pythia-160m-data-seed1
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the data seed is different from pythia-160m ("seed 0").
EleutherAI/pythia-160m-data-seed2
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the data seed is different from pythia-160m ("seed 0").
EleutherAI/pythia-160m-data-seed3
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the data seed is different from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed1
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the weight initialization seed is different from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed2
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the weight initialization seed is different from pythia-160m ("seed 0").
EleutherAI/pythia-160m-weight-seed3
Note: Version where the data order and weight initialization seeds are decoupled. Here, only the weight initialization seed is different from pythia-160m ("seed 0").
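
The repository names in this collection follow a regular pattern, so they can be generated rather than typed out. The following is a minimal sketch of that pattern, not an official API: the helper names (`seed_repo`, `decoupled_repo`, `all_seed_repos`, `load_model`) are hypothetical, the naming rules are inferred only from the entries listed here, and `load_model` assumes the `transformers` package is installed and that these repositories load through `AutoModelForCausalLM`.

```python
# Model sizes that appear in this collection.
SIZES = ["14m", "31m", "70m", "160m", "410m"]


def seed_repo(size: str, seed: int) -> str:
    """Return the repo ID for a size and seed (0-9).

    Seed 0 is the unsuffixed model, which the collection notes
    refer to as "seed 0".
    """
    if size not in SIZES:
        raise ValueError(f"unknown size: {size!r}")
    if not 0 <= seed <= 9:
        raise ValueError("seed must be between 0 and 9")
    base = f"EleutherAI/pythia-{size}"
    return base if seed == 0 else f"{base}-seed{seed}"


def decoupled_repo(kind: str, seed: int) -> str:
    """Return the repo ID for a decoupled-seed 160m run.

    kind is "data" (only the data-order seed changes) or
    "weight" (only the weight-initialization seed changes).
    The collection lists seeds 1-3 for each kind.
    """
    if kind not in ("data", "weight"):
        raise ValueError("kind must be 'data' or 'weight'")
    if not 1 <= seed <= 3:
        raise ValueError("seed must be between 1 and 3")
    return f"EleutherAI/pythia-160m-{kind}-seed{seed}"


def all_seed_repos() -> list[str]:
    """List every size/seed combination in the main part of the collection."""
    return [seed_repo(size, seed) for size in SIZES for seed in range(10)]


def load_model(repo_id: str):
    """Download and load a model (assumes transformers is installed)."""
    from transformers import AutoModelForCausalLM  # imported lazily

    return AutoModelForCausalLM.from_pretrained(repo_id)
```

For example, `load_model(seed_repo("70m", 3))` would fetch EleutherAI/pythia-70m-seed3, and `decoupled_repo("weight", 1)` names the run that differs from pythia-160m only in its weight initialization seed.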
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
Paper • arXiv:2503.09543