Eval Datasets - an automated-research-group Collection

automated-research-group's Collections

Eval Datasets

updated Jan 11, 2024

openai/openai_humaneval

Viewer • Updated Jan 4, 2024 • 164 • 99.4k • 284
google-research-datasets/mbpp

Viewer • Updated Jan 4, 2024 • 1.4k • 94.8k • 166
ybisk/piqa

Updated Jan 18, 2024 • 334k • 89
lighteval/siqa

Viewer • Updated Oct 7, 2023 • 35.4k • 3.84k • 5
Rowan/hellaswag

Viewer • Updated Sep 28, 2023 • 60k • 353k • 110
allenai/winogrande

Updated Jan 18, 2024 • 301k • 61
allenai/ai2_arc

Viewer • Updated Dec 21, 2023 • 7.79k • 358k • 173
allenai/openbookqa

Viewer • Updated Jan 4, 2024 • 11.9k • 269k • 88
tau/commonsense_qa

Viewer • Updated Jan 4, 2024 • 12.1k • 247k • 91
google-research-datasets/natural_questions

Viewer • Updated Mar 11, 2024 • 26.3k • 8.04k • 95
mandarjoshi/trivia_qa

Viewer • Updated Jan 5, 2024 • 848k • 58.4k • 120
rajpurkar/squad

Viewer • Updated Mar 4, 2024 • 98.2k • 61.8k • 293
allenai/quac

Updated Jan 18, 2024 • 526 • 30
google/boolq

Viewer • Updated Jan 22, 2024 • 12.7k • 9.1k • 73
openai/gsm8k

Viewer • Updated Jan 4, 2024 • 17.6k • 361k • 625
hendrycks/competition_math

Updated Jun 8, 2023 • 141
cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 164k • 408
maveriq/bigbenchhard

Viewer • Updated Sep 29, 2023 • 6.51k • 2.21k • 25
baber/agieval

Updated Oct 26, 2023 • 486 • 5
