Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
automated-research-group
's Collections
Models
Eval Datasets
Eval Datasets
updated
Jan 11
Upvote
-
openai/openai_humaneval
Viewer
•
Updated
Jan 4
•
164
•
84.6k
•
255
google-research-datasets/mbpp
Viewer
•
Updated
Jan 4
•
1.4k
•
70.2k
•
148
ybisk/piqa
Updated
Jan 18
•
79.4k
•
86
lighteval/siqa
Viewer
•
Updated
Oct 7, 2023
•
35.4k
•
1.57k
•
4
Rowan/hellaswag
Viewer
•
Updated
Sep 28, 2023
•
60k
•
111k
•
99
allenai/winogrande
Updated
Jan 18
•
85.4k
•
58
allenai/ai2_arc
Viewer
•
Updated
Dec 21, 2023
•
7.79k
•
111k
•
160
allenai/openbookqa
Viewer
•
Updated
Jan 4
•
11.9k
•
48.8k
•
81
tau/commonsense_qa
Viewer
•
Updated
Jan 4
•
12.1k
•
24.2k
•
84
google-research-datasets/natural_questions
Viewer
•
Updated
Mar 11
•
26.3k
•
5.9k
•
89
mandarjoshi/trivia_qa
Viewer
•
Updated
Jan 5
•
848k
•
46.3k
•
101
rajpurkar/squad
Viewer
•
Updated
Mar 4
•
98.2k
•
67.8k
•
280
allenai/quac
Updated
Jan 18
•
574
•
29
google/boolq
Viewer
•
Updated
Jan 22
•
12.7k
•
6.86k
•
70
openai/gsm8k
Viewer
•
Updated
Jan 4
•
17.6k
•
168k
•
458
hendrycks/competition_math
Updated
Jun 8, 2023
•
8.76k
•
136
cais/mmlu
Viewer
•
Updated
Mar 8
•
231k
•
113k
•
348
maveriq/bigbenchhard
Viewer
•
Updated
Sep 29, 2023
•
6.51k
•
1.65k
•
20
baber/agieval
Updated
Oct 26, 2023
•
83
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections