Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
8
2
10
Sam Paech
PRO
sam-paech
Follow
TW3Partners's profile picture
justinxzhao's profile picture
Prasad23's profile picture
17 followers
·
4 following
https://eqbench.com
sam_paech
sam-paech
AI & ML interests
Emotional intelligence, alignment, benchmarking
Articles
MMLU-Pro-NoMath
Jul 11
•
3
Organizations
spaces
1
Configuration error
18
💗
EQ Bench
models
None public yet
datasets
10
Sort:Â Recently updated
sam-paech/numina-math-heuristic-10k
Viewer
•
Updated
about 22 hours ago
•
10k
•
5
•
2
sam-paech/numina-math-heuristic-test
Viewer
•
Updated
2 days ago
•
1.12k
•
5
•
1
sam-paech/prm800k-validator-model-comparison
Viewer
•
Updated
3 days ago
•
1.32k
sam-paech/prm800k-sonnet3.5-validator-test
Viewer
•
Updated
7 days ago
•
200
sam-paech/mmlu-pro-nomath-sml-exclusion
Viewer
•
Updated
Jul 26
•
1.96k
•
2
sam-paech/mmlu-pro-nomath
Viewer
•
Updated
Jul 11
•
7.04k
•
3
•
1
sam-paech/mmlu-pro-nomath-sml
Viewer
•
Updated
Jul 11
•
2.71k
•
100
•
6
sam-paech/mmlu-pro-irt-1-0
Viewer
•
Updated
Jul 5
•
2.13k
•
2
•
4
sam-paech/magi_hard_1_0
Viewer
•
Updated
Apr 7
•
3.2k
•
10
•
2
sam-paech/magi_irt_1_0
Viewer
•
Updated
Apr 7
•
2.15k
•
2