Hugging Face
Sam Paech
PRO
sam-paech
17 followers · 4 following
https://eqbench.com
sam_paech
sam-paech
AI & ML interests
Emotional intelligence, alignment, benchmarking
Articles
MMLU-Pro-NoMath
Jul 11 • 3
Organizations
sam-paech's activity
New activity in
SkunkworksAI/reasoning-0.01
5 days ago
Which model was used?
#5 opened 5 days ago by
sam-paech
New activity in
google/gemma-2-27b-it
about 2 months ago
Hallucinations, misspellings etc. Something seems broken?
21
#10 opened 3 months ago by
sam-paech
New activity in
open-llm-leaderboard/open_llm_leaderboard
3 months ago
Running MMLU-Pro with Eleuther LM-Eval
2
#814 opened 3 months ago by
sam-paech
Raw results link goes to old leaderboard results dataset
2
#808 opened 3 months ago by
sam-paech
New activity in
senseable/WestLake-7B-v2
5 months ago
EQ-Bench score
12
#8 opened 8 months ago by
sam-paech
New activity in
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
5 months ago
Prompt format
6
#4 opened 5 months ago by
sam-paech
New activity in
CausalLM/34b-beta
6 months ago
Regarding Concerns about MMLU Scores
71
#5 opened 6 months ago by
deleted
New activity in
froggeric/WestLake-10.7B-v2
6 months ago
Details of all the merge attempts until this one
6
#1 opened 6 months ago by
froggeric