Sam Paech's picture

8 2 10

Sam Paech PRO

sam-paech

·

https://eqbench.com

AI & ML interests

Emotional intelligence, alignment, benchmarking

Articles

MMLU-Pro-NoMath

Organizations

sam-paech's activity

New activity in SkunkworksAI/reasoning-0.01 5 days ago

Which model was used?

#5 opened 5 days ago by

New activity in google/gemma-2-27b-it about 2 months ago

Hallucinations, misspellings etc. Something seems broken?

#10 opened 3 months ago by

New activity in open-llm-leaderboard/open_llm_leaderboard 3 months ago

Running MMLU-Pro with Eleuther LM-Eval

#814 opened 3 months ago by

📑 Raw results link goes to old leaderboard results dataset

#808 opened 3 months ago by

New activity in senseable/WestLake-7B-v2 5 months ago

EQ-Bench score

#8 opened 8 months ago by

New activity in HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 5 months ago

Prompt format

#4 opened 5 months ago by

New activity in CausalLM/34b-beta 6 months ago

Regarding Concerns about MMLU Scores

#5 opened 6 months ago by deleted

New activity in froggeric/WestLake-10.7B-v2 6 months ago

Details of all the merge attempts until this one

#1 opened 6 months ago by