Re-Eval-MMLU
community
AI & ML interests
None defined yet.
Recent Activity
yuchenlin authored a paper about 2 months ago: On Memorization of Large Language Models in Logical Reasoning
DongfuJiang authored a paper 2 months ago: MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
yuchenlin authored a paper 6 months ago: WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
Team members
2
models
None public yet
datasets
1
Re-Eval-MMLU/open_mmlu • Viewer • Updated Apr 26 • 28.1k • 30