Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Meg Tong
meg-tong
Follow
https://www.megtong.com
meg-tong
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
authored
a paper
about 1 year ago
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
authored
a paper
about 1 year ago
Steering Llama 2 via Contrastive Activation Addition
View all activity
Organizations
None yet
meg-tong
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a dataset
over 1 year ago
meg-tong/sycophancy-eval
Preview
•
Updated
Oct 23, 2023
•
54
•
3