Hoagy Cunningham's picture

Hoagy Cunningham

HoagyC

HoagyC

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

authored a paper over 1 year ago

Sparse Autoencoders Find Highly Interpretable Features in Language Models

View all activity

Organizations

None yet

HoagyC's activity

authored a paper 2 days ago

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published 5 days ago • 7

authored a paper over 1 year ago

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Paper • 2309.08600 • Published Sep 15, 2023 • 13