arxiv:2406.07933
Chris (Yuhao) Liu
chrisliu298
AI & ML interests
Alignment and control
Organizations
Papers
2
models
10
chrisliu298/tofu_forget10_classifier
Text Classification
•
Updated
•
39
chrisliu298/tofu_forget05_classifier
Text Classification
•
Updated
•
60
chrisliu298/tofu_forget01_classifier
Text Classification
•
Updated
•
6
chrisliu298/mmlu-economics_classifier
Text Classification
•
Updated
•
7
chrisliu298/mmlu-physics_classifier
Text Classification
•
Updated
•
7
chrisliu298/mmlu-law_classifier
Text Classification
•
Updated
•
10
chrisliu298/synthetic_wmdp_classifier
Text Classification
•
Updated
•
17
chrisliu298/wmdp_classifier
Text Classification
•
Updated
•
66
chrisliu298/bbc_news_classifier
Text Classification
•
Updated
•
18
chrisliu298/hp_book_classifier
Text Classification
•
Updated
•
10
datasets
7
chrisliu298/magpie-air-standard
Viewer
•
Updated
•
98k
chrisliu298/magpie-pro-standard
Viewer
•
Updated
•
98k
chrisliu298/magpie-pro-llama3.1-standard
Viewer
•
Updated
•
98k
chrisliu298/magpie-ultra-standard
Viewer
•
Updated
•
50k
chrisliu298/wildguard-adv-standard
Viewer
•
Updated
•
8.96k
chrisliu298/offsetbias-standard
Viewer
•
Updated
•
8.5k
chrisliu298/helpsteer2-standard
Viewer
•
Updated
•
7.22k
•
5