arxiv:2410.18451
Chris (Yuhao) Liu
chrisliu298
AI & ML interests
Alignment and control
Recent Activity
liked
a dataset
25 days ago
argilla/magpie-ultra-v1.0
new activity
25 days ago
argilla/magpie-ultra-v1.0:Question About Dataset Content
new activity
26 days ago
Skywork/Skywork-Reward-Gemma-2-27B:Reward model returns 0 scores for all cases
Organizations
Papers
3
models
10
chrisliu298/synthetic_wmdp_classifier_llama_guard_3_1b_v2
Updated
•
2
chrisliu298/synthetic_mmlu_physics_classifier_llama_3.2_1b
Updated
•
2
chrisliu298/synthetic_mmlu_law_classifier_llama_3.2_1b
Updated
•
3
chrisliu298/synthetic_mmlu_economics_classifier_llama_3.2_1b
Updated
•
5
chrisliu298/synthetic_wmdp_classifier_llama_guard_3_1b
Updated
•
2
chrisliu298/tofu_forget10_classifier
Text Classification
•
Updated
•
9
chrisliu298/tofu_forget05_classifier
Text Classification
•
Updated
•
11
chrisliu298/tofu_forget01_classifier
Text Classification
•
Updated
•
69
chrisliu298/bbc_news_classifier
Text Classification
•
Updated
•
7
chrisliu298/hp_book_classifier
Text Classification
•
Updated
•
6
datasets
9
chrisliu298/Skywork-Reward-Preference-80K-v0.1-Contaminated
Viewer
•
Updated
•
4.96k
•
35
chrisliu298/wmdp_formatted
Viewer
•
Updated
•
3.97k
•
32
chrisliu298/magpie-air-standard
Viewer
•
Updated
•
98k
•
36
chrisliu298/magpie-pro-standard
Viewer
•
Updated
•
98k
•
34
chrisliu298/magpie-pro-llama3.1-standard
Viewer
•
Updated
•
98k
•
38
chrisliu298/magpie-ultra-standard
Viewer
•
Updated
•
50k
•
39
chrisliu298/wildguard-adv-standard
Viewer
•
Updated
•
8.96k
•
36
chrisliu298/offsetbias-standard
Viewer
•
Updated
•
8.5k
•
36
chrisliu298/helpsteer2-standard
Viewer
•
Updated
•
7.22k
•
37