arxiv:2507.01352
Chris (Yuhao) Liu
chrisliu298
AI & ML interests
Alignment
Organizations
models 9
chrisliu298/synthetic_wmdp_classifier_llama_guard_3_1b_v2
1B • Updated
chrisliu298/synthetic_mmlu_physics_classifier_llama_3.2_1b
1B • Updated
chrisliu298/synthetic_mmlu_law_classifier_llama_3.2_1b
1B • Updated
chrisliu298/synthetic_mmlu_economics_classifier_llama_3.2_1b
1B • Updated • 2
chrisliu298/tofu_forget10_classifier
Text Classification • 0.1B • Updated • 2
chrisliu298/tofu_forget05_classifier
Text Classification • 0.1B • Updated • 1
chrisliu298/tofu_forget01_classifier
Text Classification • 0.1B • Updated • 140
chrisliu298/bbc_news_classifier
Text Classification • 0.1B • Updated • 5
chrisliu298/hp_book_classifier
Text Classification • 0.1B • Updated • 2
datasets 9
chrisliu298/Skywork-Reward-Preference-80K-v0.1-Contaminated
Viewer • Updated • 4.96k • 9
chrisliu298/wmdp_formatted
Viewer • Updated • 3.97k • 15
chrisliu298/magpie-air-standard
Viewer • Updated • 98k • 10
chrisliu298/magpie-pro-standard
Viewer • Updated • 98k • 28
chrisliu298/magpie-pro-llama3.1-standard
Viewer • Updated • 98k • 10
chrisliu298/magpie-ultra-standard
Viewer • Updated • 50k • 9
chrisliu298/wildguard-adv-standard
Viewer • Updated • 8.96k • 11 • 1
chrisliu298/offsetbias-standard
Viewer • Updated • 8.5k • 18
chrisliu298/helpsteer2-standard
Viewer • Updated • 7.22k • 25