Richard Zhuang PRO

RZ412

https://richardzhuang0412.github.io

AI & ML interests

LLM Routing, LLM + Games, Post-Training, Agents

Recent Activity

updated a dataset 15 minutes ago

DCAgent2/dev_set_v2_rl__24GPU_shaped_entropy__swe_rebench_patched_oracle__100k_wd0__Qwen2646d499

published a dataset 15 minutes ago

DCAgent2/dev_set_v2_rl__24GPU_shaped_entropy__swe_rebench_patched_oracle__100k_wd0__Qwen2646d499

updated a dataset 20 minutes ago

DCAgent2/dev_set_v2_rl__24GPU_shaped__selfinstruct_naive_sandboxes_2_verified__exp_tas_o57316c9a

Organizations

New activity in laion/exp_tas_optimal_combined_traces-Qwen3.5-9B about 16 hours ago

Add preprocessor_config.json from Qwen/Qwen3.5-9B base model

#2 opened about 16 hours ago by

RZ412

Upload preprocessor_config.json

#1 opened about 16 hours ago by

RZ412

New activity in open-r1/README 12 months ago

[Experiment] Training R1-Zero-like models with Open R1

👀🔥 11

#20 opened 12 months ago by

lewtun

New activity in huggingface/HuggingDiscussions about 1 year ago

[FEEDBACK] Daily Papers

🔥❤️ 21

185

#32 opened almost 2 years ago by

kramp

New activity in RZ412/PokerBench about 1 year ago

Fix formatting

#4 opened about 1 year ago by

nielsr

Add task category, paper and code links

#3 opened about 1 year ago by

nielsr

add minimal metadata

#2 opened about 1 year ago by

davanstrien
