Hai's picture

9 1

Hai

fiowhahf

·

https://github.com/hearthht

AI & ML interests

LLM

Recent Activity

updated a dataset 27 days ago

fiowhahf/SVAMP

published a dataset 27 days ago

fiowhahf/SVAMP

updated a dataset 27 days ago

fiowhahf/OlympiadBench

View all activity

Organizations

None yet

authored a paper 5 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7, 2025 • 17