Collection for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
Updated a model about 9 hours ago: TianHongZXY/Qwen3-4B-NSR
Published a model about 1 month ago: TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280
Updated a model about 1 month ago: TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280