Zilin Zhu

zhuzilin

AI & ML interests

MLSys

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published 15 days ago • 43

liked a model 30 days ago

zai-org/GLM-4.7

Text Generation • 358B • Updated 16 days ago • 81.5k • 1.78k

upvoted a paper about 2 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 101

updated a dataset about 2 months ago

zhuzilin/aime-2025

Viewer • Updated Nov 27, 2025 • 30 • 20

published a dataset about 2 months ago

zhuzilin/aime-2025

Viewer • Updated Nov 27, 2025 • 30 • 20

upvoted a paper 2 months ago

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published Nov 10, 2025 • 78

liked a model 6 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.05M • 4.37k

updated 2 datasets 6 months ago

zhuzilin/dapo-math-17k

Viewer • Updated Jul 25, 2025 • 17.4k • 1.24k • 4

zhuzilin/gsm8k

Viewer • Updated Jul 25, 2025 • 8.79k • 266 • 1

published a dataset 6 months ago

zhuzilin/gsm8k

Viewer • Updated Jul 25, 2025 • 8.79k • 266 • 1

upvoted 2 papers 7 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 250

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

updated a dataset 7 months ago

zhuzilin/aime-2024

Viewer • Updated Jun 19, 2025 • 30 • 952 • 2

published 2 datasets 7 months ago

zhuzilin/aime-2024

Viewer • Updated Jun 19, 2025 • 30 • 952 • 2

zhuzilin/dapo-math-17k

Viewer • Updated Jul 25, 2025 • 17.4k • 1.24k • 4

updated a model 8 months ago

zhuzilin/Moonlight-16B-A3B-Instruct

Updated May 31, 2025

published a model 8 months ago

zhuzilin/Moonlight-16B-A3B-Instruct

Updated May 31, 2025

upvoted a paper 9 months ago

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

Paper • 2504.15843 • Published Apr 22, 2025 • 16

upvoted a paper 11 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

liked a dataset 11 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21, 2025 • 1.15M • 1.42k • 548
