arxiv:2505.19914
Siyu Yuan
siyuyuan
AI & ML interests
Knowledge generation
Recent Activity
upvoted
a
paper
about 1 month ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning
upvoted
a
paper
3 months ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
upvoted
a
paper
6 months ago
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention