- yrshi/AutoRefine-Qwen2.5-3B-Base
  Question Answering • 3B • Updated • 107 • 2
- yrshi/AutoRefine-Qwen2.5-3B-Instruct
  3B • Updated • 5 • 1
- yrshi/AutoRefine-Qwen2.5-7B-Base
  Question Answering • 8B • Updated • 43 • 1
- yrshi/AutoRefine-Qwen2.5-7B-Instruct
  Question Answering • 8B • Updated • 8 • 1
Yaorui SHI
yrshi
AI & ML interests
None yet
Recent Activity
upvoted a paper 28 minutes ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
upvoted a paper 2 days ago
SOD: Step-wise On-policy Distillation for Small Language Model Agents