Yi Ding

Tuwhy

https://dripnowhy.github.io/

DripNowhy

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published 9 days ago • 34

updated a model 2 months ago

Tuwhy/Octopus-8B

Image-Text-to-Text • 9B • Updated Feb 16 • 20

upvoted a paper 3 months ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

Paper • 2602.08503 • Published Feb 9 • 3

submitted a paper to Daily Papers 3 months ago

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

Paper • 2602.08503 • Published Feb 9 • 3

updated a collection 3 months ago

Octopus

Collection

RL checkpoints of Octopus-8B and baselines from the paper: Learning Self-Correction in Vision-Language Models via Rollout Augmentation • 6 items • Updated Feb 9

updated a model 3 months ago

Tuwhy/Qwen3-VL-8B-GRPO-n16

9B • Updated Jan 28 • 1

published a model 3 months ago

Tuwhy/Qwen3-VL-8B-GRPO-n16

9B • Updated Jan 28 • 1

updated a model 3 months ago

Tuwhy/Qwen3-VL-8B-DAPO-n16

9B • Updated Jan 26 • 1

published a model 3 months ago

Tuwhy/Qwen3-VL-8B-DAPO-n16

9B • Updated Jan 26 • 1

updated a model 3 months ago

Tuwhy/Qwen3-VL-8B-SRPO-n16

9B • Updated Jan 24 • 1

published a model 3 months ago

Tuwhy/Qwen3-VL-8B-SRPO-n16

9B • Updated Jan 24 • 1

updated a model 3 months ago

Tuwhy/Qwen3-VL-8B-SCPO-random

9B • Updated Jan 23 • 7

published a model 3 months ago

Tuwhy/Qwen3-VL-8B-SCPO-random

9B • Updated Jan 23 • 7
