Xin-Qiang Cai's picture

2

Xin-Qiang Cai

caixq

https://caixq1996.github.io/

caixq1996

AI & ML interests

RL, RLHF, Learning under Weak Supervision, Diffusion Model

Recent Activity

upvoted a paper 11 days ago

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

authored a paper 6 months ago

PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models

authored a paper 6 months ago

Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers

View all activity

Organizations

None yet

Papers 2

arxiv:2510.00915

arxiv:2507.17220

models 0

None public yet

datasets 0

None public yet