arxiv:2510.00915
Xin-Qiang Cai
caixq
AI & ML interests
RL, RLHF, Learning under Weak Supervision, Diffusion Model
Recent Activity
authored a paper 6 months ago
PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models
authored a paper 6 months ago
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
Organizations
None yet