Siteng Huang
huangsiteng
AI & ML interests
vision-language models
Recent Activity
authored a paper about 7 hours ago
Unicorn: Text-Only Data Synthesis for Vision Language Model Training
authored a paper about 7 hours ago
OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation
authored a paper about 7 hours ago
SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning