A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.)
Sicheng Feng
FSCCS
AI & ML interests
None yet
Recent Activity
updated
a dataset
7 days ago
FSCCS/ReasonMap
upvoted
a
paper
13 days ago
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding