Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
rayruiyang
's Collections
VST
Haplo-VL
VST
updated
28 days ago
A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Upvote
6
rayruiyang/VST-3B-RL
Image-Text-to-Text
•
4B
•
Updated
29 days ago
•
1.14k
•
3
rayruiyang/VST-3B-SFT
Image-Text-to-Text
•
4B
•
Updated
29 days ago
•
3.07k
rayruiyang/VST-7B-SFT
Image-Text-to-Text
•
8B
•
Updated
29 days ago
•
3.05k
rayruiyang/VST-7B-RL
Image-Text-to-Text
•
8B
•
Updated
29 days ago
•
655
Visual Spatial Tuning
Paper
•
2511.05491
•
Published
Nov 7
•
49
Upvote
6
+2
Share collection
View history
Collection guide
Browse collections