A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Ray Yang
rayruiyang
AI & ML interests
None yet
Recent Activity
upvoted a paper about 13 hours ago
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence upvoted a paper 6 days ago
Utonia: Toward One Encoder for All Point Clouds updated
a collection
about 1 month ago
VST Organizations
None yet