Submitted by
Xin Zhou
H-EmbodVis
university
AI & ML interests
None defined yet.
Papers
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models