LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 18 days ago • 134
Pretraining Frame Preservation in Autoregressive Video Memory Compression Paper • 2512.23851 • Published 26 days ago • 24
SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published Dec 23, 2025 • 43
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published Dec 12, 2025 • 36
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published Dec 10, 2025 • 72
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 185
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Paper • 2410.10774 • Published Oct 14, 2024 • 25