EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents Paper • 2602.23205 • Published 1 day ago • 8
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 7 days ago • 30
UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model Paper • 2602.14178 • Published 12 days ago • 12
SemanticMoments: Training-Free Motion Similarity via Third Moment Features Paper • 2602.09146 • Published 18 days ago • 21
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published 18 days ago • 49
Ex-Omni: Enabling 3D Facial Animation Generation for Omni-modal Large Language Models Paper • 2602.07106 • Published 21 days ago • 11
Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging Paper • 2602.04805 • Published 23 days ago • 6
DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning Paper • 2601.21716 • Published 29 days ago • 13
360Anything: Geometry-Free Lifting of Images and Videos to 360° Paper • 2601.16192 • Published Jan 22 • 8
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations Paper • 2505.18096 • Published May 23, 2025 • 1
SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation Paper • 2507.09862 • Published Jul 14, 2025 • 51
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction Paper • 2512.25073 • Published Dec 31, 2025 • 41
Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting Paper • 2512.20927 • Published Dec 24, 2025 • 17
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published Dec 29, 2025 • 50
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published Dec 16, 2025 • 71
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published Dec 12, 2025 • 38
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published Dec 9, 2025 • 132
Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image Paper • 2512.05044 • Published Dec 4, 2025 • 17
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published Dec 3, 2025 • 76