Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 6 days ago • 244
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 12 days ago • 111
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 12 days ago • 152
RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details Paper • 2604.06870 • Published 19 days ago • 41
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 19 days ago • 185
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 21 days ago • 200
GEMS: Agent-Native Multimodal Generation with Memory and Skills Paper • 2603.28088 • Published 27 days ago • 84
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published 26 days ago • 48
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published 26 days ago • 46
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 30 days ago • 65
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 28 days ago • 144
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 27 days ago • 58
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization Paper • 2603.28342 • Published 27 days ago • 26
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published about 1 month ago • 156
RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models Paper • 2603.25502 • Published about 1 month ago • 57
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 124
ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation Paper • 2603.11421 • Published Mar 12 • 34