CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published 5 days ago • 15
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published 6 days ago • 51
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published 22 days ago • 49
TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions Paper • 2602.08711 • Published 22 days ago • 28
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning Paper • 2602.08234 • Published 22 days ago • 68
Context Forcing: Consistent Autoregressive Video Generation with Long Context Paper • 2602.06028 • Published 26 days ago • 36
Unified Personalized Reward Model for Vision Generation Paper • 2602.02380 • Published 29 days ago • 20
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 28 days ago • 58