JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 24 days ago • 208
Geometric Context Transformer for Streaming 3D Reconstruction Paper • 2604.14141 • Published Apr 15 • 24
LoopCoder-v2: Only Loop Once for Efficient Test-Time Computation Scaling Paper • 2606.18023 • Published 18 days ago • 209
Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action Article • nvidia • Jun 1 • 85
Nano-World-Model Collection 🌍 A minimalist repository for training video world models based on diffusion-forcing. • 20 items • Updated May 17 • 7
The Smol Training Playbook 📚 The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 3.23k