view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 16 days ago • 854
Cosmos-Predict2.5 Collection Improved World Simulation with Video Foundation Models for Physical AI • 2 items • Updated 2 days ago • 20
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation Paper • 2509.19296 • Published Sep 23, 2025 • 27
Running on Zero Agents Featured 1.88k Qwen3-TTS Demo 🎙 1.88k Generate speech audio from text with custom or cloned voices
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation Paper • 2511.20714 • Published Nov 25, 2025 • 50
Running on Zero Agents Featured 833 FLUX.2 [dev] 💻 833 Generate or edit images from text prompts with optional pictures
PS3: Scaling Vision Pre-Training to 4K Resolution Collection Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/ • 15 items • Updated 2 days ago • 9