microsoft/Phi-4-reasoning-vision-15B Image-Text-to-Text • 15B • Updated about 2 hours ago • 20.2k • 154
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device Paper • 2602.20161 • Published 23 days ago • 23
Running on Zero MCP Featured 84 BitDance-14B-64x 🚀 84 Open-source autoregressive model with binary visual tokens.