Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation (Character.AI). Submitted by weimin wang.