arxiv:2605.18984
Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
upvoted a paper about 3 hours ago
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning authored a paper 1 day ago
Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos