arxiv:2603.14145
Ramani Duraiswami
RamaniD
AI & ML interests
algorithms, audio, speech, spatial audio, vision, parallel computing
Recent Activity
authored a paper about 5 hours ago
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos
authored a paper 7 months ago
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
authored a paper 7 months ago
A Closer Look at the Limitations of Instruction Tuning