AudioVisual-Caption/ASID-1M
Viewer
•
Updated
•
241k
•
65
•
3
Video Understanding, Audio-Visual Learning, Multimodal LLMs, Video Captioning, Instruction Tuning, Dataset Curation