arxiv:2604.24842
Yale Song
yalesong
AI & ML interests
Computer Vision, Machine Learning
Recent Activity
authored a paper 2 days ago
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering authored a paper 2 days ago
TGIF: A New Dataset and Benchmark on Animated GIF Description authored a paper 2 days ago
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the
BackboneOrganizations
None yet