Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings Paper • 2605.22391 • Published 14 days ago • 37
Tarsier: Recipes for Training and Evaluating Large Video Description Models Paper • 2407.00634 • Published Jun 30, 2024 • 2
Fine-grained Video-Text Retrieval: A New Benchmark and Method Paper • 2501.00513 • Published Dec 31, 2024 • 2
TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs Paper • 2512.14698 • Published Dec 16, 2025 • 25