CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models Paper • 2412.12932 • Published Dec 17, 2024 • 2
Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought Paper • 2505.15510 • Published May 21, 2025