Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation Paper • 2504.16060 • Published Apr 22, 2025
Can Vision Language Models Infer Human Gaze Direction? A Controlled Study Paper • 2506.05412 • Published Jun 4, 2025 • 5
Vision Language Models Cannot Reason About Physical Transformation Paper • 2603.07109 • Published Mar 7 • 2
Vision Language Models Cannot Reason About Physical Transformation Paper • 2603.07109 • Published Mar 7 • 2
Core Knowledge Deficits in Multi-Modal Language Models Paper • 2410.10855 • Published Oct 6, 2024 • 4