R1-V
Towards the Aha Moment of Vision-Language Models

MMInstruction/Clevr_CoGenT_TrainA_R1 • Updated Feb 13, 2025 • 37.8k • 139 • 48
MMInstruction/SuperClevr_Val • Updated Feb 18, 2025 • 5k • 46 • 1
MMInstruction/Clevr_CoGenT_ValA • Updated Feb 3, 2025 • 5k • 404 • 1
MMInstruction/Clevr_CoGenT_ValB • Updated Feb 3, 2025 • 5k • 3 • 2