Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZihuiCheng's picture
3 2

ZihuiCheng

czh-up
·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
authored a paper 1 day ago
STEP3-VL-10B Technical Report
authored a paper 1 day ago
Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
View all activity

Organizations

None yet

authored 3 papers 1 day ago

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

Paper • 2412.12932 • Published Dec 17, 2024 • 2

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 193

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought

Paper • 2505.15510 • Published May 21, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs