8 12 28

Yixiao Ge

yxgeee

https://geyixiao.com/

AI & ML interests

Computer Vision, Foundation Models

Recent Activity

upvoted a paper 14 days ago

UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

upvoted a paper 9 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

authored a paper 11 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

View all activity

Organizations

upvoted a paper 14 days ago

UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

Paper • 2604.19734 • Published 17 days ago • 29

upvoted a paper 9 months ago

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28, 2025 • 57

authored a paper 11 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19, 2025 • 27

upvoted 2 papers 11 months ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published Jun 19, 2025 • 27

Aligning Latent Spaces with Flow Priors

Paper • 2506.05240 • Published Jun 5, 2025 • 27

liked a model 11 months ago

TencentARC/TokLIP

Image-Text-to-Text • Updated Aug 21, 2025 • 14 • 13

upvoted a paper 11 months ago

AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation

Paper • 2506.03126 • Published Jun 3, 2025 • 22

authored a paper 11 months ago

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Paper • 2505.21374 • Published May 27, 2025 • 28

upvoted a paper 12 months ago

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Paper • 2505.21374 • Published May 27, 2025 • 28

authored a paper about 1 year ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1, 2025 • 70

upvoted a paper about 1 year ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1, 2025 • 70

authored a paper about 1 year ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31, 2025 • 38

upvoted a paper about 1 year ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31, 2025 • 38

authored a paper about 1 year ago

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Paper • 2503.19480 • Published Mar 25, 2025 • 16

upvoted a paper about 1 year ago

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Paper • 2503.19480 • Published Mar 25, 2025 • 16

authored 3 papers over 1 year ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 16

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 22

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6, 2024 • 25

liked a Space almost 2 years ago

YOLO-World-Image

🚀

liked a dataset almost 2 years ago

TencentARC/StoryStream

Preview • Updated Jul 17, 2024 • 229 • 29

Yixiao Ge

AI & ML interests

Recent Activity

Organizations

yxgeee's activity

YOLO-World-Image