1 8 4

chenzehao

chhao

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

A Very Big Video Reasoning Suite

upvoted a paper 1 day ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

upvoted a paper 4 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 3 days ago • 438

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 15 days ago • 184

upvoted a paper 4 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 17 days ago • 208

upvoted 2 papers 12 days ago

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published 20 days ago • 204

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published 16 days ago • 230

New activity in chhao/Weak-Driven-Learning 13 days ago

Create README.md

#1 opened 13 days ago by

AlexGeek

liked a model 13 days ago

DMindAI/DMind-3-nano

Text Generation • Updated about 8 hours ago • 56

liked 2 datasets 13 days ago

TeichAI/Pony-Alpha-15k

Viewer • Updated 10 days ago • 14.9k • 413 • 54

openbmb/UltraData-Math

Viewer • Updated 6 days ago • 181M • 48.3k • 249

liked a model 16 days ago

chhao/Weak-Driven-Learning

Text Generation • Updated 13 days ago • 58 • 6

updated a model 16 days ago

chhao/Weak-Driven-Learning

Text Generation • Updated 13 days ago • 58 • 6

published a model 16 days ago

chhao/Weak-Driven-Learning

Text Generation • Updated 13 days ago • 58 • 6

upvoted a paper 16 days ago

Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization

Paper • 2506.17252 • Published Jun 8, 2025 • 2

authored 2 papers 16 days ago

Improving Viewpoint Consistency in 3D Generation via Structure Feature and CLIP Guidance

Paper • 2412.02287 • Published Dec 3, 2024 • 1

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 18 days ago • 269

upvoted 2 papers 16 days ago

Real-Time Aligned Reward Model beyond Semantics

Paper • 2601.22664 • Published 27 days ago • 13

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 18 days ago • 269

chenzehao

AI & ML interests

Recent Activity

Organizations

chhao's activity

Create README.md