Gongxun Li's picture

Gongxun Li

AlexGeek

·

AlexJJ009

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

upvoted a paper 6 days ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

liked a model 16 days ago

AIDC-AI/Ovis2.6-30B-A3B

View all activity

Organizations

upvoted 2 papers 6 days ago

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published 18 days ago • 215

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published 20 days ago • 247

liked 2 models 16 days ago

AIDC-AI/Ovis2.6-30B-A3B

Image-Text-to-Text • 31B • Updated 5 days ago • 25.4k • 141

Aryanne/acestep-v15-test-merges

Updated 18 days ago • 23

upvoted a paper 16 days ago

The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies

Paper • 2602.09877 • Published 19 days ago • 197

liked a model 16 days ago

zai-org/GLM-5

Text Generation • 754B • Updated 16 days ago • 194k • • 1.66k

New activity in chhao/Weak-Driven-Learning 16 days ago

Create README.md

#1 opened 16 days ago by

liked a model 16 days ago

chhao/Weak-Driven-Learning

Text Generation • Updated 15 days ago • 59 • 6

authored a paper 18 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 20 days ago • 272

upvoted a paper 19 days ago

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published 20 days ago • 272

upvoted a paper about 2 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 155