arxiv:2512.10756
Yuzhe Gu
vanilla1116
AI & ML interests
LLM; Reasoning; Hallucination; Self-Improvement
Recent Activity
liked a model 23 days ago
internlm/Intern-S1-Pro
commented on a paper 3 months ago
Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving
authored a paper 3 months ago
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward