Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers (https://arxiv.org/abs/2509.23152)
Zhicheng YANG
yangzhch6
AI & ML interests
reasoning with LLMs
Recent Activity
updated
a model 1 day ago
yangzhch6/Qwen2.5-Math-7B-Think32k published
a model 1 day ago
yangzhch6/Qwen2.5-Math-7B-Think32k updated
a model 1 day ago
yangzhch6/Qwen2.5-Math-7B-Think32k-Openr1ColdStart46k-Syn Organizations
None yet