arxiv:2605.09959
Jiaxin Huang
teapot123
AI & ML interests
None yet
Recent Activity
upvoted a paper 13 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
upvoted a paper 14 days ago
Process Rewards with Learned Reliability
authored a paper 20 days ago
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding