arxiv:2507.21046
Huan-ang Gao
c7w
AI & ML interests
None yet
Recent Activity
upvoted a paper about 23 hours ago
How Far Can Unsupervised RLVR Scale LLM Training?
upvoted a paper 9 days ago
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
upvoted a paper 3 months ago
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe