When Can LLMs Learn to Reason with Weak Supervision? Paper • 2604.18574 • Published about 1 month ago • 25
SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions Paper • 2604.08477 • Published Apr 9 • 1
Ashima/micro_top2_augmented_going_against_strong_prior_Mar19-2244 Viewer • Updated Mar 20 • 7.45k • 123