Smaller Models are Natural Explorers for Policy-Level Diversity in GRPO • Paper • 2605.30789 • Published 15 days ago • 22
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests • Paper • 2601.06953 • Published Jan 11 • 47