SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 9 days ago • 206
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 238
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14, 2025 • 29
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 141
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding Paper • 2508.02215 • Published Aug 4, 2025 • 12
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17, 2025 • 264
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 190
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3, 2025 • 91
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published Feb 20, 2025 • 47
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Paper • 2502.04295 • Published Feb 6, 2025 • 13
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Paper • 2502.04295 • Published Feb 6, 2025 • 13 • 2
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13, 2025 • 101