view article Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning Feb 20, 2024 • 33
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published Mar 12, 2025 • 39