DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers Paper • 2602.02016 • Published 11 days ago • 11
Optimizers Qualitatively Alter Solutions And We Should Leverage This Paper • 2507.12224 • Published Jul 16, 2025 • 1
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence Paper • 2405.15593 • Published May 24, 2024 • 1
SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models Paper • 2505.17967 • Published May 23, 2025 • 17