Communication Efficient LLM Pre-training with SparseLoCo • Paper • 2508.15706 • Published Aug 21, 2025 • 1
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.97k
casperhansen/llama-3-70b-instruct-awq Text Generation • 71B • Updated Apr 19, 2024 • 6.67k • 70