nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 Text Generation • 124B • Updated 16 days ago • 268k • 235
Running 3.81k The Ultra-Scale Playbook 🌌 3.81k The ultimate guide to training LLM on large GPU Clusters