unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF • Text Generation • 121B • Updated Mar 20 • 67.4k • 113
Running • Featured • QED-Nano: Teaching a Tiny Model to Prove Hard Theorems • Who needs 1T parameters? Olympiad proofs with a 4B model • 74
Running on CPU Upgrade • Featured • The Smol Training Playbook • The secrets to building world-class LLMs • 3.12k
SmolVLM Collection • State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 42
Running on CPU Upgrade • Agents • Featured • Open ASR Leaderboard • Explore and compare speech-to-text model benchmarks • 1.32k
Article • Prefill and Decode for Concurrent Requests - Optimizing LLM Performance • Apr 16, 2025 • 72
Article • The Transformers Library: standardizing model definitions • +2 • May 15, 2025 • 121