Running Featured 73 Distilling 100B+ Models 40x Faster with TRL π 73 TRL distillation for 100B+ teachers, 40x faster
Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems π 74 Who needs 1T parameters? Olympiad proofs with a 4B model
Running on CPU Upgrade Featured 3.12k The Smol Training Playbook π 3.12k The secrets to building world-class LLMs
HuggingFaceH4/zephyr-7b-alpha Text Generation β’ 7B β’ Updated Oct 16, 2024 β’ 5.44k β’ β’ 1.12k
Running 62 Bringing paper to life: A modern template for scientific writing π 62 Explore a scientific article with interactive visualizations
Running Agents Featured 253 Jupyter Agent 2 π 253 Generate Jupyter notebooks from natural language tasks
google/embeddinggemma-300m Sentence Similarity β’ 0.3B β’ Updated Sep 25, 2025 β’ 1.07M β’ β’ 1.62k