google/diffusiongemma-26B-A4B-it Image-Text-to-Text β’ 26B β’ Updated 15 days ago β’ 1.08M β’ 1.06k
openai/whisper-large-v3 Automatic Speech Recognition β’ 2B β’ Updated Aug 12, 2024 β’ 5.73M β’ β’ 5.87k
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification β’ 0.9B β’ Updated Mar 7, 2024 β’ 127 β’ 53
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation β’ 31B β’ Updated Oct 10, 2025 β’ 104k β’ 813
Running on CPU Upgrade Featured 3.22k The Smol Training Playbook π 3.22k The secrets to building world-class LLMs
Running 3.9k The Ultra-Scale Playbook π 3.9k The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation β’ 2B β’ Updated Feb 24, 2025 β’ 614k β’ β’ 1.53k
Running 601 Scaling test-time compute π 601 Boost LLM answers with flexible testβtime search strategies