Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 128
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI 20 days ago • 480
Article I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing 21 days ago • 15
Article From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output Feb 7 • 22
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 87
Article Making LLMs Smaller Without Breaking Them: A GLU-Aware Pruning Approach Nov 24, 2024 • 20
Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published Jan 30 • 57
Running Featured 46 Pocket TTS ONNX Web Demo 🌖 Real-time voice cloning entirely in your browser! (CPU)
State-of-the-art Danish Models Collection These models constitute state-of-the-art models for Danish within their respective domain (highlighted below the model). • 18 items • Updated Nov 4, 2025 • 18