Speculators testing - a nm-testing Collection

nm-testing 's Collections

KV Cache Quantization

FP8-Block Quantized Models

LLM Compressor testing

Speculators testing

Sparse-Llama-3.1-8B-2of4

Speculators testing

updated May 7

Models used by https://github.com/vllm-project/speculators CI system