Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
's Collections
KV Cache Quantization
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models
Speculators testing
updated
2 days ago
Models used by https://github.com/vllm-project/speculators CI system
Upvote
-
nm-testing/random-weights-llama3.1.8b-2layer-eagle3-unconverted
Updated
Oct 27, 2025
•
208
nm-testing/tiny-testing-random-weights
584k
•
Updated
Oct 24, 2025
•
2.96k
nm-testing/Speculator-Qwen3-8B-Eagle3
Updated
Jul 17, 2025
•
1.76k
nm-testing/SpeculatorLlama3-1-8B-Eagle3-converted-0717-quantized
1.0B
•
Updated
Jul 29, 2025
•
13.4k
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
•
1B
•
Updated
Apr 8
•
71.7k
•
28
yuhuili/EAGLE-LLaMA3.1-Instruct-8B
Updated
Sep 19, 2025
•
418k
•
2
nm-testing/tinysmokellama-3.2
354k
•
Updated
Sep 17, 2025
•
89.1k
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
•
1B
•
Updated
Apr 8
•
130
nm-testing/testing-llama3.1.8b-2layer-eagle3
Updated
Jan 5
•
1.6k
nm-testing/dflash-qwen3-8b-speculators
2B
•
Updated
24 days ago
•
13.2k
nm-testing/qwen3-8b-peagle-speculators
2B
•
Updated
2 days ago
•
65
Upvote
-
Share collection
View history
Collection guide
Browse collections