2-layer truncated models used for Bertha CI regression tests.
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 57
hyper-accel/Qwen3-VL-2B-Instruct-W4A16-dequant-bf16
2B • Updated
hyper-accel/ci-2layer-llama3-8b
1B • Updated • 7
hyper-accel/ci-random-w4a16-asym-g64-llama3-3b
0.6B • Updated • 11
hyper-accel/Qwen3-VL-2B-Instruct-W4A16_ASYM-G64
2B • Updated • 91
hyper-accel/Llama-3.2-3B-Instruct-W4A16_ASYM-G64
3B • Updated • 243
hyper-accel/ci-random-qwen2-moe-a3b
Text Generation • 2B • Updated • 1.71k
hyper-accel/qwen2-moe-a3b-2layer
2B • Updated • 3
hyper-accel/ci-2layer-llama2-7b
0.7B • Updated • 22.6k • 1
hyper-accel/ci-random-bfloat16-llama3-3b
Text Generation • 0.6B • Updated • 6.23k
hyper-accel/ci-random-llama3-3b
Text Generation • 0.6B • Updated • 38
datasets 0
None public yet