Two students, one methodology. 30B teacher → 1.7B and 0.6B via proof-weighted distillation + legal SFT. Six models, Apache 2.0.
-
reaperdoesntknow/Qwen3-1.7B-Distilled-30B-A3B
Text Generation • 2B • Updated • 92 -
reaperdoesntknow/Qwen3-1.7B-Distilled-30B-A3B-SFT-GGUF
Text Generation • 2B • Updated • 157 -
reaperdoesntknow/Qwen3-1.7B-Distilled-30B-A3B-SFT
Text Generation • 2B • Updated • 36 -
reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B
Text Generation • 0.8B • Updated • 35