deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation
•
685B
•
Updated
•
26.7k
•
646
None defined yet.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
mHC: Manifold-Constrained Hyper-Connections