-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 106 -
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 259 -
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Paper • 2404.16710 • Published • 80 -
Advancing Open-source World Models
Paper • 2601.20540 • Published • 133
Mayor PRO
Eric111
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago
mistralai/Mistral-Small-4-119B-2603 liked a model 1 day ago
nvidia/Nemotron-Cascade-2-30B-A3B liked a model 4 days ago
ibm-granite/granite-4.0-1b-speech