Hugging Face Smol Models Research

Team

community

Activity Feed

AI & ML interests

Exploring smol models (for text, vision and video) and high quality web and synthetic datasets

Recent Activity

craffel authored a paper 12 days ago

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

cfahlgren1 submitted a paper 12 days ago

From AGI to ASI

clefourrier authored a paper 4 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

View all activity

Papers

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

View all Papers

HuggingFaceTB 's collections 16

🧠 SmolLM3

Smol, multilingual, long-context reasoner

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 649k • 981
HuggingFaceTB/SmolLM3-3B-Base

Text Generation • 3B • Updated Aug 14, 2025 • 208k • 163
ggml-org/SmolLM3-3B-GGUF

3B • Updated Jul 8, 2025 • 8.04k • 63
HuggingFaceTB/SmolLM3-3B-ONNX

Text Generation • Updated Jul 14, 2025 • 176 • 27

SmolLM3 evaluation datasets

Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry

Idavidrein/gpqa

Benchmark • Updated Mar 5 • 1.25k • 96.8k • 470
HuggingFaceH4/aime_2024

Viewer • Updated Jan 26, 2025 • 30 • 46.9k • 63
MathArena/aime_2025

Viewer • Updated May 15 • 30 • 31.8k • 13
MathArena/hmmt_feb_2025

Viewer • Updated May 15 • 30 • 11.9k • 11

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • 2B • Updated Apr 21, 2025 • 132k • 733
HuggingFaceTB/SmolLM2-1.7B

Text Generation • 2B • Updated Feb 6, 2025 • 191k • 153
HuggingFaceTB/SmolLM2-360M-Instruct

Text Generation • 0.4B • Updated Sep 22, 2025 • 246k • 196
HuggingFaceTB/SmolLM2-360M

Text Generation • 0.4B • Updated Feb 6, 2025 • 86.6k • 110

📚 LLM pretraining datasets

A collection of datasets for LLM pretraining

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 274k • 2.9k
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 99.2k • 827
HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 387k • 1.16k
mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 503k • 288

🧩 SmolLM2 Intermediate Checkpoints

HuggingFaceTB/SmolLM2-1.7B-intermediate-checkpoints

Updated Feb 27, 2025 • 107 • 3
HuggingFaceTB/SmolLM2-360M-intermediate-checkpoints

Updated Feb 27, 2025 • 316 • 1
HuggingFaceTB/SmolLM2-135M-intermediate-checkpoints

Updated Feb 27, 2025 • 477 • 2

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 975k • 370
HuggingFaceTB/SmolVLM-500M-Instruct

Image-Text-to-Text • 0.5B • Updated Apr 8, 2025 • 349k • 195
Running on Zero

Agents

67

SmolVLM

📊

67

Generate descriptions from images and text prompts
HuggingFaceTB/SmolVLM-256M-Base

Image-Text-to-Text • 0.3B • Updated Jan 20, 2025 • 526 • 23

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos

Running

54

Instant SmolLM

🤏

54

Run SmolLM-360M-Instruct in realtime with MLC WebLLM
Running

162

SmolLM 360M Instruct WebGPU

🚀

162

A blazingly fast and powerful AI chatbot that runs locally.
Running

109

Wllama

🦙

109

Run GGUF directly on your browser!
Running

9

SmolPilot

🌖

9

Interact with a 360M parameter language model

Instruct datasets

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Jan 29, 2025 • 2.38k • 1.89k • 133
HuggingFaceTB/Magpie-Pro-300K-Filtered-H4

Viewer • Updated Aug 17, 2024 • 300k • 90 • 6
HuggingFaceTB/OpenHermes-2.5-H4

Viewer • Updated Aug 17, 2024 • 1M • 121 • 7
HuggingFaceTB/self-oss-instruct-sc2-H4

Viewer • Updated Aug 17, 2024 • 50.7k • 52 • 6

SmolLM3 pretraining datasets

datasets used in SmolLM3 pretraining

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 387k • 1.16k
mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 503k • 288
epfml/FineWeb2-HQ

Viewer • Updated Feb 19, 2025 • 380M • 28.1k • 68
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 99.2k • 827

Reasoning datasets

nvidia/OpenMathReasoning

Viewer • Updated May 27, 2025 • 5.68M • 17.9k • 467
nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 10.4k • 546
nvidia/AceMath-Instruct-Training-Data

Viewer • Updated Jan 17, 2025 • 5.56M • 220 • 65
nvidia/OpenMathInstruct-2

Viewer • Updated Nov 25, 2024 • 22M • 79.8k • 247

SmolVLM2 📺 Smallest video LM ever 🤏🏻

Build error

Agents

81

SmolVLM

📊

81

Generate answers by combining text and images
Build error

Agents

59

SmolVLM2 HighlightGenerator

🐨

59

Generate video highlights from uploaded video
Running

Agents

19

SmolVLM2 IPhone Waitlist

⏰

19

sign in to receive news on the iPhone app
Build error

Agents

32

SmolVLM2 XSPFGenerator (VLC prototype)

🎞

32

Generate video highlights and playlist

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 20.2k • 589
HuggingFaceTB/SmolVLM-Base

Image-Text-to-Text • 2B • Updated Nov 28, 2024 • 2.27k • 89
HuggingFaceTB/SmolVLM-Synthetic

Image-Text-to-Text • 2B • Updated Nov 26, 2024 • 120 • 12
HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 10 • 22

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code

HuggingFaceTB/stack-edu-classifier-python

0.1B • Updated Feb 19, 2025 • 60 • 7
HuggingFaceTB/stack-edu-classifier-sql

Text Classification • 0.1B • Updated Feb 19, 2025 • 30 • 1
HuggingFaceTB/stack-edu-classifier-c

Text Classification • 0.1B • Updated Feb 19, 2025 • 43
HuggingFaceTB/stack-edu-classifier-cpp

0.1B • Updated Feb 19, 2025 • 34

📐 FineMath

FineMath datasets and ablation models

HuggingFaceTB/finemath

Viewer • Updated Feb 6, 2025 • 48.3M • 22.4k • 367
HuggingFaceTB/FineMath-Llama-3B

3B • Updated Nov 27, 2025 • 27 • 22
HuggingFaceTB/finemath-classifier

Text Classification • 0.1B • Updated Dec 19, 2024 • 370 • 16
HuggingFaceTB/finemath-ablation-finemath-4plus

3B • Updated Dec 19, 2024 • 7 • 1

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 31.8k • 467
HuggingFaceTB/SmolLM-135M

Text Generation • 0.1B • Updated Aug 1, 2024 • 183k • 261
HuggingFaceTB/SmolLM-360M

Text Generation • 0.4B • Updated Aug 1, 2024 • 8.62k • 70
HuggingFaceTB/SmolLM-1.7B

Text Generation • 2B • Updated Oct 16, 2024 • 67.9k • 181

🌌 Cosmopedia

Resources for Cosmopedia dataset

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 19.2k • 721
HuggingFaceTB/cosmo-1b

Text Generation • 2B • Updated Jul 8, 2024 • 189 • 135
Sleeping

6

Web clusters

🕸

6

Inspect web clusters by educational score
HuggingFaceTB/cosmopedia-100k

Viewer • Updated Feb 19, 2024 • 100k • 668 • 49

🧠 SmolLM3

Smol, multilingual, long-context reasoner

HuggingFaceTB/SmolLM3-3B

Text Generation • 3B • Updated Sep 10, 2025 • 649k • 981
HuggingFaceTB/SmolLM3-3B-Base

Text Generation • 3B • Updated Aug 14, 2025 • 208k • 163
ggml-org/SmolLM3-3B-GGUF

3B • Updated Jul 8, 2025 • 8.04k • 63
HuggingFaceTB/SmolLM3-3B-ONNX

Text Generation • Updated Jul 14, 2025 • 176 • 27

SmolLM3 pretraining datasets

datasets used in SmolLM3 pretraining

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 387k • 1.16k
mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 503k • 288
epfml/FineWeb2-HQ

Viewer • Updated Feb 19, 2025 • 380M • 28.1k • 68
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 99.2k • 827

SmolLM3 evaluation datasets

Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry

Idavidrein/gpqa

Benchmark • Updated Mar 5 • 1.25k • 96.8k • 470
HuggingFaceH4/aime_2024

Viewer • Updated Jan 26, 2025 • 30 • 46.9k • 63
MathArena/aime_2025

Viewer • Updated May 15 • 30 • 31.8k • 13
MathArena/hmmt_feb_2025

Viewer • Updated May 15 • 30 • 11.9k • 11

Reasoning datasets

nvidia/OpenMathReasoning

Viewer • Updated May 27, 2025 • 5.68M • 17.9k • 467
nvidia/OpenCodeReasoning

Viewer • Updated May 4, 2025 • 753k • 10.4k • 546
nvidia/AceMath-Instruct-Training-Data

Viewer • Updated Jan 17, 2025 • 5.56M • 220 • 65
nvidia/OpenMathInstruct-2

Viewer • Updated Nov 25, 2024 • 22M • 79.8k • 247

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • 2B • Updated Apr 21, 2025 • 132k • 733
HuggingFaceTB/SmolLM2-1.7B

Text Generation • 2B • Updated Feb 6, 2025 • 191k • 153
HuggingFaceTB/SmolLM2-360M-Instruct

Text Generation • 0.4B • Updated Sep 22, 2025 • 246k • 196
HuggingFaceTB/SmolLM2-360M

Text Generation • 0.4B • Updated Feb 6, 2025 • 86.6k • 110

SmolVLM2 📺 Smallest video LM ever 🤏🏻

Build error

Agents

81

SmolVLM

📊

81

Generate answers by combining text and images
Build error

Agents

59

SmolVLM2 HighlightGenerator

🐨

59

Generate video highlights from uploaded video
Running

Agents

19

SmolVLM2 IPhone Waitlist

⏰

19

sign in to receive news on the iPhone app
Build error

Agents

32

SmolVLM2 XSPFGenerator (VLC prototype)

🎞

32

Generate video highlights and playlist

📚 LLM pretraining datasets

A collection of datasets for LLM pretraining

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 274k • 2.9k
HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 99.2k • 827
HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 387k • 1.16k
mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 503k • 288

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 20.2k • 589
HuggingFaceTB/SmolVLM-Base

Image-Text-to-Text • 2B • Updated Nov 28, 2024 • 2.27k • 89
HuggingFaceTB/SmolVLM-Synthetic

Image-Text-to-Text • 2B • Updated Nov 26, 2024 • 120 • 12
HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 10 • 22

🧩 SmolLM2 Intermediate Checkpoints

HuggingFaceTB/SmolLM2-1.7B-intermediate-checkpoints

Updated Feb 27, 2025 • 107 • 3
HuggingFaceTB/SmolLM2-360M-intermediate-checkpoints

Updated Feb 27, 2025 • 316 • 1
HuggingFaceTB/SmolLM2-135M-intermediate-checkpoints

Updated Feb 27, 2025 • 477 • 2

The Ultimate Collection of Code Classifiers

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code

HuggingFaceTB/stack-edu-classifier-python

0.1B • Updated Feb 19, 2025 • 60 • 7
HuggingFaceTB/stack-edu-classifier-sql

Text Classification • 0.1B • Updated Feb 19, 2025 • 30 • 1
HuggingFaceTB/stack-edu-classifier-c

Text Classification • 0.1B • Updated Feb 19, 2025 • 43
HuggingFaceTB/stack-edu-classifier-cpp

0.1B • Updated Feb 19, 2025 • 34

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 975k • 370
HuggingFaceTB/SmolVLM-500M-Instruct

Image-Text-to-Text • 0.5B • Updated Apr 8, 2025 • 349k • 195
Running on Zero

Agents

67

SmolVLM

📊

67

Generate descriptions from images and text prompts
HuggingFaceTB/SmolVLM-256M-Base

Image-Text-to-Text • 0.3B • Updated Jan 20, 2025 • 526 • 23

📐 FineMath

FineMath datasets and ablation models

HuggingFaceTB/finemath

Viewer • Updated Feb 6, 2025 • 48.3M • 22.4k • 367
HuggingFaceTB/FineMath-Llama-3B

3B • Updated Nov 27, 2025 • 27 • 22
HuggingFaceTB/finemath-classifier

Text Classification • 0.1B • Updated Dec 19, 2024 • 370 • 16
HuggingFaceTB/finemath-ablation-finemath-4plus

3B • Updated Dec 19, 2024 • 7 • 1

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos

Running

54

Instant SmolLM

🤏

54

Run SmolLM-360M-Instruct in realtime with MLC WebLLM
Running

162

SmolLM 360M Instruct WebGPU

🚀

162

A blazingly fast and powerful AI chatbot that runs locally.
Running

109

Wllama

🦙

109

Run GGUF directly on your browser!
Running

9

SmolPilot

🌖

9

Interact with a 360M parameter language model

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 31.8k • 467
HuggingFaceTB/SmolLM-135M

Text Generation • 0.1B • Updated Aug 1, 2024 • 183k • 261
HuggingFaceTB/SmolLM-360M

Text Generation • 0.4B • Updated Aug 1, 2024 • 8.62k • 70
HuggingFaceTB/SmolLM-1.7B

Text Generation • 2B • Updated Oct 16, 2024 • 67.9k • 181

Instruct datasets

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Jan 29, 2025 • 2.38k • 1.89k • 133
HuggingFaceTB/Magpie-Pro-300K-Filtered-H4

Viewer • Updated Aug 17, 2024 • 300k • 90 • 6
HuggingFaceTB/OpenHermes-2.5-H4

Viewer • Updated Aug 17, 2024 • 1M • 121 • 7
HuggingFaceTB/self-oss-instruct-sc2-H4

Viewer • Updated Aug 17, 2024 • 50.7k • 52 • 6

🌌 Cosmopedia

Resources for Cosmopedia dataset

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 19.2k • 721
HuggingFaceTB/cosmo-1b

Text Generation • 2B • Updated Jul 8, 2024 • 189 • 135
Sleeping

6

Web clusters

🕸

6

Inspect web clusters by educational score
HuggingFaceTB/cosmopedia-100k

Viewer • Updated Feb 19, 2024 • 100k • 668 • 49

AI & ML interests

Recent Activity

Papers

Team members 32

HuggingFaceTB 's collections 16

SmolVLM

Instant SmolLM

SmolLM 360M Instruct WebGPU

Wllama

SmolPilot

SmolVLM

SmolVLM2 HighlightGenerator

SmolVLM2 IPhone Waitlist

SmolVLM2 XSPFGenerator (VLC prototype)

Web clusters

SmolVLM

SmolVLM2 HighlightGenerator

SmolVLM2 IPhone Waitlist

SmolVLM2 XSPFGenerator (VLC prototype)

SmolVLM

Instant SmolLM

SmolLM 360M Instruct WebGPU

Wllama

SmolPilot

Web clusters