Active filters: 4-bit
MaziyarPanahi/Saul-Instruct-v1-GGUF • Text Generation • 7B • Updated • 217 • 9
unsloth/llama-3-8b-bnb-4bit • Text Generation • 8B • Updated • 62.5k • 203
zementalist/llama-3-8B-chat-psychotherapist • Text Generation • 8B • Updated • 20 • 30
unsloth/mistral-7b-v0.3-bnb-4bit • Text Generation • 7B • Updated • 347k • 22
unsloth/Meta-Llama-3.1-8B-bnb-4bit • Text Generation • 8B • Updated • 54.1k • 109
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 8B • Updated • 227k • 92
shuyuej/Llama-Guard-3-8B-GPTQ • Text Generation • 8B • Updated • 8 • 1
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF • Text Generation • 2B • Updated • 124k • 8
Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4 • Text Generation • 8B • Updated • 943k • 11
Qwen/Qwen2.5-Coder-7B-Instruct-AWQ • Text Generation • 8B • Updated • 327k • 19
unsloth/Qwen2.5-Coder-7B-Instruct-bnb-4bit • Text Generation • 4B • Updated • 41.6k • 10
unsloth/Llama-3.2-1B-Instruct-bnb-4bit • Text Generation • 1B • Updated • 19.8k • 22
4B • Updated • 38 • 20
Ayush12a/llama3.1_finetuned_on_indian_legal_dataset • Text Generation • 8B • Updated • 4
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4 • Text Generation • 33B • Updated • 6.15k • 24
Qwen/Qwen2.5-Coder-14B-Instruct-AWQ • Text Generation • 15B • Updated • 37.1k • 13
unsloth/llava-1.5-7b-hf-bnb-4bit • Image-Text-to-Text • 4B • Updated • 175k • 7
ibnzterrell/Meta-Llama-3.3-70B-Instruct-AWQ-INT4 • Text Generation • 71B • Updated • 132k • 29
mlx-community/DeepSeek-R1-Distill-Qwen-32B-4bit • Text Generation • 5B • Updated • 1.41k • 45
casperhansen/mistral-small-24b-instruct-2501-awq • 24B • Updated • 2.98k • 9
Text Generation • 12B • Updated • 6 • 2
Qwen/Qwen2.5-VL-72B-Instruct-AWQ • Image-Text-to-Text • 74B • Updated • 141k • 72
MaziyarPanahi/gemma-3-4b-it-GGUF • Text Generation • 4B • Updated • 175k • 17
unsloth/orpheus-3b-0.1-ft-unsloth-bnb-4bit • Text-to-Speech • 3B • Updated • 42.8k • 16
ethicalabs/TowerInstruct-7B-v0.2-mlx-4Bit • Translation • 1B • Updated • 16 • 2
unsloth/Qwen3-1.7B-unsloth-bnb-4bit • Text Generation • 2B • Updated • 31k • 12
unsloth/Qwen3-30B-A3B-bnb-4bit • 31B • Updated • 608 • 20
MaziyarPanahi/Qwen3-14B-GGUF • Text Generation • 15B • Updated • 178k • 5
Text Generation • 8B • Updated • 125k • 33
Qwen/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 31B • Updated • 302k • 44