-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
Text Generation
•
22B
•
Updated
•
6.66M
•
•
4.22k
Text Generation
•
120B
•
Updated
•
3.08M
•
•
4.36k
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
•
18B
•
Updated
•
2.27k
•
17
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
5.22k
•
1.25k
mlx-community/GLM-4.7-Flash-8bit
Text Generation
•
30B
•
Updated
•
1.05k
•
11
AlicanKiraz0/Mihenk-LLM-14B-Turkish-Financial-Model-mlx-8Bit
15B
•
Updated
•
23
•
5
MultiverseComputingCAI/HyperNova-60B
Text Generation
•
60B
•
Updated
•
1.14k
•
43
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
•
5B
•
Updated
•
2.73k
•
11
Octen/Octen-Embedding-8B-INT8
Sentence Similarity
•
8B
•
Updated
•
27
•
3
mlx-community/translategemma-4b-it-8bit
Text Generation
•
1B
•
Updated
•
655
•
3
mlx-community/translategemma-27b-it-8bit
Text Generation
•
27B
•
Updated
•
655
•
3
MaziyarPanahi/Mistral-7B-Instruct-Aya-101-GGUF
Text Generation
•
7B
•
Updated
•
158
•
12
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
3B
•
Updated
•
1.13k
•
200
drwlf/medgemma-4b-it-abliterated
Text Generation
•
Updated
•
11
•
5
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
4.41k
•
11
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
28.9k
•
20
huihui-ai/Huihui-gpt-oss-20b-mxfp4-abliterated-v2
Text Generation
•
21B
•
Updated
•
2.13k
•
15
FabioSarracino/VibeVoice-Large-Q8
Text-to-Audio
•
9B
•
Updated
•
2.58k
•
76
Tengyunw/MiniMax-M2.1-NVFP4
Text Generation
•
115B
•
Updated
•
96
•
5
mlx-community/translategemma-12b-it-8bit
Text Generation
•
12B
•
Updated
•
618
•
2
LiquidAI/LFM2.5-1.2B-Thinking-MLX-8bit
Text Generation
•
0.3B
•
Updated
•
2
MaziyarPanahi/BioMistral-7B-GGUF
Text Generation
•
7B
•
Updated
•
1.12k
•
56
MaziyarPanahi/Saul-Instruct-v1-GGUF
Text Generation
•
7B
•
Updated
•
217
•
9
ragraph-ai/stable-cypher-instruct-3b
Text Generation
•
3B
•
Updated
•
343
•
29
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
8B
•
Updated
•
12.5k
•
31
Qwen/Qwen2.5-7B-Instruct-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
4.19k
•
18
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int8
Text Generation
•
73B
•
Updated
•
21.3k
•
28
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
124k
•
8
brunopio/Llama3-8B-1.58-100B-tokens-GGUF
Text Generation
•
3B
•
Updated
•
668
•
19
mlx-community/Qwen2.5-Coder-32B-Instruct-8bit
Text Generation
•
Updated
•
221
•
13