-
-
-
-
-
-
Inference Providers
Active filters:
4bit
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
•
14B
•
Updated
•
426k
•
8
mlx-community/Qwen3.5-397B-A17B-nvfp4
Text Generation
•
396B
•
Updated
•
3.35k
•
4
legraphista/DeepSeek-Coder-V2-Lite-Instruct-IMat-GGUF
Text Generation
•
16B
•
Updated
•
624
•
9
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1
Text Generation
•
8B
•
Updated
•
6
•
6
ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2
Text Generation
•
8B
•
Updated
•
418
•
8
ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1
Text Generation
•
33B
•
Updated
•
6
•
12
TheCluster/amoral-gemma-3-12B-v2-mlx-4bit
Image-Text-to-Text
•
Updated
•
53
•
2
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
•
33B
•
Updated
•
5
•
2
ModelCloud/Granite-4.0-H-1B-GPTQMODEL-W4A16
Text Generation
•
1B
•
Updated
•
3
•
1
ModelCloud/Granite-4.0-H-350M-GPTQMODEL-W4A16
Text Generation
•
0.3B
•
Updated
•
22
•
1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16
Text Generation
•
15B
•
Updated
•
3
•
1
ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2
Text Generation
•
15B
•
Updated
•
3
•
1
marksverdhai/vibevoice-7b-bnb-4bit
Text-to-Speech
•
10B
•
Updated
•
296
•
5
0xSero/GLM-4.7-REAP-218B-A32B-W4A16
Text Generation
•
2B
•
Updated
•
355
•
19
JEILDLWLRMA/Qwen3-VL-4B-Instruct-NVFP4
Image-to-Text
•
3B
•
Updated
•
39
•
2
sugam24/dots-ocr-awq-4bit
Image-to-Text
•
0.8B
•
Updated
•
177
•
1
manu02/Octen-Embedding-8B-bnb-4bit-nf4-dq
Text Generation
•
8B
•
Updated
•
12
•
2
andrevp/Nanbeige4.1-3B-MLX-4bit
Text Generation
•
0.6B
•
Updated
•
88
•
1
mayaeary/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
7
•
40
mayaeary/pygmalion-6b_dev-4bit-128g
Text Generation
•
Updated
•
16
•
121
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g
Text Generation
•
Updated
•
2
•
2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g
Text Generation
•
Updated
•
2
•
2
Ancestral/Dolly_Shygmalion-6b-4bit-128g
Text Generation
•
Updated
•
19
•
5
Ancestral/PPO_Shygmalion-6b-4bit-128g
Text Generation
•
Updated
•
3
Ancestral/Dolly_Malion-6b-4bit-128g
Text Generation
•
Updated
•
3
•
1
4bit/pygmalion-6b-4bit-128g
Text Generation
•
Updated
•
1
•
3
Text Generation
•
Updated
•
6
•
1
seonglae/opt-125m-4bit-gptq
Text Generation
•
Updated
•
3
seonglae/wizardlm-7b-uncensored-gptq
Text Generation
•
Updated
•
1
seonglae/llama-2-7b-chat-hf-gptq
Text Generation
•
Updated
•
3