Active filters: vLLM
mistralai/Mistral-Small-4-119B-2603 • 119B • Updated • 51.3k • 384
mistralai/Mistral-Medium-3.5-128B • 128B • Updated • 384k • 347
mistralai/Mistral-Small-4-119B-2603-NVFP4 • Updated • 1.26k • 93
mistralai/Mistral-Medium-3.5-128B-EAGLE • Updated • 409 • 41
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix • Text Generation • 31B • Updated • 1.56k • 5
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B • Updated • 1.29M • 43
QuantTrio/Qwen3.6-27B-AWQ-6Bit • Image-Text-to-Text • 28B • Updated • 30.2k • 10
bartowski/mistralai_Mistral-Medium-3.5-128B-GGUF • Image-Text-to-Text • 125B • Updated • 7.72k • 8
RohitUltimate/Qwen3.5-2B_20K • Image-Text-to-Text • 2B • Updated • 11 • 1
model-scope/glm-4-9b-chat-GPTQ-Int4 • Text Generation • 9B • Updated • 69 • 6
model-scope/glm-4-9b-chat-GPTQ-Int8 • Text Generation • 9B • Updated • 8 • 2
tclf90/qwen2.5-72b-instruct-gptq-int4 • Text Generation • 73B • Updated • 67 • 2
tclf90/qwen2.5-72b-instruct-gptq-int3 • Text Generation • 69B • Updated • 61
prithivMLmods/Nu2-Lupi-Qwen-14B • Text Generation • 15B • Updated • 6 • 2
mradermacher/Nu2-Lupi-Qwen-14B-GGUF • 15B • Updated • 171 • 1
mradermacher/Nu2-Lupi-Qwen-14B-i1-GGUF • 15B • Updated • 452 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int4 • Text Generation • 0.6B • Updated • 217 • 1
JunHowie/Qwen3-0.6B-GPTQ-Int8 • Text Generation • 0.6B • Updated • 10
JunHowie/Qwen3-1.7B-GPTQ-Int4 • Text Generation • 2B • Updated • 2.67k • 1
JunHowie/Qwen3-1.7B-GPTQ-Int8 • Text Generation • 2B • Updated • 21
JunHowie/Qwen3-32B-GPTQ-Int4 • Text Generation • 33B • Updated • 26.2k • 4
JunHowie/Qwen3-32B-GPTQ-Int8 • Text Generation • 33B • Updated • 413 • 4
JunHowie/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B • Updated • 21 • 1
JunHowie/Qwen3-14B-GPTQ-Int8 • Text Generation • 15B • Updated • 73 • 1
JunHowie/Qwen3-14B-GPTQ-Int4 • Text Generation • 15B • Updated • 45.8k • 4
JunHowie/Qwen3-8B-GPTQ-Int8 • Text Generation • 8B • Updated • 705
JunHowie/Qwen3-8B-GPTQ-Int4 • Text Generation • 8B • Updated • 313 • 4
JunHowie/Qwen3-4B-GPTQ-Int4 • Text Generation • 4B • Updated • 2.13k • 1
JunHowie/Qwen3-4B-GPTQ-Int8 • Text Generation • 4B • Updated • 29
JunHowie/Qwen3-30B-A3B-GPTQ-Int8 • Text Generation • 8B • Updated • 100