-
-
-
-
-
-
Inference Providers
Active filters: vLLM
Image-Text-to-Text
• 17B • Updated
• 295
• 19
QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation
• 36B • Updated
• 254
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated
• 136
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated
• 38
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated
• 4
• 3
amakhov/tiny-random-llama
Text Generation
• 4.18M • Updated
• 7
Text Generation
• 41B • Updated
• 2
• 2
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated
• 856
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated
• 7
• 1
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
• 684B • Updated
• 170
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated
• 75.4k
• 2
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated
• 12
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated
• 1.28k
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated
• 147
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated
• 1.88k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated
• 1
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated
• 93
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated
• 1
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated
• 2
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated
• 1
JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated
• 1
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation
• 236B • Updated
• 2.48k
• 13
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
• Updated
• 31
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
• 236B • Updated
• 662
• 8
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
• 236B • Updated
• 29
QuantTrio/DeepSeek-V3.2-Exp-AWQ
Text Generation
• 486B • Updated
• 57
• 4
QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation
• 685B • Updated
• 50
• 4
Text Generation
• 50B • Updated
• 140
• 5