-
-
-
-
-
-
Inference Providers
Active filters: FP8
duydq12/GLM-Z1-32B-0414-FP8-dynamic
Text Generation
• 33B • Updated
• 3
duydq12/nomic-embed-code-FP8-dynamic
Text Generation
• 8B • Updated
• 372
• 1
duydq12/Qwen2.5-Coder-1.5B-Instruct-FP8-dynamic
Text Generation
• 2B • Updated
• 2
duydq12/Qwen2.5-Coder-3B-Instruct-FP8-dynamic
Text Generation
• 3B • Updated
• 2
nvidia/Qwen3-235B-A22B-FP8
Text Generation
• 235B • Updated
• 631
• 3
Image-Text-to-Text
• 109B • Updated
• 1
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
• 15B • Updated
• 4
• 2
clarifai/Qwen3-Coder-30B-A3B-Instruct-FP8-Dynamic
Text Generation
• 31B • Updated
• 16
• 4
EliovpAI/Qwen3-0.6B-FP8-KV
Text Generation
• 0.6B • Updated
• 3
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated
• 21
• 4
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated
• 15.2k
• 4
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated
• 491
• 3
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated
• 2
Text Generation
• 8B • Updated
• 5.34k
• 3
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated
• 332
• 7
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
• Updated
• 105
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
• 236B • Updated
• 29
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8
Image-Text-to-Text
• 13B • Updated
• 11.4k
• 47
tokenlabsdotrun/Llama-3.1-8B-ModelOpt-FP8-QAT