Inference Providers
Active filters: fp4
RedHatAI/Mistral-Small-3.2-24B-Instruct-2506-NVFP4
Text Generation
• 14B • Updated • 7.71k
• 7
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-NVFP4
Text Generation
• 229B • Updated • 810
• 5
RedHatAI/Qwen3-235B-A22B-NVFP4
Text Generation
• 136B • Updated • 40
• 1
RedHatAI/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
• 136B • Updated • 594
• 4
prithivMLmods/Nanonets-OCR2-3B-AWQ-nvfp4
Image-Text-to-Text
• 3B • Updated • 26
eousphoros/DeepSeek-V3.2-NVFP4
Text Generation
• 387B • Updated • 21
• 5
trithemius/Velvet-14B-nvfp4
8B • Updated Text Generation
• 199B • Updated • 170
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation
• 23B • Updated • 26
Shifusen/L3.3-70B-Magnum-v4-SE-NVFP4
Text Generation
• 41B • Updated • 9
Firworks/Snowpiercer-15B-v4-nvfp4
9B • Updated • 1
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
• 18B • Updated • 10.6k
• 14
Shifusen/Strawberrylemonade-L3-70B-v1.2-NVFP4
Text Generation
• 41B • Updated • 5
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 2B • Updated • 5
• 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 3B • Updated • 8
• 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 5B • Updated • 108
• 2
Shifusen/72B-Qwen2.5-Kunou-v1-NVFP4
Text Generation
• 42B • Updated • 15
Shifusen/L3.3-The-Omega-Directive-70B-Unslop-v2.1-NVFP4
Text Generation
• 41B • Updated • 1
Shifusen/Forgotten-Safeword-70B-v5.0-NVFP4
Text Generation
• 41B • Updated • 7
• 1
Shifusen/Draconic-Tease-70B-NVFP4
41B • Updated cybermotaz/Qwen3-VL-32B-Instruct-NVFP4
Image-Text-to-Text
• 18B • Updated • 46
Shifusen/dolphin-2.9.1-llama-3-70b-NVFP4-vllm
41B • Updated • 1
nvidia/Qwen3-VL-235B-A22B-Instruct-NVFP4
119B • Updated • 824
Shifusen/Negative_LLAMA_70B-NVFP4
Text Generation
• 41B • Updated • 1
• 1
Shifusen/L3.3-70B-PippaMaid-1.0-NVFP4
Text Generation
• 41B • Updated • 9
ussoewwin/Hybrid-Sensitivity-Weighted-Quantization-SDXL-fp8e4m3
Text-to-Image
• Updated • 6
GadflyII/Qwen3-VL-235B-A22B-Instruct-NVFP4
Image-Text-to-Text
• 133B • Updated • 10
GadflyII/Qwen3-VL-235B-A22B-Thinking-NVFP4
Image-Text-to-Text
• 133B • Updated • 256
Firworks/Llama-3.3-8B-Instruct-nvfp4
5B • Updated • 11
Firworks/Cydonia-24B-v4.3-heretic-nvfp4
14B • Updated • 3