Inference Providers
Active filters: fp4
nvidia/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 235k
• 85
Text Generation
• 435B • Updated • 25.3k
• 18
Text Generation
• 183B • Updated • 2.57k
• 12
tonera/FLUX.2-klein-9B-Nunchaku
Image-to-Image
• Updated • 171
• 5
chankhavu/Nemotron-Cascade-2-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 9.65k
• 8
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 1.65k
• 10
txn545/Qwen3.5-122B-A10B-NVFP4
Text Generation
• 64B • Updated • 266k
• 23
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 149k
• 27
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 1.08k
• 8
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
• 241B • Updated • 1.99k
• 10
Image-Text-to-Text
• 8B • Updated • 3.18k
• 4
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
• 133B • Updated • 5.12k
• 15
nvidia/Llama-3.1-8B-Instruct-NVFP4
5B • Updated • 100k
• 8
Text Generation
• 5B • Updated • 32.1k
• 16
Text Generation
• 8B • Updated • 368k
• 6
Text Generation
• 17B • Updated • 84.8k
• 14
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 23.9k
• 14
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 5.13k
• 30
cybermotaz/Qwen3-Omni-30B-A3B-Instruct-NVFP4
Text Generation
• Updated • 6
ussoewwin/HSWQ-Z-Image-fp8e4m3
Text-to-Image
• Updated • 2
tacos4me/Step-3.5-Flash-NVFP4
Text Generation
• 111B • Updated • 797
• 10
apolo13x/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 120k
• 29
lukealonso/Qwen3.5-397B-A17B-NVFP4
Text Generation
• Updated • 13.8k
• 6
tonera/Nepotism_xii-Nunchaku
Text-to-Image
• Updated • 42
• 1
mengqin1/RedidreamNSFWI1-bnb-4bit
Updated
Text Generation
• 19B • Updated • 1
• 3
qingcheng-ai/Qwen3-32B-fp4
Text Generation
• 19B • Updated • 87
• 4
qingcheng-ai/Qwen3-8B-fp4
Text Generation
• 5B • Updated • 10
• 1
RedHatAI/Qwen3-30B-A3B-NVFP4
Text Generation
• 17B • Updated • 26k
• 2
RedHatAI/Llama-3.1-70B-Instruct-NVFP4
Text Generation
• 41B • Updated • 1.9k