-
-
-
-
-
-
Inference Providers
Active filters: nm-vllm
RedHatAI/TinyLlama-1.1B-Chat-v1.0-pruned2.4
Text Generation
• Updated
• 20
• 1
RedHatAI/MiniChat-2-3B-pruned2.4
Text Generation
• Updated
• 4
RedHatAI/OpenHermes-2.5-Mistral-7B-pruned2.4
Text Generation
• Updated
• 9
RedHatAI/OpenHermes-2.5-Mistral-7B-pruned50
Text Generation
• Updated
• 39
• 1
RedHatAI/Nous-Hermes-2-SOLAR-10.7B-pruned2.4
Text Generation
• Updated
• 1
RedHatAI/Nous-Hermes-2-Yi-34B-pruned2.4
Text Generation
• Updated
• 2
RedHatAI/Nous-Hermes-2-Yi-34B-pruned50
Text Generation
• Updated
• 1
RedHatAI/zephyr-7b-beta-marlin
Text Generation
• 1B • Updated
• 23
RedHatAI/llama2.c-stories110M-pruned2.4
Text Generation
• Updated
• 5
RedHatAI/llama2.c-stories110M-pruned50
Text Generation
• Updated
• 1.05k
Text Generation
• 3B • Updated
• 1
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
• 0.3B • Updated
• 316
• 2
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
• 1B • Updated
• 90
• 2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
• 5B • Updated
• 3
• 5
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
• 10B • Updated
• 1
softmax/falcon-180B-chat-marlin
Text Generation
• 26B • Updated
• 3
dtransposed/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
• Updated
• 3
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-GGUF
11B • Updated
• 95
mradermacher/Nous-Hermes-2-SOLAR-10.7B-pruned2.4-i1-GGUF
11B • Updated
• 258
tensorblock/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated
• 41
mradermacher/phi-2-pruned50-GGUF
3B • Updated
• 169
mradermacher/llama2.c-stories110M-pruned50-GGUF
0.1B • Updated
• 86
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-GGUF
7B • Updated
• 31
• 1
mradermacher/MiniChat-2-3B-pruned2.4-GGUF
3B • Updated
• 52
mradermacher/OpenHermes-2.5-Mistral-7B-pruned50-i1-GGUF
7B • Updated
• 60
mradermacher/llama2.c-stories110M-pruned50-i1-GGUF
0.1B • Updated
• 84
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
7B • Updated
• 46
mradermacher/OpenHermes-2.5-Mistral-7B-pruned2.4-i1-GGUF
7B • Updated
• 93
tensorblock/OpenHermes-2.5-Mistral-7B-pruned2.4-GGUF
tensorblock/OpenHermes-2.5-Mistral-7B-pruned50-GGUF