
Mukul

mtcl

AI & ML interests

None yet

Organizations

None yet

New activity in nvidia/nemotron-3.5-asr-streaming-0.6b 2 days ago

vllm support ?

#6 opened 12 days ago by

New activity in DevQuasar/MiniMaxAI.MiniMax-M3-GGUF 4 days ago

thank you for ggufs

#1 opened 4 days ago by

New activity in unsloth/MiniMax-M3-GGUF 5 days ago

tool call parser and reasoning parser

#4 opened 5 days ago by

New activity in cyankiwi/gemma-4-12B-it-qat-AWQ-INT4 10 days ago

Vllm and SgLang command please

#1 opened 10 days ago by

New activity in nvidia/DeepSeek-V4-Pro-NVFP4 13 days ago

nvidia/DeepSeek-V4-flash-NVFP4

#1 opened 21 days ago by

New activity in canada-quant/DeepSeek-V4-Flash-NVFP4-FP8-MTP 20 days ago

Docker Image

#1 opened 21 days ago by

New activity in unsloth/DeepSeek-V4-Flash 21 days ago

Worse than (smaller) MiniMax M2.7??

#2 opened about 2 months ago by deleted

New activity in deepseek-ai/DeepSeek-V4-Flash about 1 month ago

Unable to run on 2x RTX Pro 6000 (DEEP_GEMM problem)

#15 opened about 2 months ago by

New activity in mistralai/Mistral-Medium-3.5-128B about 1 month ago

Running on 2 RTX Pro 6000 Blackwell GPUs at ~30 tps (Instructions that worked for me)

#17 opened about 2 months ago by

New activity in RedHatAI/DeepSeek-V4-Flash-NVFP4-FP8 about 1 month ago

2x Nvidia 6000 Pros

#2 opened about 2 months ago by

New activity in lukealonso/MiMo-V2.5-NVFP4 about 2 months ago

Will it work on 2X6000 Pros

#1 opened about 2 months ago by

New activity in Intel/DeepSeek-V4-Flash-W4A16-AutoRound about 2 months ago

Can I deploy it with sglang at my 8*4090 ubuntu server?

#1 opened about 2 months ago by

New activity in nvidia/MiniMax-M2.7-NVFP4 about 2 months ago

Context Length for 2X6000 Pros (2x96 = 192GB VRAM)

#2 opened about 2 months ago by

New activity in ubergarm/Kimi-K2.6-GGUF about 2 months ago

really awesome speeds! running at 256k context.

#11 opened about 2 months ago by

New activity in Qwen/Qwen3.6-27B about 2 months ago

MOE 122b and 397b please!

#7 opened about 2 months ago by

New activity in ubergarm/Kimi-K2.6-GGUF about 2 months ago

How to disable thinking?

#9 opened about 2 months ago by

New activity in demon-zombie/MiniMax-M2.7-AWQ-4bit about 2 months ago

These are NOT actual AWQ-quantized models.

#1 opened 2 months ago by

New activity in NinjaBoffin/MiniMax-M2.7-NVFP4 about 2 months ago

max context

#2 opened about 2 months ago by

New activity in ubergarm/Kimi-K2.6-GGUF about 2 months ago

No think tags.

#4 opened about 2 months ago by

New activity in nvidia/MiniMax-M2.5-NVFP4 about 2 months ago

Minimax M2.7 NVFP4

#4 opened 2 months ago by