Is it possible to make smaller NVFP4 quant at 340-360GB to fit in 4x96gb?
👍 1
68
#1 opened 3 months ago
by
Fernanda24
How did you bypass deepseek-v32 not recognized in Tranformers?
3
#3 opened 3 months ago
by
Fernanda24
Wondering how could you tested this with 2xRTX 6000 pro
6
#2 opened 3 months ago
by
csabakecskemeti