Can we perform 4-bit quantization for the awq of the Step-3.5-Flash model? The VLLM can run it.

#3
by lsm03624 - opened

As the title says

cyankiwi org

Thank you for your interest. Step-3.5-Flash is in calibration, and its AWQ version should be available soon!

Sign up or log in to comment