huawei-csl/Qwen3-Next-80B-A3B-Instruct-3bit-SINQ
Text Generation
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding