Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bdbj
/
Llama-3.1-8b-qpal-df-msq-fig1d
like
0
arxiv:
2509.20214
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Model Card
How to run
References
Model Card
Base model:
meta-llama/Llama-3.1-8B
Quantization method: Latency constrained fusion-aware MSQ with Q-Palette
Backend kernel: Q-Palette kernel
Calibration data: N/A (data-free)
See Figure 1 (d) of our paper.
How to run
Follow the instruction in
https://github.com/snu-mllab/Q-Palette
.
References
Model Paper
Downloads last month
-
Downloads are not tracked for this model.
How to track
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for
bdbj/Llama-3.1-8b-qpal-df-msq-fig1d
Base model
meta-llama/Llama-3.1-8B
Quantized
(
297
)
this model
Collection including
bdbj/Llama-3.1-8b-qpal-df-msq-fig1d
Data-free quantization w/ Q-Palette
Collection
30 items
โข
Updated
26 days ago