nvidia
/

Qwen2.5-VL-7B-Instruct-NVFP4

Text Generation

Model Optimizer

8-bit precision

Model card Files Files and versions

Qwen2.5-VL-7B-Instruct-NVFP4 / hf_quant_config.json

zhiyucheng's picture

Use actual module path in ignore (#2)

d13bb1f verified about 17 hours ago

history blame contribute delete

318 Bytes

	{
	"producer": {
	"name": "modelopt",
	"version": "0.37.0.dev16+ga6fa34cda.d20250909"
	},
	"quantization": {
	"quant_algo": "NVFP4",
	"kv_cache_quant_algo": null,
	"group_size": 16,
	"exclude_modules": [
	"visual*",
	"lm_head"
	]
	}
	}