dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit
The Model dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit was converted to MLX format from Qwen/Qwen2.5-VL-7B-Instruct-AWQ using mlx-vlm version 0.1.15.
pip install -U mlx-vlm
python -m mlx_vlm.generate --model dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit --max-tokens 100 --temp 0.0 --prompt "Describe this image." --image <path_to_image>
- Downloads last month
- 6
Model size
1B params
Tensor type
F16
·
U32
·
Hardware compatibility
Log In
to add your hardware
4-bit
Model tree for dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit
Base model
Qwen/Qwen2.5-VL-7B-Instruct
Quantized
Qwen/Qwen2.5-VL-7B-Instruct-AWQ