dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit

The Model dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit was converted to MLX format from Qwen/Qwen2.5-VL-7B-Instruct-AWQ using mlx-vlm version 0.1.15.

pip install -U mlx-vlm

python -m mlx_vlm.generate --model dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit --max-tokens 100 --temp 0.0 --prompt "Describe this image." --image <path_to_image>

Downloads last month: 6

Safetensors

Model size

1B params

Tensor type

F16

U32

MLX

Hardware compatibility

4-bit

Model tree for dicksonhk/Qwen2.5-VL-7B-Instruct-AWQ-mlx-4Bit

Base model

Qwen/Qwen2.5-VL-7B-Instruct

Quantized

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Quantized

(1)

this model