openbmb
/

MiniCPM-Llama3-V-2_5-int4

Visual Question Answering

feature-extraction

4-bit precision

Model card Files Files and versions

finalf0 commited on May 20, 2024

Commit

91abf6f

·

verified ·

1 Parent(s): d6b3c68

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -2,8 +2,10 @@
 pipeline_tag: visual-question-answering
 ---
-## MiniCPM-Llama3-V 2.5
-More detail about [MiniCPM-Llama3-V 2.5](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5).
 ## Usage
 Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.10：

 pipeline_tag: visual-question-answering
 ---
+## MiniCPM-Llama3-V 2.5 int4
+This is the int4 quantized version of [MiniCPM-Llama3-V 2.5](https://huggingface.co/openbmb/MiniCPM-Llama3-V-2_5).
+Running with int4 version would use lower GPU mermory (about 9GB).
 ## Usage
 Inference using Huggingface transformers on NVIDIA GPUs. Requirements tested on python 3.10：