doberst commited on
Commit
9825d57
·
verified ·
1 Parent(s): da50bba

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ inference: false
4
+ base_model: Qwen/Qwen3-8b-ov
5
+ base_model_relation: quantized
6
+ tags: [green, llmware-chat, p8, ov, emerald]
7
+ ---
8
+
9
+ # qwen3-8b-ov
10
+
11
+ **qwen3-8b-ov** is an OpenVino int4 quantized version of [Qwen3-8B](https://www.huggingface.co/Qwen/Qwen3-8B), providing a very fast, very small inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
12
+
13
+ This is from the latest release series from Qwen.
14
+
15
+ ### Model Description
16
+
17
+ - **Developed by:** Qwen
18
+ - **Quantized by:** llmware
19
+ - **Model type:** qwen3
20
+ - **Parameters:** 8 billion
21
+ - **Model Parent:** Qwen/Qwen3-8B
22
+ - **Language(s) (NLP):** English
23
+ - **License:** Apache 2.0
24
+ - **Uses:** Chat, general-purpose LLM
25
+ - **Quantization:** int4
26
+
27
+
28
+ ## Model Card Contact
29
+
30
+ [llmware on github](https://www.github.com/llmware-ai/llmware)
31
+
32
+ [llmware on hf](https://www.huggingface.co/llmware)
33
+
34
+ [llmware website](https://www.llmware.ai)