Update README.md
README.md CHANGED
@@ -21,6 +21,34 @@ DistilGPT-OSS-qwen3-4B is a Qwen3 4B-2507 thinking fine tune, it supports up to
 
 Keep in mind, this is a community project and we are NOT related to qwen by Alibaba or to GPT-OSS by OpenAI.
 
+# Use cases & benefits
+
+Benefits of using this model over standard qwen3 4b thinking:
+
+- You decide how much it thinks (low, medium, or high)
+- A completely different style of answers (more similar to ChatGPT)
+- Produces fewer emoji (qwen3 4b uses quite a lot in its responses, which some may not like)
+- Less censored/limiting than qwen3 4b
+
+
+DistilGPT-OSS-qwen3-4B should be used for the following:
+
+- Efficient local, on-device assistance
+- Code generation
+- Summary generation
+- General use
+
+Or anything else.
+
+❌⚠️ It should ABSOLUTELY **not** be used for:
+
+- High-risk environments
+- Medical questions
+- Anything high-risk that requires 1:1 accuracy
+
+It is a small model, so its general knowledge is limited by its size.
+
+
 # Format
 This is the chat format of this model (you can also check the Jinja template file in "Files and versions"):
 ```
@@ -59,4 +87,4 @@ Keep in mind, these tests were done in LM Studio, GGUF q8_0 on a single consumer
 
 # Additional information
 
-The model was trained using unsloth on a mix of private and public datasets.
+The model was trained using unsloth on a mix of private and public datasets.
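For the "local, on-device assistance" use case and the low/medium/high thinking levels described above, a minimal llama-cpp-python sketch is shown below. The GGUF file name and the `Reasoning: medium` system prompt are illustrative assumptions, not the model's confirmed convention; the actual chat format is defined by the Jinja template in "Files and versions".

```python
# Minimal local-inference sketch (assumptions: llama-cpp-python is installed,
# the GGUF file name below is illustrative, and the "Reasoning: medium" system
# prompt is a hypothetical way to pick the low/medium/high thinking level;
# verify both against the model's Jinja chat template).
from llama_cpp import Llama

# Load the quantized model from a local path and set a context window.
llm = Llama(
    model_path="DistilGPT-OSS-qwen3-4B.q8_0.gguf",  # hypothetical local file name
    n_ctx=8192,
)

# Run one chat turn; the system message is the assumed thinking-level knob.
response = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "Reasoning: medium"},
        {"role": "user", "content": "Summarize the benefits of running LLMs locally in three bullets."},
    ],
    max_tokens=512,
    temperature=0.7,
)

# Print the assistant's reply from the chat-completion response.
print(response["choices"][0]["message"]["content"])
```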