fix typo
README.md
CHANGED
@@ -93,7 +93,7 @@ However, we do not recommend using them for tasks that are knowledge-intensive o
 | --------------------- | ----------------------------- |
 | **Total parameters** | 8.3B |
 | **Active parameters** | 1.5B |
-| **Layers** | 24 (
+| **Layers** | 24 (18 conv + 6 attn) |
 | **Context length** | 32,768 tokens |
 | **Vocabulary size** | 65,536 |
 | **Training precision**| Mixed BF16/FP8 |