Training in progress epoch 0

Files changed (6)

README.md ADDED

+---
+license: apache-2.0
+tags:
+- generated_from_keras_callback
+model-index:
+- name: Rocketknight1/distilgpt2-finetuned-wikitext2
+  results: []
+---
+<!-- This model card has been generated automatically according to the information Keras had access to. You should
+probably proofread and complete it, then remove this comment. -->
+# Rocketknight1/distilgpt2-finetuned-wikitext2
+This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Train Loss: 3.8582
+- Validation Loss: 3.6760
+- Epoch: 0
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
+- training_precision: float32
+### Training results
+| Train Loss | Validation Loss | Epoch |
+|:----------:|:---------------:|:-----:|
+| 3.8582     | 3.6760          | 0     |
+### Framework versions
+- Transformers 4.16.0.dev0
+- TensorFlow 2.8.0-rc0
+- Datasets 1.17.0
+- Tokenizers 0.11.0
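
Note on the reported losses: for a causal language model such as distilgpt2, loss values are often easier to compare as perplexity. The sketch below assumes the Keras-reported losses are mean per-token cross-entropy in nats, which the model card does not state explicitly. The `perplexity` helper is a hypothetical name used for illustration only.

```python
import math

# Losses copied from the model card for epoch 0.
train_loss = 3.8582
val_loss = 3.6760


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


train_ppl = perplexity(train_loss)
val_ppl = perplexity(val_loss)

print(f"train perplexity: {train_ppl:.2f}")
print(f"validation perplexity: {val_ppl:.2f}")
```

If the loss were instead averaged per sequence or measured in bits, the conversion would differ, so treat these numbers as indicative only.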

config.json CHANGED

@@ -39,7 +39,7 @@
       "max_length": 50
     }
   },
-  "transformers_version": "4.13.0.dev0",
+  "transformers_version": "4.16.0.dev0",
   "use_cache": true,
   "vocab_size": 50257
 }

logs/train/events.out.tfevents.1642694107.matt-TRX40-AORUS-PRO-WIFI.55731.0.v2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:578a34ede96296aa54f0c79158daf837e174089bbde6268d94596a149df1aecb
+size 1201176

logs/validation/events.out.tfevents.1642694231.matt-TRX40-AORUS-PRO-WIFI.55731.1.v2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:c119f58d33acc6a2e13e856ff7fa1b3a0ceeb6fb42b5c1b30a84e84a98aae6c4
+size 194

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4629114894f92b0fe9981e5e0a1c167366ecbaafeb2d12d4a38d5800789bae99
-size 327744824
+oid sha256:a0b95485418933705fc3d20bd869a10ff413f3113db04ff12ca6afc4f3520131
+size 327745496
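
Note on the pointer diff: `tf_model.h5` is stored with Git LFS, so the diff shows only the small pointer file (`version`, `oid`, `size`), not the weights themselves. As a minimal sketch, the pointers can be parsed as space-separated key/value lines to compare the two revisions. The `parse_lfs_pointer` helper is hypothetical and handles only the three keys that appear in this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>", split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the old and new sides of the diff.
old = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:4629114894f92b0fe9981e5e0a1c167366ecbaafeb2d12d4a38d5800789bae99
size 327744824
""")
new = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:a0b95485418933705fc3d20bd869a10ff413f3113db04ff12ca6afc4f3520131
size 327745496
""")

size_delta = new["size"] - old["size"]
print(f"size change: {size_delta} bytes")
print(f"content changed: {old['digest'] != new['digest']}")
```

The changed digest confirms that the stored weights differ between the two revisions, while the file size changes only slightly.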

tokenizer.json CHANGED

The diff for this file is too large to render.