vignesh2404/shawgpt-ft

Browse files

Files changed (3) hide show

README.md +72 -3
runs/Dec02_12-32-13_f1e0413fdd13/events.out.tfevents.1733142737.f1e0413fdd13.879.0 +3 -0
training_args.bin +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,72 @@
----
-license: apache-2.0
----

+---
+library_name: peft
+license: mit
+base_model: openai-community/gpt2
+tags:
+- generated_from_trainer
+model-index:
+- name: shawgpt-ft
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# shawgpt-ft
+This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.3263
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0002
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 16
+- optimizer: Use paged_adamw_8bit with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 2
+- num_epochs: 10
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 5.1488        | 0.9231 | 3    | 4.3509          |
+| 5.1158        | 1.8462 | 6    | 4.3486          |
+| 5.1306        | 2.7692 | 9    | 4.3430          |
+| 3.8451        | 4.0    | 13   | 4.3394          |
+| 5.143         | 4.9231 | 16   | 4.3357          |
+| 5.1448        | 5.8462 | 19   | 4.3339          |
+| 5.1081        | 6.7692 | 22   | 4.3313          |
+| 3.8084        | 8.0    | 26   | 4.3275          |
+| 5.0699        | 8.9231 | 29   | 4.3276          |
+| 3.4863        | 9.2308 | 30   | 4.3263          |
+### Framework versions
+- PEFT 0.13.2
+- Transformers 4.46.2
+- Pytorch 2.5.1+cu121
+- Datasets 3.1.0
+- Tokenizers 0.20.3

runs/Dec02_12-32-13_f1e0413fdd13/events.out.tfevents.1733142737.f1e0413fdd13.879.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:19ce41fddf093caecf29ba2f4d01c14f5a71c0a6c0836b9aa59637c5f52fd614
+size 10378

training_args.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:74377ff53feb22c465fb13cb90d81868d24266d3498bfe4e3aa29f45c89feec7
+size 5240