End of training
Files changed:
- README.md +21 -20
- tokenizer_config.json +3 -1
README.md
CHANGED
@@ -1,5 +1,6 @@
 ---
 library_name: transformers
+license: apache-2.0
 base_model: google/flan-t5-large
 tags:
 - generated_from_trainer
@@ -19,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss:
-- Accuracy: 0.
-- Precision: 0.
-- Recall: 0.
-- F1 score: 0.
+- Loss: 2.1812
+- Accuracy: 0.7706
+- Precision: 0.7861
+- Recall: 0.7435
+- F1 score: 0.7642
 
 ## Model description
 
@@ -46,7 +47,7 @@ The following hyperparameters were used during training:
 - train_batch_size: 1
 - eval_batch_size: 1
 - seed: 42
-- optimizer:
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - num_epochs: 5
 
@@ -54,22 +55,22 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 score |
 |:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:--------:|
-| 1.
-|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 1.1443        | 0.4205 | 2500  | 1.6635          | 0.6718   | 0.7829    | 0.4753 | 0.5915   |
+| 1.0447        | 0.8410 | 5000  | 0.5585          | 0.7282   | 0.8149    | 0.5906 | 0.6849   |
+| 0.9057        | 1.2616 | 7500  | 0.9051          | 0.7318   | 0.7275    | 0.7412 | 0.7343   |
+| 0.8348        | 1.6821 | 10000 | 0.6307          | 0.7659   | 0.8742    | 0.6212 | 0.7263   |
+| 0.7331        | 2.1026 | 12500 | 0.9500          | 0.7612   | 0.7489    | 0.7859 | 0.7669   |
+| 0.6167        | 2.5231 | 15000 | 1.1524          | 0.7788   | 0.7970    | 0.7482 | 0.7718   |
+| 0.6209        | 2.9437 | 17500 | 1.1690          | 0.7635   | 0.7872    | 0.7224 | 0.7534   |
+| 0.4411        | 3.3642 | 20000 | 1.7563          | 0.7847   | 0.8438    | 0.6988 | 0.7645   |
+| 0.4196        | 3.7847 | 22500 | 1.7767          | 0.7412   | 0.7204    | 0.7882 | 0.7528   |
+| 0.292         | 4.2052 | 25000 | 2.0410          | 0.7624   | 0.7648    | 0.7576 | 0.7612   |
+| 0.1791        | 4.6257 | 27500 | 2.1812          | 0.7706   | 0.7861    | 0.7435 | 0.7642   |
 
 
 ### Framework versions
 
-- Transformers 4.
+- Transformers 4.48.3
 - Pytorch 2.3.0+cu121
-- Datasets 2.
-- Tokenizers 0.
+- Datasets 3.2.0
+- Tokenizers 0.21.0
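The hyperparameter list in the updated card maps directly onto `transformers.TrainingArguments`. Below is a minimal sketch reconstructing that configuration; the output directory name is a hypothetical placeholder, and the learning rate is not visible in this diff's hunks, so the library default is left in effect.

```python
from transformers import TrainingArguments

# Minimal sketch of the card's training hyperparameters.
# NOTE: the learning rate does not appear in the visible hunks,
# so this falls back to the Transformers default (5e-5).
training_args = TrainingArguments(
    output_dir="flan-t5-large-finetune",  # hypothetical name
    per_device_train_batch_size=1,        # train_batch_size: 1
    per_device_eval_batch_size=1,         # eval_batch_size: 1
    seed=42,                              # seed: 42
    optim="adamw_torch",                  # optimizer: adamw_torch
    adam_beta1=0.9,                       # betas=(0.9, 0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-8,                    # epsilon=1e-08
    lr_scheduler_type="linear",           # lr_scheduler_type: linear
    num_train_epochs=5,                   # num_epochs: 5
)
```

Note that `optim="adamw_torch"` with the betas and epsilon above simply restates the PyTorch AdamW defaults, which is why the card records "No additional optimizer arguments".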
tokenizer_config.json
CHANGED
@@ -1,4 +1,5 @@
 {
+  "add_prefix_space": null,
   "added_tokens_decoder": {
     "0": {
       "content": "<pad>",
@@ -927,9 +928,10 @@
     "<extra_id_98>",
     "<extra_id_99>"
   ],
-  "clean_up_tokenization_spaces":
+  "clean_up_tokenization_spaces": false,
   "eos_token": "</s>",
   "extra_ids": 100,
+  "extra_special_tokens": {},
   "model_max_length": 512,
   "pad_token": "<pad>",
   "sp_model_kwargs": {},
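The tokenizer change pins `clean_up_tokenization_spaces` to `false` and adds the `add_prefix_space` and `extra_special_tokens` fields that newer library versions serialize when saving. A minimal sketch of loading and running this checkpoint, using a hypothetical repo id since the diff does not name one:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Hypothetical checkpoint id; substitute the repo this commit belongs to.
checkpoint = "your-username/flan-t5-large-finetune"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

# clean_up_tokenization_spaces is now read from tokenizer_config.json,
# so decode() defaults to False rather than the old implicit True.
inputs = tokenizer("classify: an example input", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

With the flag pinned in the config, `decode()` no longer strips spaces before punctuation by default, keeping generated text stable across library versions.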