oracool commited on
Commit
3b7d203
·
1 Parent(s): d6f00a5

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,5 @@
1
  ---
2
- license: apache-2.0
3
- base_model: google/flan-t5-small
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -15,14 +14,14 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # myspace
17
 
18
- This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: nan
21
- - Rouge1: 0.0117
22
- - Rouge2: 0.0043
23
- - Rougel: 0.0114
24
- - Rougelsum: 0.0117
25
- - Gen Len: 18.5663
26
 
27
  ## Model description
28
 
@@ -42,8 +41,8 @@ More information needed
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
- - train_batch_size: 4
46
- - eval_batch_size: 4
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
@@ -54,7 +53,7 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
- | 0.0 | 1.0 | 861 | nan | 0.0117 | 0.0043 | 0.0114 | 0.0117 | 18.5663 |
58
 
59
 
60
  ### Framework versions
 
1
  ---
2
+ base_model: d0rj/rut5-base-summ
 
3
  tags:
4
  - generated_from_trainer
5
  metrics:
 
14
 
15
  # myspace
16
 
17
+ This model is a fine-tuned version of [d0rj/rut5-base-summ](https://huggingface.co/d0rj/rut5-base-summ) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.9404
20
+ - Rouge1: 0.29
21
+ - Rouge2: 0.1344
22
+ - Rougel: 0.2793
23
+ - Rougelsum: 0.2798
24
+ - Gen Len: 80.3965
25
 
26
  ## Model description
27
 
 
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 2e-05
44
+ - train_batch_size: 1
45
+ - eval_batch_size: 1
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
56
+ | 2.1069 | 1.0 | 3444 | 1.9404 | 0.29 | 0.1344 | 0.2793 | 0.2798 | 80.3965 |
57
 
58
 
59
  ### Framework versions
generation_config.json CHANGED
@@ -1,6 +1,10 @@
1
  {
2
  "decoder_start_token_id": 0,
3
- "eos_token_id": 1,
 
 
 
 
4
  "pad_token_id": 0,
5
  "transformers_version": "4.35.2"
6
  }
 
1
  {
2
  "decoder_start_token_id": 0,
3
+ "eos_token_id": 2,
4
+ "length_penalty": 0.6,
5
+ "max_length": 256,
6
+ "no_repeat_ngram_size": 2,
7
+ "num_beams": 10,
8
  "pad_token_id": 0,
9
  "transformers_version": "4.35.2"
10
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dbfbfb517f9679dc1b12cd63841e10502552c7bc932956a70d8cc4cf8e2a5a12
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5f8a7ba8001a2ff543ff5d97e3c15ba56b41fe8e583235c50e1b6c38ff03b23a
3
  size 891644712
runs/Dec09_19-18-26_b937ba0803e6/events.out.tfevents.1702149507.b937ba0803e6.2305.2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ab3ff2f873ae3a375c68c37100cb03b67d598bed48d12b09ad6a6074d3d18eb3
3
- size 6261
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a57e2da6c4d59715b9d70ed2add85023e34fb2c2e003f6887954d1f12e6ab94
3
+ size 7140