bunbohue committed
Commit e875ea1 · 1 Parent(s): 3af7c24

End of training

README.md ADDED
@@ -0,0 +1,68 @@
+ ---
+ license: apache-2.0
+ base_model: facebook/bart-large
+ tags:
+ - generated_from_trainer
+ metrics:
+ - rouge
+ model-index:
+ - name: bart-large_readme_summarization
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # bart-large_readme_summarization
+
+ This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 1.8286
+ - Rouge1: 0.5485
+ - Rouge2: 0.4096
+ - Rougel: 0.5242
+ - Rougelsum: 0.524
+ - Gen Len: 15.1271
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 2
+ - eval_batch_size: 2
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 4
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+ | 2.1578 | 1.0 | 2916 | 1.9917 | 0.489 | 0.3382 | 0.4619 | 0.4618 | 15.9544 |
+ | 1.5841 | 2.0 | 5832 | 1.8486 | 0.5197 | 0.3778 | 0.4948 | 0.4942 | 15.0384 |
+ | 1.2896 | 3.0 | 8748 | 1.8169 | 0.5445 | 0.3982 | 0.5188 | 0.5192 | 13.994 |
+ | 1.0315 | 4.0 | 11664 | 1.8286 | 0.5485 | 0.4096 | 0.5242 | 0.524 | 15.1271 |
+
+
+ ### Framework versions
+
+ - Transformers 4.35.0
+ - Pytorch 2.1.0+cu118
+ - Datasets 2.14.6
+ - Tokenizers 0.14.1
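As a rough cross-check of the model card above, the following sketch derives the training-set size implied by 2916 steps per epoch at `train_batch_size: 2`, and the learning rate a linear schedule would reach at each epoch boundary. It assumes a single device, no gradient accumulation, and zero warmup steps, none of which the card states. The helper names `approx_train_examples` and `linear_lr` are illustrative and not part of any library.

```python
# Values copied from the model card in README.md.
LEARNING_RATE = 2e-05
TRAIN_BATCH_SIZE = 2
NUM_EPOCHS = 4
STEPS_PER_EPOCH = 2916  # step count logged at epoch 1.0
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def approx_train_examples(steps_per_epoch: int, batch_size: int) -> int:
    """Upper bound on the number of training examples seen per epoch.

    Assumes one device and no gradient accumulation (not stated in the card).
    The last batch may be partial, so the true count can be up to
    batch_size - 1 examples lower.
    """
    return steps_per_epoch * batch_size


def linear_lr(step: int, base_lr: float, total_steps: int) -> float:
    """Learning rate under linear decay to zero, assuming zero warmup steps."""
    remaining = max(0, total_steps - step)
    return base_lr * remaining / total_steps


if __name__ == "__main__":
    print("total steps:", TOTAL_STEPS)
    print("approx. examples per epoch:",
          approx_train_examples(STEPS_PER_EPOCH, TRAIN_BATCH_SIZE))
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        lr = linear_lr(step, LEARNING_RATE, TOTAL_STEPS)
        print(f"epoch {epoch}: step {step}, lr {lr:.2e}")
```

Under these assumptions the total step count matches the final row of the results table (11664). The table also shows that validation loss is lowest at epoch 3 (1.8169), while the ROUGE scores are highest at epoch 4.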
generation_config.json ADDED
@@ -0,0 +1,12 @@
+ {
+   "bos_token_id": 0,
+   "decoder_start_token_id": 2,
+   "early_stopping": true,
+   "eos_token_id": 2,
+   "forced_bos_token_id": 0,
+   "forced_eos_token_id": 2,
+   "no_repeat_ngram_size": 3,
+   "num_beams": 4,
+   "pad_token_id": 1,
+   "transformers_version": "4.35.0"
+ }
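The `no_repeat_ngram_size: 3` setting above prevents any trigram from appearing twice in a generated sequence. The sketch below illustrates that rule on plain token-ID lists. It is an illustrative re-implementation of the idea, not the Transformers library code, and `banned_next_tokens` is a hypothetical helper name.

```python
from typing import List, Set


def banned_next_tokens(generated: List[int], ngram_size: int) -> Set[int]:
    """Return the tokens that would complete an n-gram already in `generated`."""
    if ngram_size <= 0 or len(generated) < ngram_size:
        # No complete n-gram exists yet, so nothing can be repeated.
        return set()
    # The last (n - 1) tokens are the prefix of the n-gram about to be completed.
    prefix = tuple(generated[len(generated) - (ngram_size - 1):])
    banned: Set[int] = set()
    for i in range(len(generated) - ngram_size + 1):
        ngram = generated[i:i + ngram_size]
        if tuple(ngram[:-1]) == prefix:
            banned.add(ngram[-1])
    return banned


if __name__ == "__main__":
    # The sequence ends in (5, 6), and the trigram (5, 6, 7) already occurred,
    # so token 7 must not be generated next.
    print(banned_next_tokens([5, 6, 7, 8, 5, 6], ngram_size=3))
```

With `num_beams: 4`, a rule like this would apply to each beam's own token history at every decoding step.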
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3cf505f057cd2d26da7b37f15fc9c5e9eba6f20e05dac5620239585dd4fee69c
+ oid sha256:22a1173802997796d5c8d06a1aefab9787a15fbd7f97492a4898cb35385941a7
  size 1625426996
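The `model.safetensors` change only touches a Git LFS pointer: a small text file of `key value` lines that stands in for the real weights. A minimal sketch of reading one, using the new pointer from this commit, follows. `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any library.

```python
from typing import Dict

# New-side pointer contents copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:22a1173802997796d5c8d06a1aefab9787a15fbd7f97492a4898cb35385941a7
size 1625426996
"""


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields: Dict[str, str] = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


pointer = parse_lfs_pointer(POINTER_TEXT)
size_bytes = int(pointer["size"])
hash_algo, _, digest = pointer["oid"].partition(":")

if __name__ == "__main__":
    print("hash algorithm:", hash_algo)
    print("digest length (hex chars):", len(digest))
    print(f"size: {size_bytes / 2**30:.2f} GiB")
```

The size is identical before and after the commit, so only the weight values changed, not the tensor layout. Assuming float32 weights (which the diff does not state) and ignoring the safetensors header, 1625426996 bytes corresponds to roughly 406 million values.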
runs/Nov09_08-38-54_fd617d0ef83f/events.out.tfevents.1699519142.fd617d0ef83f.826.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a268e19c3115551ba0b0d160581260c2f98bfc497558faf7c7c8c74f99224289
- size 10613
+ oid sha256:4402a35f36a65395d58f19743e79899a4983691b618a5b8102bb2d8bc633aca6
+ size 11492