adamjweintraut commited on
Commit
acd98c3
·
1 Parent(s): 4c26888

adamjweintraut/bart-finetuned-eli5_precomputed_best_256

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ rouge_eval_2023-12-08_run.csv filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.1839
19
 
20
  ## Model description
21
 
@@ -35,8 +35,8 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 5e-05
38
- - train_batch_size: 2
39
- - eval_batch_size: 2
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
@@ -44,28 +44,13 @@ The following hyperparameters were used during training:
44
 
45
  ### Training results
46
 
47
- | Training Loss | Epoch | Step | Validation Loss |
48
- |:-------------:|:-----:|:-----:|:---------------:|
49
- | 1.4316 | 0.1 | 500 | 1.2440 |
50
- | 1.2915 | 0.2 | 1000 | 1.2470 |
51
- | 1.3313 | 0.3 | 1500 | 1.2286 |
52
- | 1.2829 | 0.4 | 2000 | 1.2258 |
53
- | 1.1722 | 0.5 | 2500 | 1.2660 |
54
- | 1.1883 | 0.6 | 3000 | 1.2323 |
55
- | 1.2833 | 0.7 | 3500 | 1.2372 |
56
- | 1.264 | 0.8 | 4000 | 1.2048 |
57
- | 1.1157 | 0.9 | 4500 | 1.2108 |
58
- | 1.1908 | 1.0 | 5000 | 1.2033 |
59
- | 1.1009 | 1.1 | 5500 | 1.2196 |
60
- | 1.1563 | 1.2 | 6000 | 1.2087 |
61
- | 1.1294 | 1.3 | 6500 | 1.2080 |
62
- | 1.0821 | 1.4 | 7000 | 1.2225 |
63
- | 1.1232 | 1.5 | 7500 | 1.1925 |
64
- | 1.1462 | 1.6 | 8000 | 1.1941 |
65
- | 1.019 | 1.7 | 8500 | 1.2010 |
66
- | 1.0976 | 1.8 | 9000 | 1.1831 |
67
- | 1.1216 | 1.9 | 9500 | 1.1868 |
68
- | 1.0345 | 2.0 | 10000 | 1.1839 |
69
 
70
 
71
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.8045
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 5e-05
38
+ - train_batch_size: 8
39
+ - eval_batch_size: 8
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
 
44
 
45
  ### Training results
46
 
47
+ | Training Loss | Epoch | Step | Validation Loss |
48
+ |:-------------:|:-----:|:----:|:---------------:|
49
+ | 2.0094 | 0.4 | 500 | 1.8642 |
50
+ | 1.808 | 0.8 | 1000 | 1.8719 |
51
+ | 1.7532 | 1.2 | 1500 | 1.8353 |
52
+ | 1.7879 | 1.6 | 2000 | 1.8151 |
53
+ | 1.7312 | 2.0 | 2500 | 1.8045 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
54
 
55
 
56
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bc3eb9d058344fc91e8244c2807a730238c5f7ef6fd2c313bbe1aade454fc9e4
3
  size 1625426996
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3cda06bb1698c83b59a937463733266a5874063058c1f345556a0569c2326c7
3
  size 1625426996
rouge_eval_2023-12-08_run.csv ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8886fff402a295b514f9814d697d8e02b988af87ca453a36c69c9d36aa81f790
3
+ size 15639483
runs/Dec09_03-19-26_e8e0be95dd85/events.out.tfevents.1702091966.e8e0be95dd85.1299.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b38a5d289f3f2ec268801a25718ab399564d09dcab0ac32e5531efde23488551
3
+ size 5721
runs/Dec09_04-48-05_e8e0be95dd85/events.out.tfevents.1702097285.e8e0be95dd85.10313.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58d18547866016abeb03fd23e0ae95cb1078f9b279ab25fbdbf5fef3447e8c0e
3
+ size 8215
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7b428a4c98af614923446ce14db02a9fb458df8b039f8dd14efbb4ae35214fe3
3
  size 4984
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:01f52467614696ecdc3e88b6adeeace617cb32c214abc46b8e28a432dfd73d3b
3
  size 4984