ZenPuzzle committed
Commit 5c4349b · 1 parent: 69b78ef

update model card README.md

Files changed (1)
  1. README.md +12 -5
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [ai-forever/rugpt3small_based_on_gpt2](https://huggingface.co/ai-forever/rugpt3small_based_on_gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.5235
+- Loss: 2.7020
 
 ## Model description
 
@@ -38,15 +38,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
+- num_epochs: 10
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 | 5 | 3.7794 |
-| No log | 2.0 | 10 | 3.5884 |
-| No log | 3.0 | 15 | 3.5235 |
+| No log | 1.0 | 5 | 3.7450 |
+| No log | 2.0 | 10 | 3.4688 |
+| No log | 3.0 | 15 | 3.2667 |
+| No log | 4.0 | 20 | 3.1067 |
+| No log | 5.0 | 25 | 2.9798 |
+| No log | 6.0 | 30 | 2.8797 |
+| No log | 7.0 | 35 | 2.8035 |
+| No log | 8.0 | 40 | 2.7485 |
+| No log | 9.0 | 45 | 2.7147 |
+| No log | 10.0 | 50 | 2.7020 |
 
 
 ### Framework versions
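
The validation losses in the updated table can be easier to compare as perplexities. The sketch below is not part of the commit. It assumes the reported loss is a mean per-token cross-entropy in nats, which is the usual convention for causal language model fine-tuning, so that perplexity is `exp(loss)`. The names `val_loss`, `perplexity`, and `deltas` are made up for this illustration.

```python
import math

# Validation losses copied from the updated "Training results" table (epochs 1-10).
val_loss = [3.7450, 3.4688, 3.2667, 3.1067, 2.9798,
            2.8797, 2.8035, 2.7485, 2.7147, 2.7020]

def perplexity(loss: float) -> float:
    """Perplexity, ASSUMING `loss` is a mean per-token cross-entropy in nats."""
    return math.exp(loss)

# Improvement from each epoch to the next; shrinking values mean the curve is flattening.
deltas = [prev - cur for prev, cur in zip(val_loss, val_loss[1:])]

for epoch, loss in enumerate(val_loss, start=1):
    print(f"epoch {epoch:2d}  loss {loss:.4f}  ppl {perplexity(loss):6.2f}")

# Final loss before this commit (3 epochs) versus after it (10 epochs).
print("ppl at old final loss 3.5235:", round(perplexity(3.5235), 2))
print("ppl at new final loss 2.7020:", round(perplexity(val_loss[-1]), 2))
print("last epoch-to-epoch improvement:", round(deltas[-1], 4))
```

Under that assumption, the change from 3 to 10 epochs takes the evaluation perplexity from about 33.9 to about 14.9. The per-epoch improvement shrinks steadily over the run, so further epochs under the same settings would likely bring smaller gains.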