HarsimarSingh commited on
Commit
2e4560c
·
verified ·
1 Parent(s): c446eda

Model save

Browse files
README.md CHANGED
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 2.5755
18
  - Cer: 1.0
19
 
20
  ## Model description
@@ -34,39 +34,28 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - learning_rate: 3e-05
38
  - train_batch_size: 4
39
  - eval_batch_size: 1
40
  - seed: 42
41
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
42
  - lr_scheduler_type: linear
43
- - lr_scheduler_warmup_ratio: 0.1
44
- - num_epochs: 20
45
 
46
  ### Training results
47
 
48
- | Training Loss | Epoch | Step | Validation Loss | Cer |
49
- |:-------------:|:-----:|:------:|:---------------:|:------:|
50
- | 3.1527 | 1.0 | 5883 | 3.4368 | 1.0 |
51
- | 3.271 | 2.0 | 11766 | 3.2032 | 1.0001 |
52
- | 2.9587 | 3.0 | 17649 | 3.0627 | 1.0 |
53
- | 2.4732 | 4.0 | 23532 | 3.0279 | 1.0108 |
54
- | 2.1076 | 5.0 | 29415 | 2.5229 | 0.9997 |
55
- | 1.8924 | 6.0 | 35298 | 2.4476 | 1.0 |
56
- | 1.7687 | 7.0 | 41181 | 2.4754 | 1.0 |
57
- | 1.5841 | 8.0 | 47064 | 2.5346 | 1.0 |
58
- | 1.6036 | 9.0 | 52947 | 2.4699 | 1.0 |
59
- | 1.3961 | 10.0 | 58830 | 2.4823 | 1.0 |
60
- | 1.2846 | 11.0 | 64713 | 2.5168 | 1.0 |
61
- | 1.0196 | 12.0 | 70596 | 2.4514 | 1.0 |
62
- | 1.1475 | 13.0 | 76479 | 2.5678 | 1.0 |
63
- | 1.0415 | 14.0 | 82362 | 2.6223 | 1.0 |
64
- | 1.0472 | 15.0 | 88245 | 2.6513 | 1.0 |
65
- | 0.7244 | 16.0 | 94128 | 2.4963 | 1.0 |
66
- | 0.6422 | 17.0 | 100011 | 2.5385 | 1.0 |
67
- | 0.7878 | 18.0 | 105894 | 2.6305 | 1.0 |
68
- | 0.6919 | 19.0 | 111777 | 2.5720 | 1.0 |
69
- | 0.5415 | 20.0 | 117660 | 2.5755 | 1.0 |
70
 
71
 
72
  ### Framework versions
 
14
 
15
  This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 1.3457
18
  - Cer: 1.0
19
 
20
  ## Model description
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - learning_rate: 0.0001
38
  - train_batch_size: 4
39
  - eval_batch_size: 1
40
  - seed: 42
41
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 10
 
44
 
45
  ### Training results
46
 
47
+ | Training Loss | Epoch | Step | Validation Loss | Cer |
48
+ |:-------------:|:-----:|:-----:|:---------------:|:---:|
49
+ | 0.8961 | 1.0 | 5883 | 1.6239 | 1.0 |
50
+ | 0.7295 | 2.0 | 11766 | 1.4693 | 1.0 |
51
+ | 0.5127 | 3.0 | 17649 | 1.4575 | 1.0 |
52
+ | 0.3661 | 4.0 | 23532 | 1.4469 | 1.0 |
53
+ | 0.3952 | 5.0 | 29415 | 1.4075 | 1.0 |
54
+ | 0.2733 | 6.0 | 35298 | 1.4193 | 1.0 |
55
+ | 0.1959 | 7.0 | 41181 | 1.3834 | 1.0 |
56
+ | 0.1422 | 8.0 | 47064 | 1.4467 | 1.0 |
57
+ | 0.0834 | 9.0 | 52947 | 1.3748 | 1.0 |
58
+ | 0.0703 | 10.0 | 58830 | 1.3457 | 1.0 |
 
 
 
 
 
 
 
 
 
 
59
 
60
 
61
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:51dceadbbe19ede77dc752d07d8c049bc42f60acb166958b908591bda3bc94a1
3
  size 1337035896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8123186a490ad16d35e0751bec84f55645f865c6c229c54c3fe9cbe5af96f76a
3
  size 1337035896
runs/Jan09_23-06-16_DESKTOP-0UF7HQR/events.out.tfevents.1736444176.DESKTOP-0UF7HQR.14296.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e6fa330222188beaeed2761d05181a2ca99b795be589962e7e60f5f05d701602
3
- size 211623
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bce6c8e7013e3839afba9980951cb97cce7f431f96106f10cb14d8fb99f43e2
3
+ size 212307