Bigheadjoshy commited on
Commit
81f1ad3
·
verified ·
1 Parent(s): 9592981

End of training

Browse files
Files changed (4) hide show
  1. README.md +25 -11
  2. config.json +5 -3
  3. model.safetensors +2 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -4,6 +4,9 @@ license: cc-by-nc-sa-4.0
4
  base_model: microsoft/layoutlmv3-base
5
  tags:
6
  - generated_from_trainer
 
 
 
7
  model-index:
8
  - name: layoutlmv3_document_classification
9
  results: []
@@ -16,14 +19,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [microsoft/layoutlmv3-base](https://huggingface.co/microsoft/layoutlmv3-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - eval_loss: 0.7203
20
- - eval_accuracy: 0.8538
21
- - eval_f1: 0.8446
22
- - eval_runtime: 58.2806
23
- - eval_samples_per_second: 25.24
24
- - eval_steps_per_second: 1.064
25
- - epoch: 7.3171
26
- - step: 1800
27
 
28
  ## Model description
29
 
@@ -49,12 +47,28 @@ The following hyperparameters were used during training:
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
  - lr_scheduler_warmup_ratio: 0.06
52
- - num_epochs: 10
53
  - mixed_precision_training: Native AMP
54
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55
  ### Framework versions
56
 
57
- - Transformers 4.48.2
58
  - Pytorch 2.5.1+cu124
59
- - Datasets 3.2.0
60
  - Tokenizers 0.21.0
 
4
  base_model: microsoft/layoutlmv3-base
5
  tags:
6
  - generated_from_trainer
7
+ metrics:
8
+ - accuracy
9
+ - f1
10
  model-index:
11
  - name: layoutlmv3_document_classification
12
  results: []
 
19
 
20
  This model is a fine-tuned version of [microsoft/layoutlmv3-base](https://huggingface.co/microsoft/layoutlmv3-base) on the None dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.6825
23
+ - Accuracy: 0.8626
24
+ - F1: 0.8556
 
 
 
 
 
25
 
26
  ## Model description
27
 
 
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
  - lr_scheduler_type: linear
49
  - lr_scheduler_warmup_ratio: 0.06
50
+ - num_epochs: 5
51
  - mixed_precision_training: Native AMP
52
 
53
+ ### Training results
54
+
55
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 |
56
+ |:-------------:|:------:|:----:|:---------------:|:--------:|:------:|
57
+ | 0.6422 | 0.4983 | 150 | 0.8329 | 0.8127 | 0.7921 |
58
+ | 0.6726 | 0.9967 | 300 | 0.7887 | 0.8310 | 0.8159 |
59
+ | 0.5329 | 1.4950 | 450 | 0.7981 | 0.8183 | 0.8055 |
60
+ | 0.5147 | 1.9934 | 600 | 0.7746 | 0.8360 | 0.8273 |
61
+ | 0.4119 | 2.4917 | 750 | 0.7384 | 0.8438 | 0.8329 |
62
+ | 0.4011 | 2.9900 | 900 | 0.7318 | 0.8465 | 0.8392 |
63
+ | 0.3469 | 3.4884 | 1050 | 0.7317 | 0.8488 | 0.8412 |
64
+ | 0.3148 | 3.9867 | 1200 | 0.7218 | 0.8548 | 0.8472 |
65
+ | 0.2974 | 4.4850 | 1350 | 0.6903 | 0.8620 | 0.8559 |
66
+ | 0.297 | 4.9834 | 1500 | 0.6825 | 0.8626 | 0.8556 |
67
+
68
+
69
  ### Framework versions
70
 
71
+ - Transformers 4.48.3
72
  - Pytorch 2.5.1+cu124
73
+ - Datasets 3.3.2
74
  - Tokenizers 0.21.0
config.json CHANGED
@@ -106,7 +106,8 @@
106
  "89": "LABEL_89",
107
  "90": "LABEL_90",
108
  "91": "LABEL_91",
109
- "92": "LABEL_92"
 
110
  },
111
  "initializer_range": 0.02,
112
  "input_size": 224,
@@ -204,7 +205,8 @@
204
  "LABEL_9": 9,
205
  "LABEL_90": 90,
206
  "LABEL_91": 91,
207
- "LABEL_92": 92
 
208
  },
209
  "layer_norm_eps": 1e-05,
210
  "max_2d_position_embeddings": 1024,
@@ -224,7 +226,7 @@
224
  "shape_size": 128,
225
  "text_embed": true,
226
  "torch_dtype": "float32",
227
- "transformers_version": "4.48.2",
228
  "type_vocab_size": 1,
229
  "visual_embed": true,
230
  "vocab_size": 50265
 
106
  "89": "LABEL_89",
107
  "90": "LABEL_90",
108
  "91": "LABEL_91",
109
+ "92": "LABEL_92",
110
+ "93": "LABEL_93"
111
  },
112
  "initializer_range": 0.02,
113
  "input_size": 224,
 
205
  "LABEL_9": 9,
206
  "LABEL_90": 90,
207
  "LABEL_91": 91,
208
+ "LABEL_92": 92,
209
+ "LABEL_93": 93
210
  },
211
  "layer_norm_eps": 1e-05,
212
  "max_2d_position_embeddings": 1024,
 
226
  "shape_size": 128,
227
  "text_embed": true,
228
  "torch_dtype": "float32",
229
+ "transformers_version": "4.48.3",
230
  "type_vocab_size": 1,
231
  "visual_embed": true,
232
  "vocab_size": 50265
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:610fbaac601b34398ec326194ea9ecf3310c8f1d2ac5aebd6483c650c2533fe8
3
- size 503982668
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7a4884c45d251ad8167c8e5668fa85ae0090a4783c4477809da974de2d648cc8
3
+ size 503985744
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:76f4fd1c90339fcbd74254e3d2c0112c4481691964b86166db203d717189a314
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b2f5911b50ef133334301d90d0075ec8a0b5e0d573f61bc6d5b27a2f9810b8b4
3
  size 5304