brad1141/bert-finetuned-ner

Token Classification

Generated from Trainer


brad1141 committed on Mar 8, 2022

Commit 00e5fbc · 1 Parent(s): 12e4be0

update model card README.md

Files changed (1)

README.md +20 -18


@@ -1,5 +1,4 @@
 ---
-license: apache-2.0
 tags:
 - generated_from_trainer
 metrics:
@@ -17,13 +16,13 @@ should probably proofread and complete it, then remove this comment. -->
 # bert-finetuned-ner
-This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5907
-- Precision: 0.5789
-- Recall: 0.9167
-- F1: 0.7097
-- Accuracy: 0.8091
 ## Model description
@@ -42,28 +41,31 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.631         | 1.0   | 1747 | 0.5743          | 0.5789    | 0.9167 | 0.7097 | 0.7985   |
-| 0.5177        | 2.0   | 3494 | 0.5425          | 0.3810    | 0.8889 | 0.5333 | 0.8088   |
-| 0.4494        | 3.0   | 5241 | 0.5425          | 0.5652    | 0.9286 | 0.7027 | 0.8113   |
-| 0.3763        | 4.0   | 6988 | 0.5653          | 0.5882    | 0.9091 | 0.7143 | 0.8080   |
-| 0.335         | 5.0   | 8735 | 0.5907          | 0.5789    | 0.9167 | 0.7097 | 0.8091   |
 ### Framework versions
-- Transformers 4.16.1
 - Pytorch 1.10.0+cu111
-- Datasets 1.18.2
-- Tokenizers 0.11.0

 ---
 tags:
 - generated_from_trainer
 metrics:
 # bert-finetuned-ner
+This model is a fine-tuned version of [allenai/longformer-base-4096](https://huggingface.co/allenai/longformer-base-4096) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6434
+- Precision: 0.8589
+- Recall: 0.8686
+- F1: 0.8637
+- Accuracy: 0.8324
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
+- gradient_accumulation_steps: 8
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.615         | 1.0   | 1741 | 0.6111          | 0.8200    | 0.8652 | 0.8420 | 0.8046   |
+| 0.4795        | 2.0   | 3482 | 0.5366          | 0.8456    | 0.8803 | 0.8626 | 0.8301   |
+| 0.3705        | 3.0   | 5223 | 0.5412          | 0.8527    | 0.8786 | 0.8655 | 0.8339   |
+| 0.2749        | 4.0   | 6964 | 0.5906          | 0.8559    | 0.8711 | 0.8634 | 0.8316   |
+| 0.2049        | 5.0   | 8705 | 0.6434          | 0.8589    | 0.8686 | 0.8637 | 0.8324   |
 ### Framework versions
+- Transformers 4.17.0
 - Pytorch 1.10.0+cu111
+- Datasets 1.18.4
+- Tokenizers 0.11.6
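
The updated hyperparameters and metrics in this diff can be cross-checked with a little arithmetic. The sketch below is not part of the commit; it only reuses numbers copied from the new model card. It assumes a single training device, that the logged `Step` column counts optimizer updates, and that warmup length is the ceiling of `lr_scheduler_warmup_ratio` times the total step count. The variable names and the `f1_score` helper are illustrative only.

```python
import math

# Values copied from the updated model card in this diff.
per_device_train_batch_size = 1
gradient_accumulation_steps = 8
num_epochs = 5
steps_per_epoch = 1741  # "Step" value logged at epoch 1.0
warmup_ratio = 0.1

# Assumption: one device, so the effective batch is just batch size * accumulation.
num_devices = 1

# Effective batch size per optimizer update (the card reports total_train_batch_size: 8).
total_train_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)

# Total optimizer updates over training (the card's last row shows step 8705).
total_steps = steps_per_epoch * num_epochs

# Assumption: warmup steps = ceil(total steps * warmup ratio).
warmup_steps = math.ceil(total_steps * warmup_ratio)


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Recompute F1 from the reported precision and recall of each card version.
new_f1 = f1_score(0.8589, 0.8686)  # card reports 0.8637
old_f1 = f1_score(0.5789, 0.9167)  # card reports 0.7097

print(total_train_batch_size, total_steps, warmup_steps)
print(round(new_f1, 4), round(old_f1, 4))
```

Under these assumptions, the effective batch size stays at 8 in both versions of the card (8 × 1 before, 1 × 8 after), so the main training differences are the base model, the learning rate, and the added warmup.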