itsbilal90 commited on
Commit
3b0fd72
·
verified ·
1 Parent(s): a896d55

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +39 -9
README.md CHANGED
@@ -5,6 +5,17 @@ tags:
5
  model-index:
6
  - name: distilbert-base-uncased-ner-finer
7
  results: []
 
 
 
 
 
 
 
 
 
 
 
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -12,21 +23,40 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # distilbert-base-uncased-ner-finer
14
 
15
- This model was trained from scratch on an unknown dataset.
16
-
17
  ## Model description
18
 
19
- More information needed
20
-
21
- ## Intended uses & limitations
22
 
23
- More information needed
 
 
 
 
 
24
 
25
  ## Training and evaluation data
26
 
27
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
28
 
29
- ## Training procedure
30
 
31
  ### Training hyperparameters
32
 
@@ -44,4 +74,4 @@ The following hyperparameters were used during training:
44
  - Transformers 4.47.1
45
  - Pytorch 2.5.1+cu121
46
  - Datasets 3.2.0
47
- - Tokenizers 0.21.0
 
5
  model-index:
6
  - name: distilbert-base-uncased-ner-finer
7
  results: []
8
+ datasets:
9
+ - nlpaueb/finer-139
10
+ language:
11
+ - en
12
+ metrics:
13
+ - accuracy
14
+ - precision
15
+ - f1
16
+ - confusion_matrix
17
+ base_model:
18
+ - distilbert/distilbert-base-uncased
19
  ---
20
 
21
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
23
 
24
  # distilbert-base-uncased-ner-finer
25
 
 
 
26
  ## Model description
27
 
28
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the [Finer-139](https://huggingface.co/datasets/nlpaueb/finer-139) dataset.
 
 
29
 
30
+ It achieves the following results on the evaluation set:
31
+ - Loss: 0.0293
32
+ - Precision: 0.8768
33
+ - Recall: 0.9064
34
+ - F1: 0.8914
35
+ - Accuracy: 0.9901
36
 
37
  ## Training and evaluation data
38
 
39
+ The training data consists of the top 4 ner_tags having the most occurence from the Finer-139 dataset plus the outside tag "O".
40
+
41
+ ## Training results
42
+
43
+ | Epoch | Training Loss | Validation Loss | Precision | Recall | F1 | Accuracy |
44
+ |---|---|---|---|---|---|---|
45
+ | 1 | 0.035700 | 0.035880 | 0.847873 | 0.890125 | 0.868486 | 0.987242 |
46
+ | 2 | 0.023700 | 0.029618 | 0.867055 | 0.906431 | 0.886306 | 0.989505 |
47
+ | 3 | 0.017000 | 0.029322 | 0.876898 | 0.906431 | 0.891420 | 0.990180 |
48
+
49
+ ## Valiadtion results
50
+
51
+ | ner_tag | precision | recall | f1-score | support |
52
+ |--------------|-----------|--------|----------|---------|
53
+ | O | 1.00 | 0.99 | 1.00 | 229573 |
54
+ | I-DebtInstrumentInterestRateStatedPercentage | 0.94 | 0.94 | 0.94 | 5412 |
55
+ | I-LineOfCreditFacilityMaximumBorrowingCapacity | 0.82 | 0.88 | 0.85 | 4288 |
56
+ | I-DebtInstrumentBasisSpreadOnVariableRate1 | 0.89 | 0.97 | 0.93 | 4788 |
57
+ | I-DebtInstrumentFaceAmount | 0.79 | 0.76 | 0.78 | 3398 |
58
 
59
+ ![confusion matrix](https://cdn-uploads.huggingface.co/production/uploads/6791ddd9f0ecdeb1a8aa6883/3CftA28uzQAU6Oqi_Iddl.png)
60
 
61
  ### Training hyperparameters
62
 
 
74
  - Transformers 4.47.1
75
  - Pytorch 2.5.1+cu121
76
  - Datasets 3.2.0
77
+ - Tokenizers 0.21.0