Commit f04404f
1 Parent(s): 9351a39
Update README.md
README.md CHANGED
@@ -34,7 +34,9 @@ language:
 
 # req_mod_ner_modelv2
 
-This model is a fine-tuned version of [pdelobelle/robbert-v2-dutch-ner](https://huggingface.co/pdelobelle/robbert-v2-dutch-ner) on a
+This model is a fine-tuned version of [pdelobelle/robbert-v2-dutch-ner](https://huggingface.co/pdelobelle/robbert-v2-dutch-ner) on a
+private dataset with 300 sentences/phrases with 1,954 token labels (IOB2 format) aimed at extracting software requirements
+related named entities. The following labels are used:
 - Actor (used for all types of software users and groups of users)
 - COTS (abbreviation for Commercial Off-The-Shelf Software)
 - Function (used for functions, functionality, features)
@@ -71,9 +73,7 @@ The model performs automated extraction of functionality concepts from source do
 
 ## Training and evaluation data
 
-The model was trained on the
-
-## Training procedure
+The model was trained on the ReqModNer dataset. This dataset is private and contains 300 sentences/phrases and 1,954 IOB2 labels. The dataset is split 240/30/30 into train, validation and test. The reported metrics are from the evaluation on the test set. The validation set was used for cross-validation during training.
 
 ### Training hyperparameters
 
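
The added README text mentions IOB2 token labels and a 240/30/30 train/validation/test split. As a rough illustration only, the sketch below shows what IOB2 tagging and such a split could look like in Python. The helper names (`iob2_tags`, `split_dataset`), the example sentence, the shuffling, and the seed are all assumptions made for illustration; the commit does not say how the private ReqModNer dataset was actually annotated or split, and only the three labels visible in the diff are used here.

```python
import random

# Labels visible in the diff context; the full model card may list more.
ENTITY_TYPES = ["Actor", "COTS", "Function"]


def iob2_tags(tokens, spans):
    """Build IOB2 tags from (start, end, label) spans, with end exclusive.

    In IOB2, the first token of every entity gets a B- prefix, the
    remaining tokens of that entity get I-, and all other tokens get O.
    """
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        if label not in ENTITY_TYPES:
            raise ValueError(f"unknown label: {label}")
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


def split_dataset(examples, sizes=(240, 30, 30), seed=0):
    """Shuffle and cut examples into train/validation/test parts.

    The shuffle and the fixed seed are assumptions, not the author's method.
    """
    examples = list(examples)
    if sum(sizes) != len(examples):
        raise ValueError("split sizes must add up to the dataset size")
    random.Random(seed).shuffle(examples)
    n_train, n_val, _ = sizes
    train = examples[:n_train]
    validation = examples[n_train:n_train + n_val]
    test = examples[n_train + n_val:]
    return train, validation, test


# Hypothetical Dutch requirement phrase: "The administrator can export reports".
tokens = ["De", "beheerder", "kan", "rapporten", "exporteren"]
spans = [(1, 2, "Actor"), (3, 5, "Function")]
print(list(zip(tokens, iob2_tags(tokens, spans))))

train, validation, test = split_dataset(range(300))
print(len(train), len(validation), len(test))
```

With a fixed held-out validation set like this, the validation part serves for model selection during training, while the test part is only touched once for the reported metrics.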