mateiaassAI commited on
Commit
13943ef
·
verified ·
1 Parent(s): 7d77c3e

End of training

Browse files
Files changed (4) hide show
  1. README.md +19 -19
  2. config.json +12 -4
  3. model.safetensors +2 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -5,7 +5,7 @@ base_model: dumitrescustefan/bert-base-romanian-cased-v1
5
  tags:
6
  - generated_from_trainer
7
  datasets:
8
- - laroseda
9
  metrics:
10
  - f1
11
  - accuracy
@@ -18,24 +18,24 @@ model-index:
18
  name: Text Classification
19
  type: text-classification
20
  dataset:
21
- name: laroseda
22
- type: laroseda
23
- config: laroseda
24
- split: train
25
- args: laroseda
26
  metrics:
27
  - name: F1
28
  type: f1
29
- value: 0.9484126984126985
30
  - name: Accuracy
31
  type: accuracy
32
- value: 0.948
33
  - name: Precision
34
  type: precision
35
- value: 0.9465346534653465
36
  - name: Recall
37
  type: recall
38
- value: 0.9502982107355865
39
  ---
40
 
41
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -43,14 +43,14 @@ should probably proofread and complete it, then remove this comment. -->
43
 
44
  # teacher_laroseda
45
 
46
- This model is a fine-tuned version of [dumitrescustefan/bert-base-romanian-cased-v1](https://huggingface.co/dumitrescustefan/bert-base-romanian-cased-v1) on the laroseda dataset.
47
  It achieves the following results on the evaluation set:
48
- - Loss: 0.0784
49
- - F1: 0.9484
50
  - Roc Auc: None
51
- - Accuracy: 0.948
52
- - Precision: 0.9465
53
- - Recall: 0.9503
54
 
55
  ## Model description
56
 
@@ -69,7 +69,7 @@ More information needed
69
  ### Training hyperparameters
70
 
71
  The following hyperparameters were used during training:
72
- - learning_rate: 1.7e-05
73
  - train_batch_size: 16
74
  - eval_batch_size: 16
75
  - seed: 42
@@ -81,8 +81,8 @@ The following hyperparameters were used during training:
81
 
82
  | Training Loss | Epoch | Step | Validation Loss | F1 | Roc Auc | Accuracy | Precision | Recall |
83
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|:---------:|:------:|
84
- | 0.111 | 1.0 | 688 | 0.0642 | 0.9475 | None | 0.947 | 0.9447 | 0.9503 |
85
- | 0.0587 | 2.0 | 1376 | 0.0784 | 0.9484 | None | 0.948 | 0.9465 | 0.9503 |
86
 
87
 
88
  ### Framework versions
 
5
  tags:
6
  - generated_from_trainer
7
  datasets:
8
+ - moroco
9
  metrics:
10
  - f1
11
  - accuracy
 
18
  name: Text Classification
19
  type: text-classification
20
  dataset:
21
+ name: moroco
22
+ type: moroco
23
+ config: moroco
24
+ split: validation
25
+ args: moroco
26
  metrics:
27
  - name: F1
28
  type: f1
29
+ value: 0.8735212436845421
30
  - name: Accuracy
31
  type: accuracy
32
+ value: 0.8554298260428982
33
  - name: Precision
34
  type: precision
35
+ value: 0.875780196577562
36
  - name: Recall
37
  type: recall
38
+ value: 0.8713068696755483
39
  ---
40
 
41
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
43
 
44
  # teacher_laroseda
45
 
46
+ This model is a fine-tuned version of [dumitrescustefan/bert-base-romanian-cased-v1](https://huggingface.co/dumitrescustefan/bert-base-romanian-cased-v1) on the moroco dataset.
47
  It achieves the following results on the evaluation set:
48
+ - Loss: 0.0937
49
+ - F1: 0.8735
50
  - Roc Auc: None
51
+ - Accuracy: 0.8554
52
+ - Precision: 0.8758
53
+ - Recall: 0.8713
54
 
55
  ## Model description
56
 
 
69
  ### Training hyperparameters
70
 
71
  The following hyperparameters were used during training:
72
+ - learning_rate: 2e-05
73
  - train_batch_size: 16
74
  - eval_batch_size: 16
75
  - seed: 42
 
81
 
82
  | Training Loss | Epoch | Step | Validation Loss | F1 | Roc Auc | Accuracy | Precision | Recall |
83
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|:---------:|:------:|
84
+ | 0.1161 | 1.0 | 1358 | 0.1005 | 0.8657 | None | 0.8551 | 0.8822 | 0.8532 |
85
+ | 0.0807 | 2.0 | 2716 | 0.0937 | 0.8735 | None | 0.8554 | 0.8758 | 0.8713 |
86
 
87
 
88
  ### Framework versions
config.json CHANGED
@@ -9,14 +9,22 @@
9
  "hidden_dropout_prob": 0.1,
10
  "hidden_size": 768,
11
  "id2label": {
12
- "0": "Negative",
13
- "1": "Positive"
 
 
 
 
14
  },
15
  "initializer_range": 0.02,
16
  "intermediate_size": 3072,
17
  "label2id": {
18
- "Negative": 0,
19
- "Positive": 1
 
 
 
 
20
  },
21
  "layer_norm_eps": 1e-12,
22
  "max_position_embeddings": 512,
 
9
  "hidden_dropout_prob": 0.1,
10
  "hidden_size": 768,
11
  "id2label": {
12
+ "0": "culture",
13
+ "1": "finance",
14
+ "2": "politics",
15
+ "3": "science",
16
+ "4": "sports",
17
+ "5": "tech"
18
  },
19
  "initializer_range": 0.02,
20
  "intermediate_size": 3072,
21
  "label2id": {
22
+ "culture": 0,
23
+ "finance": 1,
24
+ "politics": 2,
25
+ "science": 3,
26
+ "sports": 4,
27
+ "tech": 5
28
  },
29
  "layer_norm_eps": 1e-12,
30
  "max_position_embeddings": 512,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dd87f03fecb6daf574b34f325d1e1242526ac2ddb24d9ecd6879f482c482ae90
3
- size 497795072
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cae10a859983bfc9f55faf666d79cc33ec9460b5256bfe73aa48c9fd86a55ed7
3
+ size 497807376
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5d6c49d51c22c9ea9009eabd22a8f08d17e043ffcff86b4bbc6f29b0970ba1a0
3
  size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:70411d2e62a23bf5d52f0985f2b8ba20217404797f084cb74ecb0ab78898dd4b
3
  size 5176