Update README.md
Browse files
README.md
CHANGED
@@ -83,11 +83,11 @@ https://github.com/nicolay-r/Reasoning-for-Sentiment-Analysis-Framework
|
|
83 |
Google-colab notebook for reproduction:
|
84 |
https://colab.research.google.com/github/nicolay-r/Reasoning-for-Sentiment-Analysis-Framework/blob/main/Reasoning_for_Sentiment_Analysis_Framework.ipynb
|
85 |
|
86 |
-
**Setup:** `Flan-T5-base`, output up to 300 tokens,
|
87 |
|
88 |
**GPU:** `NVidia-A100`, ~4 min/epoch, temperature 1.0, float 32
|
89 |
|
90 |
-
The overall training process took **
|
91 |
|
92 |
data:image/s3,"s3://crabby-images/02b46/02b46c57c908068a4e7058e451f1ac08965a713e" alt="image/png"
|
93 |
|
@@ -118,11 +118,11 @@ The test evaluation for this model [showcases](https://arxiv.org/abs/2404.12342)
|
|
118 |
Below is the log of the training process that showcases the final peformance on the RuSentNE-2023 `test` set after 4 epochs (lines 5-6):
|
119 |
```tsv
|
120 |
F1_PN F1_PN0 default mode
|
121 |
-
0
|
122 |
-
1
|
123 |
-
2
|
124 |
-
3
|
125 |
-
4
|
126 |
-
5
|
127 |
-
6
|
128 |
```
|
|
|
83 |
Google-colab notebook for reproduction:
|
84 |
https://colab.research.google.com/github/nicolay-r/Reasoning-for-Sentiment-Analysis-Framework/blob/main/Reasoning_for_Sentiment_Analysis_Framework.ipynb
|
85 |
|
86 |
+
**Setup:** `Flan-T5-base`, output up to 300 tokens, 16-batch size.
|
87 |
|
88 |
**GPU:** `NVidia-A100`, ~4 min/epoch, temperature 1.0, float 32
|
89 |
|
90 |
+
The overall training process took **5 epochs**.
|
91 |
|
92 |
data:image/s3,"s3://crabby-images/02b46/02b46c57c908068a4e7058e451f1ac08965a713e" alt="image/png"
|
93 |
|
|
|
118 |
Below is the log of the training process that showcases the final peformance on the RuSentNE-2023 `test` set after 4 epochs (lines 5-6):
|
119 |
```tsv
|
120 |
F1_PN F1_PN0 default mode
|
121 |
+
0 45.523 59.375 59.375 valid
|
122 |
+
1 62.345 70.260 70.260 valid
|
123 |
+
2 62.722 70.704 70.704 valid
|
124 |
+
3 62.721 70.671 70.671 valid
|
125 |
+
4 62.357 70.247 70.247 valid
|
126 |
+
5 60.024 68.171 68.171 test
|
127 |
+
6 60.024 68.171 68.171 test
|
128 |
```
|