Update README.md
Browse files
README.md
CHANGED
@@ -27,7 +27,7 @@ The following datasets are all in Chinese.
|
|
27 |
|
28 |
|
29 |
## Model List
|
30 |
-
The evaluation dataset is in Chinese, and we used the same language model **RoBERTa base** on different methods. In addition
|
31 |
| Model | STS-B(w-avg) | ATEC | BQ | LCQMC | PAWSX | Avg. |
|
32 |
|:-----------------------:|:------------:|:-----------:|:----------|:-------------|:------------:|:----------:|
|
33 |
| BERT-Whitening | 65.27| -| -| -| -| -|
|
|
|
27 |
|
28 |
|
29 |
## Model List
|
30 |
+
The evaluation dataset is in Chinese, and we used the same language model **RoBERTa base** on different methods. In addition, considering that the test set of some datasets is small, which may lead to a large deviation in evaluation accuracy, the evaluation data here uses train, valid and test at the same time, and the final evaluation result adopts the **weighted average (w-avg)** method.
|
31 |
| Model | STS-B(w-avg) | ATEC | BQ | LCQMC | PAWSX | Avg. |
|
32 |
|:-----------------------:|:------------:|:-----------:|:----------|:-------------|:------------:|:----------:|
|
33 |
| BERT-Whitening | 65.27| -| -| -| -| -|
|