Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,21 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
datasets:
|
4 |
+
- deutsche-telekom/ger-backtrans-paraphrase
|
5 |
+
- paws-x
|
6 |
+
- stsb_multi_mt
|
7 |
+
language:
|
8 |
+
- de
|
9 |
---
|
10 |
+
The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
|
11 |
+
|
12 |
+
## Model Training
|
13 |
+
The model has been fine-tuned on the German subsets of the following datasets:
|
14 |
+
- [German paraphrase corpus by Philip May](https://huggingface.co/datasets/deutsche-telekom/ger-backtrans-paraphrase)
|
15 |
+
- [paws-x](https://huggingface.co/datasets/paws-x)
|
16 |
+
- [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
|
17 |
+
|
18 |
+
# Results
|
19 |
+
The model achieves the following results:
|
20 |
+
- 0.920 on stsb's validation subset
|
21 |
+
- 0.904 on stsb's test subset
|