danielheinz committed on
Commit 9aaae0e · verified · 1 Parent(s): 32dc0bc

Update README.md

Files changed (1)
  1. README.md +18 -0
README.md CHANGED
@@ -1,3 +1,21 @@
  ---
  license: mit
+ datasets:
+ - deutsche-telekom/ger-backtrans-paraphrase
+ - paws-x
+ - stsb_multi_mt
+ language:
+ - de
  ---
+ The model is a [multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) model fine-tuned with the task of semantic textual similarity in mind.
+
+ ## Model Training
+ The model has been fine-tuned on the German subsets of the following datasets:
+ - [German paraphrase corpus by Philip May](https://huggingface.co/datasets/deutsche-telekom/ger-backtrans-paraphrase)
+ - [paws-x](https://huggingface.co/datasets/paws-x)
+ - [stsb_multi_mt](https://huggingface.co/datasets/stsb_multi_mt)
+
+ # Results
+ The model achieves the following results:
+ - 0.920 on stsb's validation subset
+ - 0.904 on stsb's test subset
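
Note on the reported scores: the README does not say which metric the two numbers refer to. Semantic textual similarity benchmarks are commonly scored as the Spearman rank correlation between the cosine similarity of each sentence pair's embeddings and the human-assigned gold score, so the sketch below assumes that setup. It is not taken from the commit. The toy embeddings, the gold scores, and all function names are hypothetical, and only the Python standard library is used so that it runs without downloading a model.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical 2-d "embeddings" for three sentence pairs (not model output).
pairs = [
    ([1.0, 0.0], [1.0, 0.1]),  # nearly parallel vectors
    ([1.0, 0.0], [0.5, 0.5]),  # 45 degrees apart
    ([1.0, 0.0], [0.0, 1.0]),  # orthogonal vectors
]
# Hypothetical gold similarity scores on a 0-5 scale.
gold = [4.8, 2.5, 0.2]

predicted = [cosine_similarity(u, v) for u, v in pairs]
score = spearman(predicted, gold)
print(round(score, 3))
```

Because Spearman correlation compares rankings only, a model can reach a high score without its cosine values matching the gold scale, as long as it orders the pairs the same way the annotators did.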