update
Browse files
README.md
CHANGED
@@ -12,17 +12,17 @@ license: gpl-3.0
|
|
12 |
|
13 |
# CKIP Oldhan BERT Base Chinese WS
|
14 |
|
15 |
-
This model provides word segmentation for the
|
16 |
|
17 |
## Homepage
|
18 |
* [ckiplab/han-transformers](https://github.com/ckiplab/han-transformers)
|
19 |
|
20 |
## Training Datasets
|
21 |
The copyright of the datasets belongs to the Institute of Linguistics, Academia Sinica.
|
22 |
-
* [中央研究院上古漢語標記語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/akiwi/kiwi.sh
|
23 |
-
* [中央研究院中古漢語語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/dkiwi/kiwi.sh
|
24 |
-
* [中央研究院近代漢語語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/pkiwi/kiwi.sh
|
25 |
-
* [中央研究院現代漢語語料庫](http://
|
26 |
|
27 |
## Contributors
|
28 |
* Chin-Tung Lin at [CKIP](https://ckip.iis.sinica.edu.tw/)
|
|
|
12 |
|
13 |
# CKIP Oldhan BERT Base Chinese WS
|
14 |
|
15 |
+
This model provides word segmentation for the ancient Chinese language. Our training dataset covers four eras of the Chinese language.
|
16 |
|
17 |
## Homepage
|
18 |
* [ckiplab/han-transformers](https://github.com/ckiplab/han-transformers)
|
19 |
|
20 |
## Training Datasets
|
21 |
The copyright of the datasets belongs to the Institute of Linguistics, Academia Sinica.
|
22 |
+
* [中央研究院上古漢語標記語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/akiwi/kiwi.sh)
|
23 |
+
* [中央研究院中古漢語語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/dkiwi/kiwi.sh)
|
24 |
+
* [中央研究院近代漢語語料庫](http://lingcorpus.iis.sinica.edu.tw/cgi-bin/kiwi/pkiwi/kiwi.sh)
|
25 |
+
* [中央研究院現代漢語語料庫](http://asbc.iis.sinica.edu.tw)
|
26 |
|
27 |
## Contributors
|
28 |
* Chin-Tung Lin at [CKIP](https://ckip.iis.sinica.edu.tw/)
|