Commit a968673 by ljvmiranda921 (verified) · 1 parent: 6c2fd6d

Update README.md

Files changed (1)
  1. README.md +15 -1
README.md CHANGED
@@ -57,8 +57,22 @@ model-index:
  - name: Sentences F-Score
  type: f_score
  value: 0.9968010237
+ datasets:
+ - UD-Filipino/UD_Tagalog-NewsCrawl
+ pipeline_tag: token-classification
+ library_name: spacy
  ---
- Parsers for UD-NewsCrawl
+
+ <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
+
+ # UD Parser (Multilingual context-sensitive vectors + transition-based parser)
+
+ This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
+ It uses [fastText word-embeddings for Tagalog](https://huggingface.co/facebook/fasttext-tl-vectors).
+ It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
+ The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
+ More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
+
 
  | Feature | Description |
  | --- | --- |