Commit
Β·
7af3cad
1
Parent(s):
d23943e
Update README.md
Browse files
README.md
CHANGED
@@ -45,6 +45,7 @@ model-index:
|
|
45 |
---
|
46 |
# π Keyphrase Generation model: T5-small-OpenKP
|
47 |
Keyphrase extraction is a technique in text analysis where you extract the important keyphrases from a document. Thanks to these keyphrases humans can understand the content of a text very quickly and easily without reading it completely. Keyphrase extraction was first done primarily by human annotators, who read the text in detail and then wrote down the most important keyphrases. The disadvantage is that if you work with a lot of documents, this process can take a lot of time β³.
|
|
|
48 |
Here is where Artificial Intelligence π€ comes in. Currently, classical machine learning methods, that use statistical and linguistic features, are widely used for the extraction process. Now with deep learning, it is possible to capture the semantic meaning of a text even better than these classical methods. Classical methods look at the frequency, occurrence and order of words in the text, whereas these neural approaches can capture long-term semantic dependencies and context of words in a text.
|
49 |
|
50 |
|
@@ -122,6 +123,7 @@ print(keyphrases)
|
|
122 |
|
123 |
## π Training Dataset
|
124 |
[OpenKP](https://github.com/microsoft/OpenKP) is a large-scale, open-domain keyphrase extraction dataset with 148,124 real-world web documents along with 1-3 most relevant human-annotated keyphrases.
|
|
|
125 |
You can find more information in the [paper](https://arxiv.org/abs/1911.02671).
|
126 |
|
127 |
## π·ββοΈ Training Procedure
|
@@ -212,6 +214,7 @@ def extract_keyphrases(examples):
|
|
212 |
## π Evaluation Results
|
213 |
|
214 |
Traditional evaluation methods are the precision, recall and F1-score @k,m where k is the number that stands for the first k predicted keyphrases and m for the average amount of predicted keyphrases. In keyphrase generation you also look at F1@O where O stands for the number of ground truth keyphrases.
|
|
|
215 |
The model achieves the following results on the OpenKP test set:
|
216 |
|
217 |
|
|
|
45 |
---
|
46 |
# π Keyphrase Generation model: T5-small-OpenKP
|
47 |
Keyphrase extraction is a technique in text analysis where you extract the important keyphrases from a document. Thanks to these keyphrases humans can understand the content of a text very quickly and easily without reading it completely. Keyphrase extraction was first done primarily by human annotators, who read the text in detail and then wrote down the most important keyphrases. The disadvantage is that if you work with a lot of documents, this process can take a lot of time β³.
|
48 |
+
|
49 |
Here is where Artificial Intelligence π€ comes in. Currently, classical machine learning methods, that use statistical and linguistic features, are widely used for the extraction process. Now with deep learning, it is possible to capture the semantic meaning of a text even better than these classical methods. Classical methods look at the frequency, occurrence and order of words in the text, whereas these neural approaches can capture long-term semantic dependencies and context of words in a text.
|
50 |
|
51 |
|
|
|
123 |
|
124 |
## π Training Dataset
|
125 |
[OpenKP](https://github.com/microsoft/OpenKP) is a large-scale, open-domain keyphrase extraction dataset with 148,124 real-world web documents along with 1-3 most relevant human-annotated keyphrases.
|
126 |
+
|
127 |
You can find more information in the [paper](https://arxiv.org/abs/1911.02671).
|
128 |
|
129 |
## π·ββοΈ Training Procedure
|
|
|
214 |
## π Evaluation Results
|
215 |
|
216 |
Traditional evaluation methods are the precision, recall and F1-score @k,m where k is the number that stands for the first k predicted keyphrases and m for the average amount of predicted keyphrases. In keyphrase generation you also look at F1@O where O stands for the number of ground truth keyphrases.
|
217 |
+
|
218 |
The model achieves the following results on the OpenKP test set:
|
219 |
|
220 |
|