Update README.md
Browse files
README.md
CHANGED
@@ -9,10 +9,10 @@ This is the project for the course Multimedia System in Leiden University 2023 f
|
|
9 |
The original summarization model is from https://towardsdatascience.com/text-summarization-with-gpt2-and-layer-ai-599625085d8e .
|
10 |
|
11 |
## Training Datasets
|
12 |
-
- Amazon review dataset
|
13 |
-
- Twitter crawler dataset
|
14 |
-
- Emotion analysis dataset
|
15 |
-
- Kindle review dataset
|
16 |
|
17 |
## TODO
|
18 |
- Improve training data quality
|
|
|
9 |
The original summarization model is from https://towardsdatascience.com/text-summarization-with-gpt2-and-layer-ai-599625085d8e .
|
10 |
|
11 |
## Training Datasets
|
12 |
+
- Amazon review dataset https://www.kaggle.com/datasets/kritanjalijain/amazon-reviews?select=amazon_review_polarity_csv.tgz
|
13 |
+
- Twitter crawler dataset https://www.kaggle.com/datasets/tripathiharsh/training
|
14 |
+
- Emotion analysis dataset https://huggingface.co/datasets/dair-ai/emotion
|
15 |
+
- Kindle review dataset https://www.kaggle.com/datasets/meetnagadia/amazon-kindle-book-review-for-sentiment-analysis/data
|
16 |
|
17 |
## TODO
|
18 |
- Improve training data quality
|