research / documents /docs /2-总结功能.md
haoqi7's picture
Upload 47 files
16188ba
# 2 Research Trends Summarization
## Model Architecture
![](https://i.imgur.com/Lv8um1V.png)
### 1 Baseline Configuration
1. pre-trained language model: `sentence-transformers/all-MiniLM-L6-v2`
2. dimension reduction: `None`
3. clustering algorithms: `kmeans`
4. keywords extraction model: `keyphrase-transformer`
[[example run](https://github.com/Mondkuchen/idp_LiteratureResearch_Tool/blob/main/example_run.py)] [[results](https://github.com/Mondkuchen/idp_LiteratureResearch_Tool/blob/main/examples/IDP.ipynb)]
### TODO:
1. clustering: using other clustering algorithms such as Gausian Mixture Model (GMM)
2. keywords extraction model: train another model
3. add dimension reduction
4. better PLM: sentence-transformers/sentence-t5-xxl