---
title: PyTerrier Doc2Query
emoji: 🐕
colorFrom: green
colorTo: green
sdk: gradio
sdk_version: 3.7
app_file: app.py
pinned: false
models:
- macavaney/doc2query-t5-base-msmarco
---
<style>
.transformer {
display: inline-block;
background: #8facdb;
position: relative;
height: 60px;
line-height: 60px;
padding: 0 24px;
margin: 0 18px;
color: #333;
}
.transformer::before {
content: "";
position: absolute;
bottom: 0;
top: 0;
left: -15px;
border-top: 30px solid #8facdb;
border-bottom: 30px solid #8facdb;
border-left: 15px solid transparent;
}
.transformer::after {
content: "";
position: absolute;
bottom: 0;
top: 0;
right: -15px;
border-top: 30px solid transparent;
border-bottom: 30px solid transparent;
border-left: 15px solid #8facdb;
}
.df {
width: 24px;
line-height: 24px;
text-align: center;
border: 3px double #888;
background-color: #eee;
color: #333;
border-radius: 4px;
display: inline-block;
box-sizing: content-box;
}
.pipeline {
text-align: center;
}
</style>
This is a demonstration of [PyTerrier's Doc2Query package](https://github.com/terrierteam/pyterrier_doc2query). Doc2Query generates
queries that a document might answer. Appending these queries to the document's text before indexing boosts terms that are already important and adds relevant terms that the original text lacks.
Doc2Query functions as a `D→D` (document-to-document) transformer, so it can be placed in a pipeline wherever a document frame is both the input and the output.
<div class="pipeline">
<div class="df" title="Document Frame">D</div>
<div class="transformer" title="Doc2Query transformer">Doc2Query</div>
<div class="df" title="Document Frame">D</div>
</div>
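
To make the `D→D` idea concrete, here is a minimal, library-free sketch of the expansion step. It does not use the Doc2Query package or its model: `toy_generator`, `expand_documents`, and the `querygen` field name are hypothetical names chosen for illustration, and the generator is a trivial placeholder standing in for a real query-generation model.

```python
from typing import Callable, Dict, List

# Hypothetical type: any function that maps document text to a list of queries.
QueryGenerator = Callable[[str], List[str]]


def toy_generator(text: str) -> List[str]:
    """Placeholder for a real model: builds one query from the first three words."""
    first_words = " ".join(text.split()[:3]).lower()
    return [f"what is {first_words}"]


def expand_documents(
    docs: List[Dict[str, str]],
    generate: QueryGenerator,
    append: bool = True,
) -> List[Dict[str, str]]:
    """Take documents in and return documents out (D -> D), with generated queries attached."""
    expanded = []
    for doc in docs:
        queries = "\n".join(generate(doc["text"]))
        new_doc = dict(doc)            # copy so the input documents are left unchanged
        new_doc["querygen"] = queries  # assumed field name for the generated queries
        if append:
            # Appending the queries is what boosts existing terms and adds new ones.
            new_doc["text"] = doc["text"] + "\n" + queries
        expanded.append(new_doc)
    return expanded


docs = [{"docno": "d1", "text": "Terriers are small energetic dogs"}]
out = expand_documents(docs, toy_generator)
print(out[0]["text"])
```

The output has the same shape as the input (one record per document, same `docno`), which is why a step like this can sit in front of an indexer without the indexer needing to know that expansion happened.
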
Try it below!