jmvcoelho
/

t5-base-msmarco-squad-query-generation-firstp-v2

Text2Text Generation

text-generation-inference

Model card Files Files and versions Community

t5-base-msmarco-squad-query-generation-firstp-v2 / README.md

jmvcoelho's picture

Update README.md

de79d5e about 2 years ago

|

history blame contribute delete

817 Bytes

	---
	license: wtfpl
	datasets:
	- ms_marco
	- squad
	language:
	- en
	---
	# Model

	t5-base-msmarco-squad-query-generation-firstp-v2

	Task: query generation
	Architecture: T5

	Base model: t5-base

	Note: This is supposed to be a baseline model.


	## Prompt:

	"Generate Query: {document}. Query:"

	## Sequence length:

	512 tokens

	## Training details

	### Hyperparameters

	Batch size: 8;
	Gradient acc: 8;
	LR: 3e-4, linear scheduler, 400 warmup steps.


	### Data

	Total: 252059 pairs (document, query)

	From MARCO-V2: 165238
	From SQuAD: 86821

	The remaining queries from MARCO-V2 train split were not used.

	## Evaluation

	This model is supposed to be used for data augmentation.
	Hence, meaningful evaluation will come from downstream tasks.

	MARCO-V2 Dev1:
	BLEU: 0.105
	ROUGE: 0.449

	MARCO-V2 Dev2:
	BLEU: 0.171
	ROUGE: 0.503