Spaces:

rag-datasets
/

README

Running

README / README.md

Update README.md

f0ccb80 over 1 year ago

914 Bytes

	---
	title: README
	emoji: 🐨
	colorFrom: purple
	colorTo: indigo
	sdk: static
	pinned: false
	---

	To test your RAG solution it would be powerful to have access to a dataset that consists of a text corpus,
	correct responses to queries (e.g. question-answer) to test the solution end-to-end and maybe even a set of relevant passages
	from the text corpus for each query to test the retrieval component separately as well.
	We call this a question-answer-passages dataset.

	There are plenty of large-scale datasets of this kind such as [Google's Natural Questions](https://ai.google.com/research/NaturalQuestions/).

	Still we lack such datasets that are small-scale and narrow-domain to just test our RAG solution quickly or to see how it performs
	in a certain domain context.

	We created this space to create a collections of such datasets to boost the developement of RAG solutions.

	Datasets consist of:
	* asdf