README / README.md
tillwenke's picture
Update README.md
f0ccb80
|
raw
history blame
914 Bytes
---
title: README
emoji: 🐨
colorFrom: purple
colorTo: indigo
sdk: static
pinned: false
---
To test your RAG solution it would be powerful to have access to a dataset that consists of a text corpus,
correct responses to queries (e.g. question-answer) to test the solution end-to-end and maybe even a set of relevant passages
from the text corpus for each query to test the retrieval component separately as well.
We call this a question-answer-passages dataset.
There are plenty of large-scale datasets of this kind such as [Google's Natural Questions](https://ai.google.com/research/NaturalQuestions/).
Still we lack such datasets that are **small-scale** and **narrow-domain** to just test our RAG solution quickly or to see how it performs
in a certain domain context.
We created this space to create a collections of such datasets to boost the developement of RAG solutions.
Datasets consist of:
* asdf