RAGTheDocs / README.md
jerpint's picture
update readme
4ac3999
|
raw
history blame
1.75 kB
metadata
title: RAGTheDocs
emoji: πŸ‘€
colorFrom: gray
colorTo: yellow
sdk: gradio
sdk_version: 3.50.2
app_file: app.py
pinned: false
license: mit

RAGtheDocs

Introduction πŸ“š

RAGTheDocs is an open-source library that allows you to one-click deploy retrieval augmented generation (RAG) on any readthedocs documentation on huggingface πŸ€— spaces!

Usage πŸ‘‰

  1. Go to the example space
  2. Duplicate the space:

image

  1. Set your environment variables:
  • OPENAI_API_KEY (required): Needed for the app to work, e.g. sk-...
  • READTHEDOCS_URL (required): The url of the website you are interested in scraping
  • READTHEDOCS_VERSION (optional): This is important only if there exist multiple versions of the docs (e.g. "en/v0.2.7" or "en/latest"). If left empty, it will scrape all available versions (there can be many for open-source projects!).

Features πŸš€

  • Web Scraping and embeddings: RAGtheDocs automatically scrapes and embeds documentation from any website generated by ReadTheDocs/Sphinx using OpenAI embeddings

  • RAG Interface: It comes built-in with a gradio UI for users to interact with Buster πŸ€– our RAG agent.

  • Customization Options: Tailor RAGtheDocs prompts and settings with customizable settings and options.

Disclaimers ❗

  • This is a quickly hacked together side-project. This code should be considered experimental at best.

  • This library will automatically call OpenAI APIs for you (for embeddings and chatGPT).

  • Use at your own risk! ⚠️