initialized chain and merged from main
Files changed:
- .devcontainer/Dockerfile +12 -3
- .devcontainer/devcontainer.json +1 -1
- .github/ISSUE_TEMPLATE/issue_template.md +34 -0
- .github/ISSUE_TEMPLATE/pullrequest_template.md +28 -0
- .github/workflows/check_file_size.yaml +16 -0
- .github/workflows/sync_2_hf.yaml +20 -0
- README.md +12 -0
- app_gui.py +73 -9
- rag_app/chains/__init__.py +2 -1
- rag_app/hybrid_search.py +63 -0
.devcontainer/Dockerfile
CHANGED
@@ -44,6 +44,15 @@ RUN echo "done 0" \
     && pyenv global ${PYTHON_VERSION} \
     && echo "done 3" \
     && curl -sSL https://install.python-poetry.org | python3 - \
-    && poetry config virtualenvs.in-project true
-
-
+    && poetry config virtualenvs.in-project true
+
+COPY requirements.txt /tmp/
+RUN DEBIAN_FRONTEND=noninteractive \
+    && pip install --requirements /tmp/requirements.txt
+COPY . /tmp/
+
+RUN DEBIAN_FRONTEND=noninteractive \
+    && python -m pip install --upgrade pip
+
+ARG USERNAME
+COPY C:/Users/${USERNAME}/.ssh/id_ed25519 ${HOME}/.ssh/
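Review note on the appended stage: `pip install --requirements` is not a valid pip flag (the long form is `--requirement`, usually written `-r`), and a `COPY` source must be a path inside the build context, so the absolute `C:/Users/${USERNAME}/.ssh/id_ed25519` source will fail at build time; copying a private SSH key into an image layer would also be a security risk even if it worked.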
.devcontainer/devcontainer.json
CHANGED
@@ -16,7 +16,7 @@
     // 👇 Configure tool-specific properties.
     "customizations": {
         "vscode": {
-            "extensions":["ms-python.python",
+            "extensions":["ms-python.python","njpwerner.autodocstring","ms-azuretools.vscode-docker","qwtel.sqlite-viewer"]
         }
     }
 
.github/ISSUE_TEMPLATE/issue_template.md
ADDED
@@ -0,0 +1,34 @@
+# Issue Template for LLM + RAG Applications
+
+## Description
+Please provide a clear and concise description of the issue. Include what you expected to happen versus what actually happened. If the issue is related to retrieval results or generative outputs, specify the discrepancies.
+
+## Steps to Reproduce
+1. Detail the exact steps to reproduce the problem. Include any specific inputs given to the system.
+2.
+3.
+(Continue as needed)
+
+## Expected vs. Actual Results
+- **Expected Results**: Describe what you expected to happen.
+- **Actual Results**: Detail what actually happened, including any unexpected outputs or errors.
+
+## Environment
+- Application version:
+- LLM model version:
+- RAG component (if applicable):
+- OS:
+- Additional tools/libraries:
+
+## Query or Input Details
+- **Input Query/Text**: Provide the exact text or input given to the system.
+- **Retrieval Source(s)**: Specify the datasets or sources queried by the RAG component, if relevant.
+
+## Logs and Error Messages
+Please include any relevant logs, error messages, or stack traces that could help diagnose the issue.
+
+## Screenshots
+If applicable, add screenshots to help explain the issue, especially if it involves UI elements.
+
+## Additional Context
+Add any other context about the problem here, such as specific configurations of the LLM or RAG components that might be relevant.
.github/ISSUE_TEMPLATE/pullrequest_template.md
ADDED
@@ -0,0 +1,28 @@
+# Pull Request Template
+
+## Description
+Please include a brief description of the changes introduced by this PR.
+
+## Related Issue(s)
+- If this PR addresses a particular issue, please reference it here using GitHub's linking syntax, e.g., "Fixes #123".
+- If there's no related issue, briefly explain the motivation behind these changes.
+
+## Changes Made
+Please provide a list of the changes made in this PR.
+
+## Screenshots (if applicable)
+If the changes include UI updates or visual changes, please attach relevant screenshots here.
+
+## Checklist
+- [ ] I have tested my changes locally and ensured that they work as expected.
+- [ ] I have updated the documentation (if applicable).
+- [ ] My code follows the project's coding conventions and style guidelines.
+- [ ] I have added appropriate test cases (if applicable).
+- [ ] I have reviewed my own code to ensure its quality.
+
+## Additional Notes
+Add any additional notes or context about this PR here.
+
+## Reviewer(s)
+- @reviewer1
+- @reviewer2
.github/workflows/check_file_size.yaml
ADDED
@@ -0,0 +1,16 @@
+name: Check file size
+on: # or directly `on: [push]` to run the action on every push on any branch
+  pull_request:
+    branches: [main]
+
+  # to run this workflow manually from the Actions tab
+  workflow_dispatch:
+
+jobs:
+  sync-to-hub:
+    runs-on: ubuntu-latest
+    steps:
+      - name: Check large files
+        uses: ActionsDesk/[email protected]
+        with:
+          filesizelimit: 10485760 # this is 10MB so we can sync to HF Spaces
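The `filesizelimit` is expressed in bytes; a one-line Python sanity check of the figure:

assert 10 * 1024 * 1024 == 10485760  # 10 MB in binary units, as the comment says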
.github/workflows/sync_2_hf.yaml
ADDED
@@ -0,0 +1,20 @@
+name: Sync to Hugging Face hub
+on:
+  push:
+    branches: [main]
+
+  # to run this workflow manually from the Actions tab
+  workflow_dispatch:
+
+jobs:
+  sync-to-hub:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v3
+        with:
+          fetch-depth: 0
+          lfs: true
+      - name: Push to hub
+        env:
+          HF_TOKEN: ${{ secrets.HF_TOKEN }}
+        run: git push https://sabazo:$HF_TOKEN@huggingface.co/spaces/sabazo/insurance-advisor-agents main
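Note: `fetch-depth: 0` checks out the full git history, which matters here because the Hugging Face remote rejects shallow pushes.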
README.md
CHANGED
@@ -1,3 +1,15 @@
+---
+title: Insurance Advisor Agents PoC
+emoji: 🤖
+colorFrom: red
+colorTo: indigo
+sdk: docker
+python: 3.11
+app_file: app_gui.py
+pinned: false
+---
+
+
 # Insurance Advisor Agent(s)
 
 Setup a modular, multi-agent system to handle inqueries to an insurance company. The system utilizes different approachs to find reliable answers regarding the insurance products
app_gui.py
CHANGED
@@ -1,6 +1,7 @@
 # Import Gradio for UI, along with other necessary libraries
 import gradio as gr
 from fastapi import FastAPI
+from fastapi import FastAPI
 from rag_app.agents.react_agent import agent_executor, llm
 from rag_app.chains import user_response_sentiment_prompt
 from typing import Dict
@@ -10,27 +11,36 @@ from rag_app.loading_data.load_S3_vector_stores import get_chroma_vs
 from rag_app.agents.react_agent import agent_executor
 # need to import the qa!
 
-
-
-
 app = FastAPI()
 get_chroma_vs()
-user_sentiment_chain = user_response_sentiment_prompt | llm
-# data = user_sentiment_chain.invoke({"user_reponse":"thanks for the help"})
-data = user_sentiment_chain.invoke({"user_reponse":"OMG I AM SO LOST!!! HELP!!!"})
-responses = extract_responses(data)
-if responses['AI'] == "1":
-    print("GG")
 
 if __name__ == "__main__":
 
     # Function to add a new input to the chat history
+    def add_text(history, text):
+        # Append the new text to the history with a placeholder for the response
+        history = history + [(text, None)]
+        return history, ""
+    # Function to add a new input to the chat history
     def add_text(history, text):
         # Append the new text to the history with a placeholder for the response
         history = history + [(text, None)]
         return history, ""
 
     # Function representing the bot's response mechanism
+    def bot(history):
+        # Obtain the response from the 'infer' function using the latest input
+        response = infer(history[-1][0], history)
+        #sources = [doc.metadata.get("source") for doc in response['source_documents']]
+        #src_list = '\n'.join(sources)
+        #print_this = response['result'] + "\n\n\n Sources: \n\n\n" + src_list
+
+
+        #history[-1][1] = print_this #response['answer']
+        # Update the history with the bot's response
+        history[-1][1] = response['output']
+        return history
+    # Function representing the bot's response mechanism
     def bot(history):
         # Obtain the response from the 'infer' function using the latest input
         response = infer(history[-1][0], history)
@@ -38,6 +48,26 @@ if __name__ == "__main__":
         return history
 
     # Function to infer the response using the RAG model
+    def infer(question, history):
+        # Use the question and history to query the RAG model
+        #result = qa({"query": question, "history": history, "question": question})
+        try:
+            result = agent_executor.invoke(
+                {
+                    "input": question,
+                    "chat_history": history
+                }
+            )
+            return result
+        except Exception:
+            raise gr.Error("Model is Overloaded, Please retry later!")
+
+    def vote(data: gr.LikeData):
+        if data.liked:
+            print("You upvoted this response: " + data.value)
+        else:
+            print("You downvoted this response: " + data.value)
+    # Function to infer the response using the RAG model
     def infer(question, history):
         # Use the question and history to query the RAG model
         #result = qa({"query": question, "history": history, "question": question})
@@ -72,7 +102,17 @@ if __name__ == "__main__":
     css = """
     #col-container {max-width: 700px; margin-left: auto; margin-right: auto;}
     """
+    # CSS styling for the Gradio interface
+    css = """
+    #col-container {max-width: 700px; margin-left: auto; margin-right: auto;}
+    """
 
+    # HTML content for the Gradio interface title
+    title = """
+    <div style="text-align:left;">
+    <p>Hello, I BotTina 2.0, your intelligent AI assistant. I can help you explore Wuerttembergische Versicherungs products.<br />
+    </div>
+    """
     # HTML content for the Gradio interface title
     title = """
     <div style="text-align:left;">
@@ -80,6 +120,17 @@ if __name__ == "__main__":
     </div>
     """
 
+    # Building the Gradio interface
+    with gr.Blocks(theme=gr.themes.Soft()) as demo:
+        with gr.Column(elem_id="col-container"):
+            gr.HTML(title) # Add the HTML title to the interface
+            chatbot = gr.Chatbot([], elem_id="chatbot",
+                                 label="BotTina 2.0",
+                                 bubble_full_width=False,
+                                 avatar_images=(None, "https://dacodi-production.s3.amazonaws.com/store/87bc00b6727589462954f2e3ff6f531c.png"),
+                                 height=680,) # Initialize the chatbot component
+            chatbot.like(vote, None, None)
+            clear = gr.Button("Clear") # Add a button to clear the chat
     # Building the Gradio interface
     with gr.Blocks(theme=gr.themes.Soft()) as demo:
         with gr.Column(elem_id="col-container"):
@@ -95,6 +146,9 @@ if __name__ == "__main__":
             # Create a row for the question input
             with gr.Row():
                 question = gr.Textbox(label="Question", placeholder="Type your question and hit Enter ")
+            # Create a row for the question input
+            with gr.Row():
+                question = gr.Textbox(label="Question", placeholder="Type your question and hit Enter ")
 
             # Define the action when the question is submitted
             question.submit(add_text, [chatbot, question], [chatbot, question], queue=False).then(
@@ -102,6 +156,16 @@ if __name__ == "__main__":
             )
             # Define the action for the clear button
             clear.click(lambda: None, None, chatbot, queue=False)
+            # Define the action when the question is submitted
+            question.submit(add_text, [chatbot, question], [chatbot, question], queue=False).then(
+                bot, chatbot, chatbot
+            )
+            # Define the action for the clear button
+            clear.click(lambda: None, None, chatbot, queue=False)
+
+    # Launch the Gradio demo interface
+    demo.queue().launch(share=False, debug=True)
 
+    app = gr.mount_gradio_app(app, demo, path="/")
     # Launch the Gradio demo interface
     demo.queue().launch(share=False, debug=True)
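After this merge, app_gui.py defines `add_text`, `bot`, and `infer` twice, imports `FastAPI` twice, and calls `demo.queue().launch()` both before and after `gr.mount_gradio_app`; Python keeps the later definition of each function, and the first blocking `launch()` means the mounted FastAPI app is only reached once the UI exits. A minimal sketch of the intended entry point (assumptions: one definition per handler, and uvicorn serving the mounted app, which the Space's Docker image would have to invoke):

# Sketch under the assumptions above, not the committed code.
import gradio as gr
import uvicorn
from fastapi import FastAPI

app = FastAPI()

with gr.Blocks(theme=gr.themes.Soft()) as demo:
    ...  # build the chatbot, textbox, and event wiring exactly once

# Serve the Gradio UI through the FastAPI app at the root path
app = gr.mount_gradio_app(app, demo, path="/")

if __name__ == "__main__":
    # 7860 is the port Hugging Face Spaces expects a Docker app to listen on by default
    uvicorn.run(app, host="0.0.0.0", port=7860)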
rag_app/chains/__init__.py
CHANGED
@@ -1 +1,2 @@
-# from rag_app.chains.s
+# from rag_app.chains.s
+from rag_app.chains.user_response_sentiment_chain import user_response_sentiment_prompt
rag_app/hybrid_search.py
ADDED
@@ -0,0 +1,63 @@
+from pathlib import Path
+from langchain_community.vectorstores import FAISS
+from dotenv import load_dotenv
+import os
+from langchain_community.embeddings import HuggingFaceInferenceAPIEmbeddings
+from langchain.retrievers import EnsembleRetriever
+from langchain_community.retrievers import BM25Retriever
+
+
+def get_hybrid_search_results(query:str,
+                              path_to_db:str,
+                              embedding_model:str,
+                              hf_api_key:str,
+                              num_docs:int=5) -> list:
+    """ Uses an ensemble retriever of BM25 and FAISS to return k num documents
+
+    Args:
+        query (str): The search query
+        path_to_db (str): Path to the vectorstore database
+        embedding_model (str): Embedding model used in the vector store
+        num_docs (int): Number of documents to return
+
+    Returns
+        List of documents
+
+    """
+
+    embeddings = HuggingFaceInferenceAPIEmbeddings(api_key=hf_api_key,
+                                                   model_name=embedding_model)
+    # Load the vectorstore database
+    db = FAISS.load_local(folder_path=path_to_db,
+                          embeddings=embeddings,
+                          allow_dangerous_deserialization=True)
+
+    all_docs = db.similarity_search("", k=db.index.ntotal)
+
+    bm25_retriever = BM25Retriever.from_documents(all_docs)
+    bm25_retriever.k = num_docs # How many results you want
+
+    faiss_retriever = db.as_retriever(search_kwargs={'k': num_docs})
+
+    ensemble_retriever = EnsembleRetriever(retrievers=[bm25_retriever, faiss_retriever],
+                                           weights=[0.5,0.5])
+
+    results = ensemble_retriever.invoke(input=query)
+    return results
+
+
+if __name__ == "__main__":
+    query = "Haustierversicherung"
+    HUGGINGFACEHUB_API_TOKEN = os.getenv('HUGGINGFACEHUB_API_TOKEN')
+    EMBEDDING_MODEL = os.getenv("EMBEDDING_MODEL")
+
+    path_to_vector_db = Path("..")/'vectorstore/faiss-insurance-agent-500'
+
+    results = get_hybrid_search_results(query=query,
+                                        path_to_db=path_to_vector_db,
+                                        embedding_model=EMBEDDING_MODEL,
+                                        hf_api_key=HUGGINGFACEHUB_API_TOKEN)
+
+    for doc in results:
+        print(doc)
+        print()
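LangChain's `EnsembleRetriever` fuses the BM25 and FAISS result lists with weighted Reciprocal Rank Fusion. A self-contained sketch of that fusion idea (illustrative only; the names and doc ids here are made up for the example, not LangChain internals):

from collections import defaultdict

def weighted_rrf(rankings: list[list[str]], weights: list[float], c: int = 60) -> list[str]:
    # Each ranking is a list of doc ids, best first; c dampens the top ranks (60 is a common choice)
    scores = defaultdict(float)
    for ranking, weight in zip(rankings, weights):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += weight / (rank + c)
    return sorted(scores, key=scores.get, reverse=True)

# Equal 0.5/0.5 weights, mirroring the EnsembleRetriever call above
bm25_hits = ["d1", "d2", "d3", "d4", "d5"]
faiss_hits = ["d3", "d1", "d6", "d7", "d8"]
print(weighted_rrf([bm25_hits, faiss_hits], [0.5, 0.5]))
# Documents ranked well by both retrievers (d1, d3) come out on top

Documents that score high in both lists accumulate contributions from each retriever, which is why the hybrid search surfaces results that pure keyword (BM25) or pure vector (FAISS) search alone would rank lower.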