Spaces:
Build error
Build error
title: ColPali Visual Retrieval | |
emoji: π | |
colorFrom: green | |
colorTo: blue | |
sdk: docker | |
sdk_version: "3.11" | |
app_file: app.py | |
pinned: false | |
# ColPali Visual Retrieval with Vespa | |
A powerful visual document retrieval system that combines **ColPali** (Contextual Late Interaction with Patch-level Information) with **Vespa** for scalable, intelligent document search and question-answering. | |
## π Features | |
- **Visual Document Search**: Search through PDF documents using natural language queries | |
- **Token-level Similarity Maps**: Visualize exactly which parts of documents match your query | |
- **AI-Powered Chat**: Ask questions about retrieved documents using Google Gemini | |
- **Multiple Ranking Methods**: Choose between ColPali, BM25, or Hybrid ranking | |
## π Try It Out | |
1. Enter a natural language query in the search box | |
2. Select your preferred ranking method | |
3. Click on token buttons to see visual attention maps | |
4. Ask follow-up questions in the chat interface | |
## π Sample Queries | |
- "Pie chart with model comparison" | |
- "Speaker diarization evaluation" | |
- "Results table from dense retrieval" | |
- "Graph showing training loss" | |
- "Architecture diagram with transformer" | |
## π οΈ Technology Stack | |
- **ColPali**: Visual-language model for document understanding | |
- **Vespa**: Distributed search engine for scalability | |
- **FastHTML**: Modern web framework for the UI | |
- **Google Gemini**: AI-powered question answering | |
## π About the Dataset | |
This demo uses ~400 pages from AI-related research papers published in 2024. The documents are processed using ColPali to create visual embeddings that enable semantic search across document images. | |
## π Links | |
- [ColPali Paper](https://arxiv.org/abs/2404.09317) | |
- [Vespa Documentation](https://docs.vespa.ai/) | |
- [Blog Post](https://blog.vespa.ai/visual-retrieval-with-colpali-and-vespa/) | |
- [GitHub Repository](https://github.com/vespa-engine/vespa/tree/master/examples/colpali-visual-retrieval) |