
title: Gaia Llamaindex Agent
emoji: 🦙
colorFrom: red
colorTo: pink
sdk: docker
app_file: app.py
pinned: false
short_description: Test To Pass GAIA

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

🦙 GAIA Benchmark Agent with LlamaIndex

This Space implements a complete LlamaIndex agent designed to tackle the GAIA (General AI Assistants) benchmark questions.

Features

Local LLM: Runs the language model inside the Hugging Face Space, with no external LLM API required
LlamaIndex Integration: Uses ReAct agent framework for reasoning and tool use
GAIA API Integration: Fetches questions and submits answers automatically
Tool Suite: Web search, calculation, file reading, and more
User-Friendly Interface: Gradio UI for testing and submission
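
As an illustration of what a calculation tool from the tool suite might look like, here is a minimal sketch. It is not the code in `app.py`: the function name `calculate` and the set of supported operators are assumptions made for this example. It walks the parsed expression tree instead of calling `eval()`, so only plain arithmetic gets through.

```python
import ast
import operator

# Whitelisted operators (an assumption for this sketch); anything else is rejected.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def calculate(expression: str) -> float:
    """Evaluate a plain arithmetic expression without calling eval()."""

    def _eval(node: ast.AST) -> float:
        # Numeric literals only; booleans and strings are rejected below.
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
            return _BINARY_OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](_eval(node.operand))
        raise ValueError(f"Unsupported expression: {expression!r}")

    return _eval(ast.parse(expression, mode="eval").body)
```

A function like this could be registered with the agent as a tool. Names, function calls, and attribute access all raise `ValueError`, which keeps model-generated input from running arbitrary code.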

Architecture

📦 GAIA Agent
├── 🧠 Local LLM (DialoGPT/GPT-2)
├── 🔧 Agent Tools
│   ├── Web Search
│   ├── Calculator
│   ├── File Reader
│   └── GAIA API Client
├── 🤖 ReAct Agent (LlamaIndex)
└── 🖥️ Gradio Interface
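
To make the relationship between the LLM, the tools, and the ReAct agent more concrete, the following is a simplified sketch of a ReAct-style loop in plain Python. It is not LlamaIndex's implementation and not the code in this Space. The text protocol (`Action: name[input]`, `Final Answer: ...`), the function `react_loop`, the `word_count` tool, and the scripted stand-in for the LLM are all assumptions made for illustration.

```python
import re
from typing import Callable, Dict

# Hypothetical text protocol for this sketch.
ACTION_RE = re.compile(r"Action:\s*(\w+)\[(.*)\]")
ANSWER_RE = re.compile(r"Final Answer:\s*(.*)", re.DOTALL)


def react_loop(
    question: str,
    llm: Callable[[str], str],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 5,
) -> str:
    """Alternate between model output and tool calls until an answer appears."""
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        reply = llm(transcript)
        transcript += reply + "\n"
        answer = ANSWER_RE.search(reply)
        if answer:
            return answer.group(1).strip()
        action = ACTION_RE.search(reply)
        if action is None:
            observation = "No action or final answer found."
        else:
            name, arg = action.groups()
            tool = tools.get(name)
            observation = tool(arg) if tool else f"Unknown tool: {name}"
        # Feed the tool result back so the next model call can use it.
        transcript += f"Observation: {observation}\n"
    return "No answer found within the step limit."


def scripted_llm(transcript: str) -> str:
    """Stand-in for the local model: one tool call, then a final answer."""
    seen = re.search(r"Observation: (.*)", transcript)
    if seen is None:
        return "Thought: I should count the words.\nAction: word_count[the quick brown fox]"
    return f"Thought: I have the count.\nFinal Answer: {seen.group(1)}"


tools = {"word_count": lambda text: str(len(text.split()))}
result = react_loop("How many words are in 'the quick brown fox'?", scripted_llm, tools)
```

In the real Space, the scripted function would be replaced by the local model and the dictionary by the actual tool suite. The step limit matters with small models, which may never produce a well-formed final answer.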

Usage

Test Single Questions: Try individual GAIA questions
Full Evaluation: Process all 20 questions from the dataset
Submit to GAIA: Send answers for official scoring
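
The fetch, answer, and submit flow described above can be sketched roughly as follows. This README does not document the API, so every specific here is an assumption: the placeholder base URL, the `/questions` and `/submit` paths, and the field names `task_id`, `question`, `submitted_answer`, `username`, and `agent_code`. Check them against the actual scoring service before relying on them.

```python
import json
from typing import Callable, List
from urllib import request

# Placeholder only; not the real scoring endpoint.
API_BASE = "https://example.invalid/gaia"


def fetch_questions(base_url: str = API_BASE) -> List[dict]:
    """Download the question list (assumed to be a JSON array of objects)."""
    with request.urlopen(f"{base_url}/questions", timeout=30) as resp:
        return json.load(resp)


def build_submission(
    username: str,
    agent_code: str,
    questions: List[dict],
    answer_fn: Callable[[str], str],
) -> dict:
    """Run the agent on each question and collect answers in one payload."""
    answers = [
        {"task_id": item["task_id"], "submitted_answer": answer_fn(item["question"])}
        for item in questions
    ]
    return {"username": username, "agent_code": agent_code, "answers": answers}


def submit(payload: dict, base_url: str = API_BASE) -> dict:
    """POST the payload as JSON and return the decoded response."""
    req = request.Request(
        f"{base_url}/submit",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with request.urlopen(req, timeout=60) as resp:
        return json.load(resp)
```

Keeping `build_submission` separate from the network calls makes it possible to test single questions locally, with any callable standing in for the agent, before sending anything for scoring.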

Scoring Target

The goal is to reach 30% accuracy on GAIA Level 1 questions. For the 20-question full evaluation, that means at least 6 correct answers.
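
The arithmetic behind the target is small enough to check in code. The helper names below are hypothetical and only illustrate the calculation. Integer math is used so the comparison does not depend on floating-point rounding.

```python
def min_correct_needed(total: int, target_percent: int = 30) -> int:
    """Smallest number of correct answers that reaches the target (ceiling division)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return (target_percent * total + 99) // 100


def meets_target(correct: int, total: int, target_percent: int = 30) -> bool:
    """True when correct/total is at least target_percent percent."""
    if total <= 0:
        raise ValueError("total must be positive")
    return correct * 100 >= target_percent * total
```

With the 20 questions processed in a full evaluation, `min_correct_needed(20)` gives 6, so 5 correct answers fall short of the target and 6 meet it.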

Hardware Requirements

CPU: Works on free tier
Memory: ~8GB recommended
GPU: Optional but improves performance

Getting Started

Clone or duplicate this Space
Run the application
Start with single question testing
Process all questions when ready
Submit to GAIA leaderboard

Built with ❤️ for the GAIA benchmark challenge!