metadata
license: apache-2.0
datasets:
- kaist-ai/Perception-Collection
- kaist-ai/Perception-Bench
language:
- en
metrics:
- pearsonr
- spearmanr
library_name: transformers
pipeline_tag: image-to-text
tags:
- Image-to-Text
- Visual Question Answering
- Text2Text Generation
Links for Reference
- Homepage:
- Repository: https://github.com/kaistAI/prometheus-vision
- Paper: https://arxiv.org/abs/2401.06591
- Point of Contact: [email protected]
TL;DR
Prometheus-Vision is the first open-source VLM specialized for evaluation purposes. Prometheus-Vision shows a high correlation with both GPT-4V and human evaluators, indicating its potential to be used as a cheap alternative for GPT-4V evaluation.