Prometheus-Vision is the first open-source VLM specialized for evaluation purposes.
Prometheus-Vision is trained in two different sizes (7B and 13B).
You can check out the 7B model on [this page](https://huggingface.co/kaist-ai/prometheus-vision-7b-v1.0).
Also, check out our dataset on [this page](https://huggingface.co/datasets/kaist-ai/Perception-Collection).
## Prompt Format
Prometheus-Vision requires five components in its input: an image, an instruction, a response to evaluate, a score rubric, and a reference answer. You can refer to the prompt format below.
Fill in the instruction, response, reference answer, criteria description, and a score description for each score from 1 to 5.
```
###Task Description:
An instruction (might include an Input inside it), a response to evaluate, a reference answer that gets a score of 5, an image and a score rubric representing an evaluation criterion is given.
1. Write a detailed feedback that assess the quality of the response strictly based on the given score rubric, not evaluating in general.
2. After writing a feedback, write a score that is an integer between 1 and 5. You should refer to the score rubric.
3. The output format should look as follows: \"Feedback: (write a feedback for criteria) [RESULT] (an integer number between 1 and 5)\"
4. Please do not generate any other opening, closing, and explanations.

###The instruction to evaluate:
{instruction}

###Response to evaluate:
{response}

###Reference Answer (Score 5):
{reference_answer}

###Score Rubrics:
[{criteria_description}]
Score 1: {score1_description}
Score 2: {score2_description}
Score 3: {score3_description}
Score 4: {score4_description}
Score 5: {score5_description}

###Feedback:
```
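As a minimal sketch, the template above can be filled with Python's `str.format`. The placeholder names mirror the template; the template string is abridged here (use the full text above), and all example values are illustrative, not from our data.

```python
# Abridged copy of the prompt template above; the placeholder names match.
template = (
    "###The instruction to evaluate:\n{instruction}\n\n"
    "###Response to evaluate:\n{response}\n\n"
    "###Reference Answer (Score 5):\n{reference_answer}\n\n"
    "###Score Rubrics:\n[{criteria_description}]\n"
    "Score 1: {score1_description}\n"
    "Score 2: {score2_description}\n"
    "Score 3: {score3_description}\n"
    "Score 4: {score4_description}\n"
    "Score 5: {score5_description}\n\n"
    "###Feedback:"
)

# Illustrative values -- replace with your own evaluation data.
prompt = template.format(
    instruction="Describe the weather shown in the image.",
    response="It looks sunny with a few clouds.",
    reference_answer="The image shows a clear, sunny sky with scattered clouds.",
    criteria_description="Does the response accurately describe the image?",
    score1_description="The response is entirely inaccurate.",
    score2_description="The response is mostly inaccurate.",
    score3_description="The response is partially accurate.",
    score4_description="The response is mostly accurate.",
    score5_description="The response is fully accurate.",
)
```

The image itself is passed to the model separately, alongside this text prompt.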
The model produces output in the following format. During inference, you can parse the score by taking the integer generated after the `[RESULT]` phrase.
```
{orig_feedback}
[RESULT] {orig_score}
```
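A minimal parsing sketch for the output format above: split on the `[RESULT]` marker and read the integer that follows. The helper name and the sample output string are illustrative.

```python
def parse_score(output: str) -> int:
    """Parse the integer score that follows the [RESULT] phrase."""
    # Take everything after the last [RESULT] marker and read the first token.
    tail = output.split("[RESULT]")[-1]
    return int(tail.strip().split()[0])

# Illustrative model output in the format shown above.
sample = "The response captures the scene accurately.\n[RESULT] 4"
score = parse_score(sample)  # -> 4
```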
## License
Perception Collection and Prometheus-Vision are subject to OpenAI's Terms of Use for the generated data. If you suspect any violations, please reach out to us.