runninglsy commited on
Commit
0690a60
·
verified ·
1 Parent(s): ce2fe01

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -13
README.md CHANGED
@@ -47,19 +47,31 @@ We are pleased to announce the release of **Ovis2**, our latest advancement in m
47
  | Ovis2-34B | aimv2-1B-patch14-448 | Qwen2.5-32B-Instruct | [Huggingface](https://huggingface.co/AIDC-AI/Ovis2-34B) | - |
48
 
49
  ## Performance
50
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/637aebed7ce76c3b834cea37/aCuSemmHy_MhrDaBiYfco.png)
51
-
52
- |Benchmark|Ovis2-1B|Ovis2-2B|Ovis2-4B|Ovis2-8B|Ovis2-16B|Ovis2-34B|
53
- |:---:|:---:|:---:|:---:|:---:|:---:|:---:|
54
- |MMBench-V1.1<sub>test</sub>|68.5|77.2|81.4|83.3|85.2|86.2|
55
- |MMStar|52.0|59.0|61.7|64.4|66.9|69.4|
56
- |MMMU<sub>val</sub>|36.0|45.3|48.0|59.0|59.6|65.6|
57
- |MathVista<sub>testmini</sub>|59.5|64.4|69.1|71.4|74.9|77.0|
58
- |HallBench<sub>avg</sub>|44.5|50.2|54.0|56.0|55.9|58.8|
59
- |AI2D<sub>test</sub>|76.8|82.6|85.5|86.8|86.1|88.4|
60
- |OCRBench|88.7|87.5|91.0|89.3|88.2|89.8|
61
- |MMVet|50.3|58.6|65.5|68.5|68.4|75.5|
62
- |Average|59.5|65.6|69.5|72.3|73.1|76.3|
 
 
 
 
 
 
 
 
 
 
 
 
63
 
64
  ## Usage
65
  Below is a code snippet demonstrating how to run Ovis with various input types. For additional usage instructions, including inference wrapper and Gradio UI, please refer to [Ovis GitHub](https://github.com/AIDC-AI/Ovis?tab=readme-ov-file#inference).
 
47
  | Ovis2-34B | aimv2-1B-patch14-448 | Qwen2.5-32B-Instruct | [Huggingface](https://huggingface.co/AIDC-AI/Ovis2-34B) | - |
48
 
49
  ## Performance
50
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/M1XRFbeNbfe1lEvt9WF-j.png)
51
+
52
+ ### Image Benchmark
53
+ | Benchmark | Qwen2.5-VL-72B | InternVL2.5-38B-MPO | InternVL2.5-26B-MPO | Ovis1.6-27B | LLaVA-OV-72B | Ovis2-16B | Ovis2-34B |
54
+ |:-----------------------------|:----------------:|:---------------------:|:---------------------:|:-------------:|:---------------------:|:-----------:|:-----------:|
55
+ | MMBench-V1.1<sub>test</sub> | **87.8** | 85.4 | 84.2 | 82.2 | 84.4 | 85.6 | 86.6 |
56
+ | MMStar | **71.1** | 70.1 | 67.7 | 63.5 | 65.8 | 67.2 | 69.2 |
57
+ | MMMU<sub>val</sub> | **67.9** | 63.8 | 56.4 | 60.3 | 56.6 | 60.7 | 66.7 |
58
+ | MathVista<sub>testmini</sub> | 70.8 | 73.6 | 71.5 | 70.2 | 68.4 | 73.7 | **76.1** |
59
+ | HallusionBench | 58.8 | **59.7** | 52.4 | 54.1 | 47.9 | 56.8 | 58.8 |
60
+ | AI2D | 88.2 | 87.9 | 86.2 | 86.6 | 86.2 | 86.3 | **88.3** |
61
+ | OCRBench | 88.1 | 89.4 | **90.5** | 85.6 | 74.1 | 87.9 | 89.4 |
62
+ | MMVet | 76.7 | 72.6 | 68.1 | 68 | 60.6 | 68.4 | **77.1** |
63
+ | MMBench<sub>test</sub> | **88.2** | 86.4 | 85.4 | 84.6 | 85.6 | 87.1 | 87.8 |
64
+ | MMT-Bench<sub>val</sub> | 69.1 | 69.1 | 65.7 | 68.2 | - | 69.2 | **71.2** |
65
+ | RealWorldQA | **75.9** | 74.4 | 73.7 | 72.7 | 73.9 | 74.1 | 75.6 |
66
+ | BLINK | 62.3 | **63.2** | 62.6 | 48 | - | 59.0 | 60.1 |
67
+ | QBench | - | 76.1 | 76.0 | 77.7 | - | 79.5 | **79.8** |
68
+ | ABench | - | 78.6 | **79.4** | 76.5 | - | **79.4** | 78.7 |
69
+ | MTVQA | - | **31.2** | 28.7 | 26.5 | - | 30.3 | 30.6 |
70
+
71
+ ### Video Benchmark
72
+
73
+ ### Reasoning Benchmark
74
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/aGc1DZLPEeERd0jGW9JRA.png)
75
 
76
  ## Usage
77
  Below is a code snippet demonstrating how to run Ovis with various input types. For additional usage instructions, including inference wrapper and Gradio UI, please refer to [Ovis GitHub](https://github.com/AIDC-AI/Ovis?tab=readme-ov-file#inference).