TryingHard commited on
Commit
4a82535
·
verified ·
1 Parent(s): a6343cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -13
README.md CHANGED
@@ -47,19 +47,28 @@ We are pleased to announce the release of **Ovis2**, our latest advancement in m
47
  | Ovis2-34B | aimv2-1B-patch14-448 | Qwen2.5-32B-Instruct | [Huggingface](https://huggingface.co/AIDC-AI/Ovis2-34B) | - |
48
 
49
  ## Performance
50
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/637aebed7ce76c3b834cea37/aCuSemmHy_MhrDaBiYfco.png)
51
-
52
- |Benchmark|Ovis2-1B|Ovis2-2B|Ovis2-4B|Ovis2-8B|Ovis2-16B|Ovis2-34B|
53
- |:---:|:---:|:---:|:---:|:---:|:---:|:---:|
54
- |MMBench-V1.1<sub>test</sub>|68.5|77.2|81.4|83.3|85.2|86.2|
55
- |MMStar|52.0|59.0|61.7|64.4|66.9|69.4|
56
- |MMMU<sub>val</sub>|36.0|45.3|48.0|59.0|59.6|65.6|
57
- |MathVista<sub>testmini</sub>|59.5|64.4|69.1|71.4|74.9|77.0|
58
- |HallBench<sub>avg</sub>|44.5|50.2|54.0|56.0|55.9|58.8|
59
- |AI2D<sub>test</sub>|76.8|82.6|85.5|86.8|86.1|88.4|
60
- |OCRBench|88.7|87.5|91.0|89.3|88.2|89.8|
61
- |MMVet|50.3|58.6|65.5|68.5|68.4|75.5|
62
- |Average|59.5|65.6|69.5|72.3|73.1|76.3|
 
 
 
 
 
 
 
 
 
63
 
64
  ## Usage
65
  Below is a code snippet demonstrating how to run Ovis with various input types. For additional usage instructions, including inference wrapper and Gradio UI, please refer to [Ovis GitHub](https://github.com/AIDC-AI/Ovis?tab=readme-ov-file#inference).
 
47
  | Ovis2-34B | aimv2-1B-patch14-448 | Qwen2.5-32B-Instruct | [Huggingface](https://huggingface.co/AIDC-AI/Ovis2-34B) | - |
48
 
49
  ## Performance
50
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/658a8a837959448ef5500ce5/M1XRFbeNbfe1lEvt9WF-j.png)
51
+
52
+ ### Image Benchmark
53
+ | Benchmark | Qwen2.5-VL-3B | SAIL-VL-2B | InternVL2.5-2B-MPO | Ovis1.6-3B | InternVL2.5-1B-MPO | Ovis2-1B | Ovis2-2B |
54
+ |:-----------------------------|:---------------:|:------------:|:--------------------:|:------------:|:--------------------:|:----------:|:----------:|
55
+ | MMBench-V1.1<sub>test</sub> | **77.1** | 73.6 | 70.7 | 74.1 | 65.8 | 68.4 | 76.9 |
56
+ | MMStar | 56.5 | 56.5 | 54.9 | 52.0 | 49.5 | 52.1 | **56.7** |
57
+ | MMMU<sub>val</sub> | **51.4** | 44.1 | 44.6 | 46.7 | 40.3 | 36.1 | 45.6 |
58
+ | MathVista<sub>testmini</sub> | 60.1 | 62.8 | 53.4 | 58.9 | 47.7 | 59.4 | **64.1** |
59
+ | HallusionBench | 48.7 | 45.9 | 40.7 | 43.8 | 34.8 | 45.2 | **50.2** |
60
+ | AI2D | 81.4 | 77.4 | 75.1 | 77.8 | 68.5 | 76.4 | **82.7** |
61
+ | OCRBench | 83.1 | 83.1 | 83.8 | 80.1 | 84.3 | **89.0** | 87.3 |
62
+ | MMVet | 63.2 | 44.2 | **64.2** | 57.6 | 47.2 | 50.0 | 58.3 |
63
+ | MMBench<sub>test</sub> | 78.6 | 77 | 72.8 | 76.6 | 67.9 | 70.2 | **78.9** |
64
+ | MMT-Bench<sub>val</sub> | 60.8 | 57.1 | 54.4 | 59.2 | 50.8 | 55.5 | **61.7** |
65
+ | RealWorldQA | 66.5 | 62 | 61.3 | **66.7** | 57 | 63.9 | 66.0 |
66
+ | BLINK | **48.4** | 46.4 | 43.8 | 43.8 | 41 | 44.0 | 47.9 |
67
+ | QBench | 74.4 | 72.8 | 69.8 | 75.8 | 63.3 | 71.3 | **76.2** |
68
+ | ABench | 75.5 | 74.5 | 71.1 | 75.2 | 67.5 | 71.3 | **76.6** |
69
+ | MTVQA | 24.9 | 20.2 | 22.6 | 21.1 | 21.7 | 23.7 | **25.6** |
70
+
71
+ ### Video Benchmark
72
 
73
  ## Usage
74
  Below is a code snippet demonstrating how to run Ovis with various input types. For additional usage instructions, including inference wrapper and Gradio UI, please refer to [Ovis GitHub](https://github.com/AIDC-AI/Ovis?tab=readme-ov-file#inference).