YuukiAsuna commited on
Commit
deea30f
·
verified ·
1 Parent(s): 2fda4c0

Add report link and benchmarks

Browse files
Files changed (1) hide show
  1. README.md +19 -4
README.md CHANGED
@@ -9,7 +9,12 @@ base_model:
9
  pipeline_tag: document-question-answering
10
  library_name: transformers
11
  ---
12
- # Model Card for Model ID
 
 
 
 
 
13
 
14
  <!-- Provide a quick summary of what the model is/does. -->
15
  Vintern-1B-v2-ViTable-docvqa is a fine-tuned version of the 5CD-AI/Vintern-1B-v2 multimodal model for the Vietnamese DocVQA (Table data)
@@ -17,11 +22,21 @@ Vintern-1B-v2-ViTable-docvqa is a fine-tuned version of the 5CD-AI/Vintern-1B-v2
17
 
18
  ## Benchmarks
19
 
20
- To be developed later
 
 
 
 
 
 
 
 
 
 
21
 
22
- ## Quickstart
23
 
24
- To be developed later
25
 
26
  **Citation:**
27
 
 
9
  pipeline_tag: document-question-answering
10
  library_name: transformers
11
  ---
12
+ # Vintern-1B-v2-ViTable-docvqa
13
+
14
+ <p align="center">
15
+ <a href="https://drive.google.com/file/d/1MU8bgsAwaWWcTl9GN1gXJcSPUSQoyWXy/view?usp=sharing"><b>Report Link</b>👁️</a>
16
+ </p>
17
+
18
 
19
  <!-- Provide a quick summary of what the model is/does. -->
20
  Vintern-1B-v2-ViTable-docvqa is a fine-tuned version of the 5CD-AI/Vintern-1B-v2 multimodal model for the Vietnamese DocVQA (Table data)
 
22
 
23
  ## Benchmarks
24
 
25
+ <div align="center">
26
+
27
+ | Model | ANLS | Semantic Similarity | MLLM-as-judge (Gemini) |
28
+ |-----------------------------|------------------------|------------------------|------------------------|
29
+ | Gemini 1.5 Flash | 0.35 | 0.56 | 0.40 |
30
+ | Vintern-1B-v2 | 0.04 | 0.45 | 0.50 |
31
+ | Vintern-1B-v2-ViTable-docvq | **0.50** | **0.71** | **0.59** |
32
+
33
+ </div>
34
+
35
+ <!-- Code benchmark: to be written later -->
36
 
37
+ <!-- To be written later ## Usage
38
 
39
+ You can use this notebook <a href="https://colab.research.google.com/"> <img src="https://colab.research.google.com/img/colab_favicon_256px.png" width="30"></a> -->
40
 
41
  **Citation:**
42