---
license: mit
language:
- ko
pipeline_tag: visual-question-answering
tags:
- beit3
---


# BEiT3-based Korean VQA Model # (https://github.com/SeanJeonghwanLee/KoBEiT3)

## Basic information ##
### Model ###
  - Base Model : beit3_large_indomain_patch16_224 (https://github.com/microsoft/unilm/tree/master/beit3)
    - best epoch : 8
    - learning rate : 2e-5
    - fixed seed : 42

  - Tokenizer
    - Korean SentencePiece tokenizer trained on Korean Wikipedia
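
The settings listed above can be gathered in one place and the seed fixed before fine-tuning. The following is a minimal sketch, not the repository's actual training code: the `TRAIN_CONFIG` dictionary and the `fix_seed` helper are hypothetical names, and only the values (base model, epoch 8, learning rate 2e-5, seed 42) come from this card. NumPy and PyTorch are seeded only if they happen to be installed.

```python
import os
import random

# Values are taken from the model card; the dictionary layout is a hypothetical convention.
TRAIN_CONFIG = {
    "base_model": "beit3_large_indomain_patch16_224",
    "best_epoch": 8,
    "learning_rate": 2e-5,
    "seed": 42,
}


def fix_seed(seed: int) -> None:
    """Seed every random number generator available in this environment."""
    # PYTHONHASHSEED only affects Python processes started after this point.
    os.environ["PYTHONHASHSEED"] = str(seed)
    random.seed(seed)
    try:
        import numpy as np  # optional dependency

        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch  # optional dependency

        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass


fix_seed(TRAIN_CONFIG["seed"])
```

Seeding alone does not guarantee bit-identical results on a GPU, so the reported best epoch may shift slightly between runs.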
    
### Dataset ###
  - KoBEiT3
    - AI Hub visual information-based question answering dataset (시각정보 기반 질의응답) (https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=104)
      * The dataset is accessible only to Korean nationals
  - Tokenizer
    - kowiki-latest-pages-articles.xml.bz2 (https://dumps.wikimedia.org/kowiki/latest/)
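
This card does not describe how the Wikipedia dump was turned into tokenizer training text. As one possible approach, the sketch below streams the compressed dump with the Python standard library and writes one line of text per row, which is the plain-text layout SentencePiece training typically consumes. The function names, the `min_chars` filter, and the output path are assumptions for illustration; wiki markup is left in place and would need further cleaning.

```python
import bz2
import xml.etree.ElementTree as ET
from typing import Iterator


def iter_article_texts(dump_path: str) -> Iterator[str]:
    """Yield the raw wikitext of each page in a MediaWiki XML dump (.xml.bz2)."""
    with bz2.open(dump_path, "rb") as stream:
        for _event, elem in ET.iterparse(stream, events=("end",)):
            # Tags carry an XML namespace such as "{http://...}text"; compare the local name.
            local_name = elem.tag.rsplit("}", 1)[-1]
            if local_name == "text" and elem.text:
                yield elem.text
            elif local_name == "page":
                elem.clear()  # drop the finished page's content to limit memory use


def write_corpus(dump_path: str, corpus_path: str, min_chars: int = 10) -> int:
    """Write each sufficiently long line to corpus_path; return the number of lines written."""
    written = 0
    with open(corpus_path, "w", encoding="utf-8") as out:
        for text in iter_article_texts(dump_path):
            for line in text.splitlines():
                line = line.strip()
                if len(line) >= min_chars:  # hypothetical filter for very short lines
                    out.write(line + "\n")
                    written += 1
    return written
```

For example, `write_corpus("kowiki-latest-pages-articles.xml.bz2", "kowiki.txt")` would produce a text file that could then be passed to a SentencePiece trainer.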