---
license: mit
language:
- ko
pipeline_tag: visual-question-answering
tags:
- beit3
---
# BEiT3-based Korean VQA Model #
GitHub: https://github.com/SeanJeonghwanLee/KoBEiT3
## Basic information ##
### Model ###
- Base model: beit3_large_indomain_patch16_224 (https://github.com/microsoft/unilm/tree/master/beit3)
- Best epoch: 8
- Learning rate: 2e-5
- Fixed seed: 42
- Tokenizer
  - Korean SentencePiece tokenizer trained on Korean Wikipedia
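
The fixed seed of 42 listed above can be applied along the lines of the following sketch. The helper name `fix_seed` is hypothetical and not taken from the KoBEiT3 repository; the NumPy and PyTorch calls are only used if those packages are installed, and the actual training script may seed things differently.

```python
import os
import random


def fix_seed(seed: int = 42) -> int:
    """Seed every random number generator available in this environment."""
    # Only affects hash randomization in subprocesses started after this call.
    os.environ["PYTHONHASHSEED"] = str(seed)
    random.seed(seed)
    try:
        import numpy as np  # optional dependency (assumption)

        np.random.seed(seed)
    except ImportError:
        pass
    try:
        import torch  # optional dependency (assumption)

        torch.manual_seed(seed)
        if torch.cuda.is_available():
            torch.cuda.manual_seed_all(seed)
    except ImportError:
        pass
    return seed


# Re-seeding with the same value reproduces the same draw.
fix_seed(42)
first_draw = random.random()
fix_seed(42)
second_draw = random.random()
print(first_draw == second_draw)
```

Seeding alone does not guarantee bit-identical GPU training runs; it only makes the random draws repeatable.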
### Dataset ###
- KoBEiT3
  - AI Hub "Visual Information-Based Question Answering" dataset (시각정보 기반 질의응답): https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=104
    * Access to this dataset is restricted to Korean nationals
- Tokenizer
  - kowiki-latest-pages-articles.xml.bz2 (https://dumps.wikimedia.org/kowiki/latest/)
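
As a rough illustration of how a SentencePiece tokenizer could be trained on text extracted from the Korean Wikipedia dump, here is a minimal sketch. The vocabulary size, model type, character coverage, file names, and both helper functions are assumptions chosen for illustration; the settings actually used for this model are not stated here. The dump is XML, so it is assumed to have been converted to a plain-text file (one sentence per line) beforehand.

```python
def build_trainer_args(corpus_path, model_prefix="ko_wiki_spm", vocab_size=32000):
    """Collect SentencePiece training options (all values are illustrative assumptions)."""
    return {
        "input": corpus_path,          # plain-text corpus extracted from the dump
        "model_prefix": model_prefix,  # output files: <prefix>.model and <prefix>.vocab
        "vocab_size": vocab_size,      # assumed value, not the one used by this model
        "model_type": "unigram",       # SentencePiece default
        "character_coverage": 0.9995,  # assumed value for a large character set
    }


def train_tokenizer(corpus_path):
    """Train a tokenizer and return the path of the resulting model file."""
    import sentencepiece as spm  # requires `pip install sentencepiece`

    args = build_trainer_args(corpus_path)
    spm.SentencePieceTrainer.train(**args)
    return args["model_prefix"] + ".model"


# Inspect the options without starting a training run.
args = build_trainer_args("kowiki.txt")
print(args)
```

Calling `train_tokenizer("kowiki.txt")` would then write `ko_wiki_spm.model`, which can be loaded with `sentencepiece.SentencePieceProcessor(model_file=...)`.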