|
--- |
|
library_name: transformers |
|
license: gpl-3.0 |
|
datasets: |
|
- phunc20/nj_biergarten_captcha_v2 |
|
base_model: |
|
- microsoft/trocr-base-handwritten |
|
--- |
|
|
|
# Model Card for trocr-base-handwritten_nj_biergarten_captcha_v2 |
|
|
|
This is a model for CAPTCHA OCR. |
|
|
|
|
|
|
|
## Model Details |
|
|
|
### Model Description |
|
|
|
This is a simple model finetuned from `microsoft/trocr-base-handwritten` on a dataset |
|
I created at `phunc20/nj_biergarten_captcha_v2`. |
|
|
|
## Uses |
|
|
|
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. --> |
|
|
|
|
|
### Direct Use |
|
|
|
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. --> |
|
|
|
[More Information Needed] |
|
|
|
|
|
## Bias, Risks, and Limitations |
|
|
|
Although the model seems to perform well on the dataset `phunc20/nj_biergarten_captcha_v2`, |
|
it does not exhibit such good performance across all CAPTCHA images. In this respect, this |
|
model is worse than Human. |
|
|
|
### Recommendations |
|
|
|
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. --> |
|
|
|
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. |
|
|
|
## How to Get Started with the Model |
|
|
|
Use the code below to get started with the model. |
|
|
|
[More Information Needed] |
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
Like I mentioned, I trained this model on `phunc20/nj_biergarten_captcha_v2`. |
|
In particular, I trained on the `train` split and evalaute on `validation` split, |
|
without touching the `test` split. |
|
|
|
### Training Procedure |
|
|
|
Please refer to |
|
<https://gitlab.com/phunc20/captchew/-/blob/main/colab_notebooks/train_from_pretrained_Seq2SeqTrainer_torchDataset.ipynb?ref_type=heads> |
|
which is adapted from |
|
<https://github.com/NielsRogge/Transformers-Tutorials/blob/master/TrOCR/Fine_tune_TrOCR_on_IAM_Handwriting_Database_using_Seq2SeqTrainer.ipynb> |
|
|
|
## Evaluation |
|
|
|
### Testing Data, Factors & Metrics |
|
|
|
#### Testing Data |
|
|
|
1. The `test` split of `phunc20/nj_biergarten_captcha_v2` |
|
2. This Kaggle dataset <https://www.kaggle.com/datasets/fournierp/captcha-version-2-images/data> |
|
(we shall call this dataset by the name of `kaggle_test_set` in this model card.) |
|
|
|
#### Factors |
|
|
|
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. --> |
|
|
|
[More Information Needed] |
|
|
|
#### Metrics |
|
|
|
CER, exact match and average length difference. The former two can be found in HuggingFace's |
|
documentation. The last one is just one metric I care a little about. It is quite easy to |
|
understand and, if need be, explanation could be found at the source code: |
|
<https://gitlab.com/phunc20/captchew/-/blob/v0.1/average_length_difference.py> |
|
|
|
### Results |
|
On the `test` split of `phunc20/nj_biergarten_captcha_v2` |
|
|
|
| Model | cer | exact match | avg len diff | |
|
| --------------------------------------------------------- | -------- | ----------- | ------------ | |
|
| `phunc20/trocr-base-handwritten_nj_biergarten_captcha_v2` | 0.001333 | 496/500 | 1/500 | |
|
| `microsoft/trocr-base-handwritten` | 0.9 | 5/500 | 2.4 | |
|
|
|
On `kaggle_test_set` |
|
|
|
| Model | cer | exact match | avg len diff | |
|
| --------------------------------------------------------- | -------- | ----------- | ------------ | |
|
| `phunc20/trocr-base-handwritten_nj_biergarten_captcha_v2` | 0.4381 | 69/1070 | 0.1289 | |
|
| `microsoft/trocr-base-handwritten` | 1.0112 | 17/1070 | 2.4439 | |
|
|
|
|
|
## Environmental Impact |
|
|
|
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). |
|
|
|
- **Hardware Type:** [More Information Needed] |
|
- **Hours used:** [More Information Needed] |
|
- **Cloud Provider:** [More Information Needed] |
|
- **Compute Region:** [More Information Needed] |
|
- **Carbon Emitted:** [More Information Needed] |