blip2_lora_vqa_model

This model is a fine-tuned version of Salesforce/blip2-flan-t5-xl on the arrow dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

learning_rate: 0.002
train_batch_size: 64
eval_batch_size: 64
seed: 42
optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
num_epochs: 10

Training Loss	Epoch	Step	Validation Loss	Exact	F1	Total	Hasans Exact	Hasans F1	Hasans Total	Best Exact	Best F1
No log	1.0	77	0.1769	57.3044	62.1696	1061	57.3044	62.1696	1061	57.3044	62.1696
1.8287	2.0	154	0.1209	63.8077	68.4910	1061	63.8077	68.4910	1061	63.8077	68.4910
0.1517	3.0	231	0.0959	65.0330	69.5182	1061	65.0330	69.5182	1061	65.0330	69.5182
0.1145	4.0	308	0.0863	67.9548	72.6603	1061	67.9548	72.6603	1061	67.9548	72.6603
0.1145	5.0	385	0.0788	70.8765	74.6248	1061	70.8765	74.6248	1061	70.8765	74.6248
0.0946	6.0	462	0.0697	73.1385	76.8465	1061	73.1385	76.8465	1061	73.1385	76.8465
0.0837	7.0	539	0.0724	72.3845	76.2186	1061	72.3845	76.2186	1061	72.3845	76.2186
0.069	8.0	616	0.0644	74.3638	77.8250	1061	74.3638	77.8250	1061	74.3638	77.8250
0.069	9.0	693	0.0632	73.7041	77.3865	1061	73.7041	77.3865	1061	73.7041	77.3865
0.0652	10.0	770	0.0612	74.8351	78.3950	1061	74.8351	78.3950	1061	74.8351	78.3950