diff --git "a/eval_mm_niah/reasoning-image-test.log" "b/eval_mm_niah/reasoning-image-test.log" deleted file mode 100644--- "a/eval_mm_niah/reasoning-image-test.log" +++ /dev/null @@ -1,3029 +0,0 @@ -language_model.model.layers.0 4 -language_model.model.layers.1 4 -language_model.model.layers.2 4 -language_model.model.layers.3 4 -language_model.model.layers.4 4 -language_model.model.layers.5 4 -language_model.model.layers.6 4 -language_model.model.layers.7 4 -language_model.model.layers.8 4 -language_model.model.layers.9 4 -language_model.model.layers.10 4 -language_model.model.layers.11 4 -language_model.model.layers.12 4 -language_model.model.layers.13 4 -language_model.model.layers.14 4 -language_model.model.layers.15 4 -language_model.model.layers.16 4 -language_model.model.layers.17 4 -language_model.model.layers.18 4 -language_model.model.layers.19 4 -language_model.model.layers.20 4 -language_model.model.layers.21 4 -language_model.model.layers.22 4 -language_model.model.layers.23 4 -vision_model.encoder.layers.0 0 -vision_model.encoder.layers.1 0 -vision_model.encoder.layers.2 0 -vision_model.encoder.layers.3 0 -vision_model.encoder.layers.4 0 -vision_model.encoder.layers.5 0 -vision_model.encoder.layers.6 0 -vision_model.encoder.layers.7 0 -vision_model.encoder.layers.8 0 -vision_model.encoder.layers.9 0 -vision_model.encoder.layers.10 0 -vision_model.encoder.layers.11 0 -vision_model.encoder.layers.12 0 -vision_model.encoder.layers.13 0 -vision_model.encoder.layers.14 0 -vision_model.encoder.layers.15 0 -vision_model.encoder.layers.16 0 -vision_model.encoder.layers.17 0 -vision_model.encoder.layers.18 0 -vision_model.encoder.layers.19 0 -vision_model.encoder.layers.20 0 -vision_model.encoder.layers.21 0 -vision_model.encoder.layers.22 0 -vision_model.encoder.layers.23 0 -vision_model.embeddings 0 -mlp1 0 -language_model.model.tok_embeddings 4 -language_model.model.norm 4 -language_model.output 4 -language_model.model.embed_tokens 4 -language_model.lm_head 4 -The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored. -The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored. -The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored. -The argument `trust_remote_code` is to be used with Auto classes. It has no effect here and is ignored. -Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. -Rank [3] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-image-test, devices: {device(type='cuda', index=3), device(type='cuda', index=7)} -Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. -Rank [0] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-image-test, devices: {device(type='cuda', index=0), device(type='cuda', index=4)} -Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. -Rank [2] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-image-test, devices: {device(type='cuda', index=2), device(type='cuda', index=6)} -Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. -Rank [1] Begin to eval model work_dirs/share_internvl/InternVL2-2B on task reasoning-image-test, devices: {device(type='cuda', index=1), device(type='cuda', index=5)} -Rank 2 len(skip_idx)=0 -Rank 3 len(skip_idx)=0 -Rank 0 len(skip_idx)=0 -Rank 1 len(skip_idx)=0 -[2024-08-03 15:13:16] [Rank 2] totoal_tokens=770, outputs='A' -[2024-08-03 15:13:16] [Rank 1] totoal_tokens=770, outputs='A' -[2024-08-03 15:13:16] [Rank 3] totoal_tokens=770, outputs='A' -[2024-08-03 15:13:16] [Rank 0] totoal_tokens=837, outputs='A' -[2024-08-03 15:13:17] [Rank 1] totoal_tokens=887, outputs='A' -[2024-08-03 15:13:17] [Rank 2] totoal_tokens=802, outputs='A' -[2024-08-03 15:13:17] [Rank 3] totoal_tokens=802, outputs='A' -[2024-08-03 15:13:17] [Rank 0] totoal_tokens=837, outputs='A' -[2024-08-03 15:13:17] [Rank 1] totoal_tokens=966, outputs='A' -[2024-08-03 15:13:17] [Rank 2] totoal_tokens=827, outputs='A' -[2024-08-03 15:13:17] [Rank 0] totoal_tokens=837, outputs='A' -[2024-08-03 15:13:17] [Rank 3] totoal_tokens=827, outputs='A' -[2024-08-03 15:13:17] [Rank 1] totoal_tokens=968, outputs='A' -[2024-08-03 15:13:17] [Rank 2] totoal_tokens=946, outputs='A' -[2024-08-03 15:13:17] [Rank 0] totoal_tokens=858, outputs='A' -[2024-08-03 15:13:17] [Rank 3] totoal_tokens=852, outputs='A' -[2024-08-03 15:13:17] [Rank 1] totoal_tokens=975, outputs='A' -[2024-08-03 15:13:17] [Rank 2] totoal_tokens=950, outputs='A' -[2024-08-03 15:13:17] [Rank 0] totoal_tokens=946, outputs='A' -[2024-08-03 15:13:17] [Rank 3] totoal_tokens=981, outputs='A' -[2024-08-03 15:13:18] [Rank 1] totoal_tokens=1003, outputs='A' -[2024-08-03 15:13:18] [Rank 2] totoal_tokens=964, outputs='A' -[2024-08-03 15:13:18] [Rank 0] totoal_tokens=964, outputs='A' -[2024-08-03 15:13:18] [Rank 3] totoal_tokens=981, outputs='A' -[2024-08-03 15:13:18] [Rank 1] totoal_tokens=1050, outputs='A' -[2024-08-03 15:13:18] [Rank 2] totoal_tokens=966, outputs='A' -[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1018, outputs='A' -[2024-08-03 15:13:18] [Rank 3] totoal_tokens=994, outputs='A' -[2024-08-03 15:13:18] [Rank 1] totoal_tokens=1082, outputs='A' -[2024-08-03 15:13:18] [Rank 2] totoal_tokens=966, outputs='A' -[2024-08-03 15:13:18] [Rank 3] totoal_tokens=1025, outputs='A' -[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1082, outputs='A' -[2024-08-03 15:13:18] [Rank 1] totoal_tokens=1107, outputs='A' -[2024-08-03 15:13:18] [Rank 2] totoal_tokens=981, outputs='A' -[2024-08-03 15:13:18] [Rank 3] totoal_tokens=1050, outputs='A' -[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1082, outputs='A' -[2024-08-03 15:13:18] [Rank 1] totoal_tokens=1149, outputs='A' -[2024-08-03 15:13:18] [Rank 2] totoal_tokens=994, outputs='A' -[2024-08-03 15:13:18] [Rank 3] totoal_tokens=1060, outputs='A' -[2024-08-03 15:13:18] [Rank 0] totoal_tokens=1085, outputs='A' - Processing InternVL2-2B_reasoning-image-test.jsonl: 0%| | 0/734 [00:00 work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/InternVL2-2B_reasoning-image-test.jsonl -cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/temp_InternVL2-2B_reasoning-image-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/InternVL2-2B_reasoning-image-test.jsonl -cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/temp_InternVL2-2B_reasoning-image-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/InternVL2-2B_reasoning-image-test.jsonl -cat work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/temp_InternVL2-2B_reasoning-image-test/* > work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/InternVL2-2B_reasoning-image-test.jsonl -python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test -python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test -python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test -python eval/mm_niah/calculate_scores.py --outputs-dir work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test -[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure. - -results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/results/InternVL2-2B/scores_test.json -[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure. - -results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/results/InternVL2-2B/scores_test.json -[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure. - -results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/results/InternVL2-2B/scores_test.json -[Warning] Since len(res)=1 is not equal to 6, the overall score will be ignored. Please ensure that you correctly organize the directory structure. - -results on test split of InternVL2-2B are save in work_dirs/share_internvl/InternVL2-2B/eval_mm_niah/reasoning-image-test/results/InternVL2-2B/scores_test.json