--- tags: - sentence-transformers - sentence-similarity - feature-extraction - mteb base_model: sbintuitions/modernbert-ja-130m language: - ja - en model-index: - name: retrieva-jp/amber-base results: - dataset: config: en name: MTEB AmazonCounterfactualClassification (en) revision: e8379541af4e31359cca9fbcf4b00f2671dba205 split: test type: mteb/amazon_counterfactual metrics: - type: accuracy value: 68.1642 - type: f1 value: 61.9811 - type: f1_weighted value: 71.2157 - type: ap value: 30.6541 - type: ap_weighted value: 30.6541 - type: main_score value: 68.1642 task: type: Classification - dataset: config: default name: MTEB ArXivHierarchicalClusteringP2P (default) revision: 0bbdb47bcbe3a90093699aefeed338a0f28a7ee8 split: test type: mteb/arxiv-clustering-p2p metrics: - type: v_measure value: 55.655100000000004 - type: v_measure_std value: 3.2918999999999996 - type: main_score value: 55.655100000000004 task: type: Clustering - dataset: config: default name: MTEB ArXivHierarchicalClusteringS2S (default) revision: b73bd54100e5abfa6e3a23dcafb46fe4d2438dc3 split: test type: mteb/arxiv-clustering-s2s metrics: - type: v_measure value: 53.6493 - type: v_measure_std value: 3.2359 - type: main_score value: 53.6493 task: type: Clustering - dataset: config: default name: MTEB ArguAna (default) revision: c22ab2a51041ffd869aaddef7af8d8215647e41a split: test type: mteb/arguana metrics: - type: ndcg_at_1 value: 25.249 - type: ndcg_at_3 value: 38.056 - type: ndcg_at_5 value: 43.124 - type: ndcg_at_10 value: 48.068 - type: ndcg_at_20 value: 51.461 - type: ndcg_at_100 value: 53.15800000000001 - type: ndcg_at_1000 value: 53.38 - type: map_at_1 value: 25.249 - type: map_at_3 value: 34.803 - type: map_at_5 value: 37.598 - type: map_at_10 value: 39.611000000000004 - type: map_at_20 value: 40.569 - type: map_at_100 value: 40.821000000000005 - type: map_at_1000 value: 40.83 - type: recall_at_1 value: 25.249 - type: recall_at_3 value: 47.510999999999996 - type: recall_at_5 value: 59.885999999999996 - type: recall_at_10 value: 75.32 - type: recall_at_20 value: 88.549 - type: recall_at_100 value: 97.44 - type: recall_at_1000 value: 99.14699999999999 - type: precision_at_1 value: 25.249 - type: precision_at_3 value: 15.837000000000002 - type: precision_at_5 value: 11.977 - type: precision_at_10 value: 7.532 - type: precision_at_20 value: 4.427 - type: precision_at_100 value: 0.9740000000000001 - type: precision_at_1000 value: 0.099 - type: mrr_at_1 value: 25.817899999999998 - type: mrr_at_3 value: 34.9692 - type: mrr_at_5 value: 37.7928 - type: mrr_at_10 value: 39.8238 - type: mrr_at_20 value: 40.7844 - type: mrr_at_100 value: 41.0403 - type: mrr_at_1000 value: 41.0495 - type: nauc_ndcg_at_1_max value: -2.6569 - type: nauc_ndcg_at_1_std value: -2.4726000000000004 - type: nauc_ndcg_at_1_diff1 value: 10.259699999999999 - type: nauc_ndcg_at_3_max value: -0.8151 - type: nauc_ndcg_at_3_std value: -3.3642 - type: nauc_ndcg_at_3_diff1 value: 7.884099999999999 - type: nauc_ndcg_at_5_max value: -0.3906 - type: nauc_ndcg_at_5_std value: -2.4619 - type: nauc_ndcg_at_5_diff1 value: 7.558 - type: nauc_ndcg_at_10_max value: 1.0935000000000001 - type: nauc_ndcg_at_10_std value: -1.8624999999999998 - type: nauc_ndcg_at_10_diff1 value: 8.0503 - type: nauc_ndcg_at_20_max value: 1.3164 - type: nauc_ndcg_at_20_std value: -1.3407 - type: nauc_ndcg_at_20_diff1 value: 7.8992 - type: nauc_ndcg_at_100_max value: 0.8316 - type: nauc_ndcg_at_100_std value: -0.8725 - type: nauc_ndcg_at_100_diff1 value: 8.5633 - type: nauc_ndcg_at_1000_max value: 0.44999999999999996 - type: nauc_ndcg_at_1000_std value: -1.4357 - type: nauc_ndcg_at_1000_diff1 value: 8.4438 - type: nauc_map_at_1_max value: -2.6569 - type: nauc_map_at_1_std value: -2.4726000000000004 - type: nauc_map_at_1_diff1 value: 10.259699999999999 - type: nauc_map_at_3_max value: -1.3567 - type: nauc_map_at_3_std value: -3.222 - type: nauc_map_at_3_diff1 value: 8.3557 - type: nauc_map_at_5_max value: -1.162 - type: nauc_map_at_5_std value: -2.7384 - type: nauc_map_at_5_diff1 value: 8.118400000000001 - type: nauc_map_at_10_max value: -0.615 - type: nauc_map_at_10_std value: -2.5394 - type: nauc_map_at_10_diff1 value: 8.283100000000001 - type: nauc_map_at_20_max value: -0.5492 - type: nauc_map_at_20_std value: -2.4076 - type: nauc_map_at_20_diff1 value: 8.280999999999999 - type: nauc_map_at_100_max value: -0.6049 - type: nauc_map_at_100_std value: -2.3560000000000003 - type: nauc_map_at_100_diff1 value: 8.3933 - type: nauc_map_at_1000_max value: -0.6154 - type: nauc_map_at_1000_std value: -2.373 - type: nauc_map_at_1000_diff1 value: 8.3902 - type: nauc_recall_at_1_max value: -2.6569 - type: nauc_recall_at_1_std value: -2.4726000000000004 - type: nauc_recall_at_1_diff1 value: 10.259699999999999 - type: nauc_recall_at_3_max value: 0.7234 - type: nauc_recall_at_3_std value: -3.7315 - type: nauc_recall_at_3_diff1 value: 6.6138 - type: nauc_recall_at_5_max value: 2.0847 - type: nauc_recall_at_5_std value: -1.4385000000000001 - type: nauc_recall_at_5_diff1 value: 5.9428 - type: nauc_recall_at_10_max value: 9.2417 - type: nauc_recall_at_10_std value: 1.6372000000000002 - type: nauc_recall_at_10_diff1 value: 7.6442 - type: nauc_recall_at_20_max value: 17.9819 - type: nauc_recall_at_20_std value: 9.3827 - type: nauc_recall_at_20_diff1 value: 5.2288 - type: nauc_recall_at_100_max value: 46.3576 - type: nauc_recall_at_100_std value: 69.5314 - type: nauc_recall_at_100_diff1 value: 25.2365 - type: nauc_recall_at_1000_max value: 47.3173 - type: nauc_recall_at_1000_std value: 80.3564 - type: nauc_recall_at_1000_diff1 value: 30.506 - type: nauc_precision_at_1_max value: -2.6569 - type: nauc_precision_at_1_std value: -2.4726000000000004 - type: nauc_precision_at_1_diff1 value: 10.259699999999999 - type: nauc_precision_at_3_max value: 0.7234 - type: nauc_precision_at_3_std value: -3.7315 - type: nauc_precision_at_3_diff1 value: 6.6138 - type: nauc_precision_at_5_max value: 2.0847 - type: nauc_precision_at_5_std value: -1.4385000000000001 - type: nauc_precision_at_5_diff1 value: 5.9428 - type: nauc_precision_at_10_max value: 9.2417 - type: nauc_precision_at_10_std value: 1.6372000000000002 - type: nauc_precision_at_10_diff1 value: 7.6442 - type: nauc_precision_at_20_max value: 17.9819 - type: nauc_precision_at_20_std value: 9.3827 - type: nauc_precision_at_20_diff1 value: 5.2288 - type: nauc_precision_at_100_max value: 46.3576 - type: nauc_precision_at_100_std value: 69.5314 - type: nauc_precision_at_100_diff1 value: 25.2365 - type: nauc_precision_at_1000_max value: 47.3173 - type: nauc_precision_at_1000_std value: 80.3564 - type: nauc_precision_at_1000_diff1 value: 30.506 - type: nauc_mrr_at_1_max value: -2.5852 - type: nauc_mrr_at_1_std value: -2.7133000000000003 - type: nauc_mrr_at_1_diff1 value: 8.3902 - type: nauc_mrr_at_3_max value: -2.3878 - type: nauc_mrr_at_3_std value: -3.1916 - type: nauc_mrr_at_3_diff1 value: 6.3759999999999994 - type: nauc_mrr_at_5_max value: -2.0079 - type: nauc_mrr_at_5_std value: -2.9791000000000003 - type: nauc_mrr_at_5_diff1 value: 6.3531 - type: nauc_mrr_at_10_max value: -1.41 - type: nauc_mrr_at_10_std value: -2.7921 - type: nauc_mrr_at_10_diff1 value: 6.514200000000001 - type: nauc_mrr_at_20_max value: -1.35 - type: nauc_mrr_at_20_std value: -2.6331 - type: nauc_mrr_at_20_diff1 value: 6.4700999999999995 - type: nauc_mrr_at_100_max value: -1.393 - type: nauc_mrr_at_100_std value: -2.5819 - type: nauc_mrr_at_100_diff1 value: 6.5875 - type: nauc_mrr_at_1000_max value: -1.4037000000000002 - type: nauc_mrr_at_1000_std value: -2.5989 - type: nauc_mrr_at_1000_diff1 value: 6.583799999999999 - type: main_score value: 48.068 task: type: Retrieval - dataset: config: default name: MTEB AskUbuntuDupQuestions (default) revision: 2000358ca161889fa9c082cb41daa8dcfb161a54 split: test type: mteb/askubuntudupquestions-reranking metrics: - type: map value: 56.5225 - type: mrr value: 70.5146 - type: nAUC_map_max value: 18.224 - type: nAUC_map_std value: 12.5352 - type: nAUC_map_diff1 value: 14.0464 - type: nAUC_mrr_max value: 28.619699999999998 - type: nAUC_mrr_std value: 21.69 - type: nAUC_mrr_diff1 value: 15.8021 - type: main_score value: 56.5225 task: type: Reranking - dataset: config: default name: MTEB BIOSSES (default) revision: d3fb88f8f02e40887cd149695127462bbcf29b4a split: test type: mteb/biosses-sts metrics: - type: pearson value: 86.6855 - type: spearman value: 83.17360000000001 - type: cosine_pearson value: 86.6855 - type: cosine_spearman value: 83.17360000000001 - type: manhattan_pearson value: 85.5442 - type: manhattan_spearman value: 83.9501 - type: euclidean_pearson value: 85.0403 - type: euclidean_spearman value: 83.17360000000001 - type: main_score value: 83.17360000000001 task: type: STS - dataset: config: default name: MTEB Banking77Classification (default) revision: 0fd18e25b25c072e09e0d92ab615fda904d66300 split: test type: mteb/banking77 metrics: - type: accuracy value: 76.3312 - type: f1 value: 75.4609 - type: f1_weighted value: 75.4609 - type: main_score value: 76.3312 task: type: Classification - dataset: config: default name: MTEB BiorxivClusteringP2P.v2 (default) revision: f5dbc242e11dd8e24def4c4268607a49e02946dc split: test type: mteb/biorxiv-clustering-p2p metrics: - type: v_measure value: 33.6692 - type: v_measure_std value: 0.769 - type: main_score value: 33.6692 task: type: Clustering - dataset: config: default name: MTEB CQADupstackGamingRetrieval (default) revision: 4885aa143210c98657558c04aaf3dc47cfb54340 split: test type: mteb/cqadupstack-gaming metrics: - type: ndcg_at_1 value: 30.345 - type: ndcg_at_3 value: 37.726 - type: ndcg_at_5 value: 39.999 - type: ndcg_at_10 value: 42.732 - type: ndcg_at_20 value: 44.696000000000005 - type: ndcg_at_100 value: 47.461 - type: ndcg_at_1000 value: 49.341 - type: map_at_1 value: 26.484999999999996 - type: map_at_3 value: 34.474 - type: map_at_5 value: 35.94 - type: map_at_10 value: 37.24 - type: map_at_20 value: 37.852999999999994 - type: map_at_100 value: 38.286 - type: map_at_1000 value: 38.369 - type: recall_at_1 value: 26.484999999999996 - type: recall_at_3 value: 42.857 - type: recall_at_5 value: 48.501 - type: recall_at_10 value: 56.48 - type: recall_at_20 value: 63.81099999999999 - type: recall_at_100 value: 77.518 - type: recall_at_1000 value: 90.89 - type: precision_at_1 value: 30.345 - type: precision_at_3 value: 17.241 - type: precision_at_5 value: 11.962 - type: precision_at_10 value: 7.204000000000001 - type: precision_at_20 value: 4.1290000000000004 - type: precision_at_100 value: 1.0330000000000001 - type: precision_at_1000 value: 0.127 - type: mrr_at_1 value: 30.3448 - type: mrr_at_3 value: 37.5131 - type: mrr_at_5 value: 38.8516 - type: mrr_at_10 value: 39.915299999999995 - type: mrr_at_20 value: 40.428599999999996 - type: mrr_at_100 value: 40.7757 - type: mrr_at_1000 value: 40.8275 - type: nauc_ndcg_at_1_max value: 30.5442 - type: nauc_ndcg_at_1_std value: -10.3888 - type: nauc_ndcg_at_1_diff1 value: 52.476 - type: nauc_ndcg_at_3_max value: 28.6927 - type: nauc_ndcg_at_3_std value: -8.8728 - type: nauc_ndcg_at_3_diff1 value: 45.094699999999996 - type: nauc_ndcg_at_5_max value: 29.259600000000002 - type: nauc_ndcg_at_5_std value: -7.945399999999999 - type: nauc_ndcg_at_5_diff1 value: 44.600699999999996 - type: nauc_ndcg_at_10_max value: 29.9977 - type: nauc_ndcg_at_10_std value: -6.1746 - type: nauc_ndcg_at_10_diff1 value: 44.2832 - type: nauc_ndcg_at_20_max value: 30.034100000000002 - type: nauc_ndcg_at_20_std value: -4.8941 - type: nauc_ndcg_at_20_diff1 value: 43.3814 - type: nauc_ndcg_at_100_max value: 30.812800000000003 - type: nauc_ndcg_at_100_std value: -3.5000999999999998 - type: nauc_ndcg_at_100_diff1 value: 43.345 - type: nauc_ndcg_at_1000_max value: 30.9884 - type: nauc_ndcg_at_1000_std value: -3.9316999999999998 - type: nauc_ndcg_at_1000_diff1 value: 43.6512 - type: nauc_map_at_1_max value: 27.442800000000002 - type: nauc_map_at_1_std value: -9.8884 - type: nauc_map_at_1_diff1 value: 52.666999999999994 - type: nauc_map_at_3_max value: 27.897100000000002 - type: nauc_map_at_3_std value: -9.777 - type: nauc_map_at_3_diff1 value: 47.013 - type: nauc_map_at_5_max value: 28.3476 - type: nauc_map_at_5_std value: -9.3335 - type: nauc_map_at_5_diff1 value: 46.7246 - type: nauc_map_at_10_max value: 28.921000000000003 - type: nauc_map_at_10_std value: -8.4018 - type: nauc_map_at_10_diff1 value: 46.5358 - type: nauc_map_at_20_max value: 29.033900000000003 - type: nauc_map_at_20_std value: -7.985100000000001 - type: nauc_map_at_20_diff1 value: 46.2362 - type: nauc_map_at_100_max value: 29.2382 - type: nauc_map_at_100_std value: -7.7172 - type: nauc_map_at_100_diff1 value: 46.2663 - type: nauc_map_at_1000_max value: 29.263699999999996 - type: nauc_map_at_1000_std value: -7.7108 - type: nauc_map_at_1000_diff1 value: 46.2735 - type: nauc_recall_at_1_max value: 27.442800000000002 - type: nauc_recall_at_1_std value: -9.8884 - type: nauc_recall_at_1_diff1 value: 52.666999999999994 - type: nauc_recall_at_3_max value: 25.7102 - type: nauc_recall_at_3_std value: -8.2064 - type: nauc_recall_at_3_diff1 value: 39.145 - type: nauc_recall_at_5_max value: 27.244699999999998 - type: nauc_recall_at_5_std value: -5.943 - type: nauc_recall_at_5_diff1 value: 38.024 - type: nauc_recall_at_10_max value: 29.226000000000003 - type: nauc_recall_at_10_std value: -0.2402 - type: nauc_recall_at_10_diff1 value: 36.58 - type: nauc_recall_at_20_max value: 29.567500000000003 - type: nauc_recall_at_20_std value: 6.2502 - type: nauc_recall_at_20_diff1 value: 32.092999999999996 - type: nauc_recall_at_100_max value: 33.8086 - type: nauc_recall_at_100_std value: 20.092 - type: nauc_recall_at_100_diff1 value: 27.5754 - type: nauc_recall_at_1000_max value: 38.0782 - type: nauc_recall_at_1000_std value: 34.3309 - type: nauc_recall_at_1000_diff1 value: 17.712 - type: nauc_precision_at_1_max value: 30.5442 - type: nauc_precision_at_1_std value: -10.3888 - type: nauc_precision_at_1_diff1 value: 52.476 - type: nauc_precision_at_3_max value: 29.0858 - type: nauc_precision_at_3_std value: -5.8233 - type: nauc_precision_at_3_diff1 value: 33.480900000000005 - type: nauc_precision_at_5_max value: 30.425200000000004 - type: nauc_precision_at_5_std value: -2.0077000000000003 - type: nauc_precision_at_5_diff1 value: 29.5631 - type: nauc_precision_at_10_max value: 30.8693 - type: nauc_precision_at_10_std value: 4.5986 - type: nauc_precision_at_10_diff1 value: 23.346600000000002 - type: nauc_precision_at_20_max value: 29.6844 - type: nauc_precision_at_20_std value: 9.4699 - type: nauc_precision_at_20_diff1 value: 15.9193 - type: nauc_precision_at_100_max value: 29.7036 - type: nauc_precision_at_100_std value: 19.0186 - type: nauc_precision_at_100_diff1 value: 5.9221 - type: nauc_precision_at_1000_max value: 24.6994 - type: nauc_precision_at_1000_std value: 18.0033 - type: nauc_precision_at_1000_diff1 value: -3.2275 - type: nauc_mrr_at_1_max value: 30.5442 - type: nauc_mrr_at_1_std value: -10.3888 - type: nauc_mrr_at_1_diff1 value: 52.476 - type: nauc_mrr_at_3_max value: 29.7504 - type: nauc_mrr_at_3_std value: -9.5234 - type: nauc_mrr_at_3_diff1 value: 46.5068 - type: nauc_mrr_at_5_max value: 30.341099999999997 - type: nauc_mrr_at_5_std value: -8.4966 - type: nauc_mrr_at_5_diff1 value: 46.051199999999994 - type: nauc_mrr_at_10_max value: 30.6066 - type: nauc_mrr_at_10_std value: -7.8854 - type: nauc_mrr_at_10_diff1 value: 46.035199999999996 - type: nauc_mrr_at_20_max value: 30.570199999999996 - type: nauc_mrr_at_20_std value: -7.614700000000001 - type: nauc_mrr_at_20_diff1 value: 45.8861 - type: nauc_mrr_at_100_max value: 30.589100000000002 - type: nauc_mrr_at_100_std value: -7.5529 - type: nauc_mrr_at_100_diff1 value: 45.907 - type: nauc_mrr_at_1000_max value: 30.587799999999998 - type: nauc_mrr_at_1000_std value: -7.5716 - type: nauc_mrr_at_1000_diff1 value: 45.9244 - type: main_score value: 42.732 task: type: Retrieval - dataset: config: default name: MTEB CQADupstackUnixRetrieval (default) revision: 6c6430d3a6d36f8d2a829195bc5dc94d7e063e53 split: test type: mteb/cqadupstack-unix metrics: - type: ndcg_at_1 value: 18.843 - type: ndcg_at_3 value: 22.131 - type: ndcg_at_5 value: 23.772 - type: ndcg_at_10 value: 25.661 - type: ndcg_at_20 value: 27.939999999999998 - type: ndcg_at_100 value: 31.645 - type: ndcg_at_1000 value: 34.687 - type: map_at_1 value: 16.194 - type: map_at_3 value: 20.068 - type: map_at_5 value: 21.075 - type: map_at_10 value: 21.913 - type: map_at_20 value: 22.569 - type: map_at_100 value: 23.107 - type: map_at_1000 value: 23.23 - type: recall_at_1 value: 16.194 - type: recall_at_3 value: 24.704 - type: recall_at_5 value: 28.859 - type: recall_at_10 value: 34.402 - type: recall_at_20 value: 42.714 - type: recall_at_100 value: 61.19799999999999 - type: recall_at_1000 value: 82.953 - type: precision_at_1 value: 18.843 - type: precision_at_3 value: 9.919 - type: precision_at_5 value: 7.071 - type: precision_at_10 value: 4.328 - type: precision_at_20 value: 2.752 - type: precision_at_100 value: 0.823 - type: precision_at_1000 value: 0.121 - type: mrr_at_1 value: 18.8433 - type: mrr_at_3 value: 22.776699999999998 - type: mrr_at_5 value: 23.9055 - type: mrr_at_10 value: 24.7244 - type: mrr_at_20 value: 25.3919 - type: mrr_at_100 value: 25.8783 - type: mrr_at_1000 value: 25.957900000000002 - type: nauc_ndcg_at_1_max value: 35.1013 - type: nauc_ndcg_at_1_std value: 4.116899999999999 - type: nauc_ndcg_at_1_diff1 value: 54.3984 - type: nauc_ndcg_at_3_max value: 35.1035 - type: nauc_ndcg_at_3_std value: 5.3618 - type: nauc_ndcg_at_3_diff1 value: 47.4455 - type: nauc_ndcg_at_5_max value: 34.3845 - type: nauc_ndcg_at_5_std value: 5.4364 - type: nauc_ndcg_at_5_diff1 value: 44.8757 - type: nauc_ndcg_at_10_max value: 33.4252 - type: nauc_ndcg_at_10_std value: 7.100099999999999 - type: nauc_ndcg_at_10_diff1 value: 43.0854 - type: nauc_ndcg_at_20_max value: 33.2135 - type: nauc_ndcg_at_20_std value: 7.750500000000001 - type: nauc_ndcg_at_20_diff1 value: 42.5065 - type: nauc_ndcg_at_100_max value: 34.0845 - type: nauc_ndcg_at_100_std value: 9.0937 - type: nauc_ndcg_at_100_diff1 value: 40.9634 - type: nauc_ndcg_at_1000_max value: 34.3716 - type: nauc_ndcg_at_1000_std value: 9.8049 - type: nauc_ndcg_at_1000_diff1 value: 41.606 - type: nauc_map_at_1_max value: 35.054 - type: nauc_map_at_1_std value: 3.4526000000000003 - type: nauc_map_at_1_diff1 value: 55.69840000000001 - type: nauc_map_at_3_max value: 34.861 - type: nauc_map_at_3_std value: 4.6036 - type: nauc_map_at_3_diff1 value: 49.338 - type: nauc_map_at_5_max value: 34.3213 - type: nauc_map_at_5_std value: 4.7856000000000005 - type: nauc_map_at_5_diff1 value: 47.856 - type: nauc_map_at_10_max value: 33.9813 - type: nauc_map_at_10_std value: 5.649 - type: nauc_map_at_10_diff1 value: 47.0563 - type: nauc_map_at_20_max value: 33.8854 - type: nauc_map_at_20_std value: 5.9026000000000005 - type: nauc_map_at_20_diff1 value: 46.876200000000004 - type: nauc_map_at_100_max value: 33.996500000000005 - type: nauc_map_at_100_std value: 6.094200000000001 - type: nauc_map_at_100_diff1 value: 46.6388 - type: nauc_map_at_1000_max value: 34.0082 - type: nauc_map_at_1000_std value: 6.1436 - type: nauc_map_at_1000_diff1 value: 46.643 - type: nauc_recall_at_1_max value: 35.054 - type: nauc_recall_at_1_std value: 3.4526000000000003 - type: nauc_recall_at_1_diff1 value: 55.69840000000001 - type: nauc_recall_at_3_max value: 34.2271 - type: nauc_recall_at_3_std value: 5.573 - type: nauc_recall_at_3_diff1 value: 42.0593 - type: nauc_recall_at_5_max value: 32.7785 - type: nauc_recall_at_5_std value: 6.188599999999999 - type: nauc_recall_at_5_diff1 value: 36.9345 - type: nauc_recall_at_10_max value: 29.7004 - type: nauc_recall_at_10_std value: 10.3771 - type: nauc_recall_at_10_diff1 value: 31.6352 - type: nauc_recall_at_20_max value: 28.474100000000004 - type: nauc_recall_at_20_std value: 12.3244 - type: nauc_recall_at_20_diff1 value: 29.6458 - type: nauc_recall_at_100_max value: 31.2612 - type: nauc_recall_at_100_std value: 19.1574 - type: nauc_recall_at_100_diff1 value: 19.7616 - type: nauc_recall_at_1000_max value: 33.2982 - type: nauc_recall_at_1000_std value: 36.4068 - type: nauc_recall_at_1000_diff1 value: 15.3188 - type: nauc_precision_at_1_max value: 35.1013 - type: nauc_precision_at_1_std value: 4.116899999999999 - type: nauc_precision_at_1_diff1 value: 54.3984 - type: nauc_precision_at_3_max value: 34.4651 - type: nauc_precision_at_3_std value: 7.8735 - type: nauc_precision_at_3_diff1 value: 39.7844 - type: nauc_precision_at_5_max value: 32.2792 - type: nauc_precision_at_5_std value: 8.465 - type: nauc_precision_at_5_diff1 value: 34.130700000000004 - type: nauc_precision_at_10_max value: 28.197699999999998 - type: nauc_precision_at_10_std value: 12.1518 - type: nauc_precision_at_10_diff1 value: 28.672900000000002 - type: nauc_precision_at_20_max value: 27.2073 - type: nauc_precision_at_20_std value: 14.113100000000001 - type: nauc_precision_at_20_diff1 value: 23.623 - type: nauc_precision_at_100_max value: 22.906399999999998 - type: nauc_precision_at_100_std value: 16.7201 - type: nauc_precision_at_100_diff1 value: 7.0853 - type: nauc_precision_at_1000_max value: 10.5167 - type: nauc_precision_at_1000_std value: 11.5017 - type: nauc_precision_at_1000_diff1 value: -6.6079 - type: nauc_mrr_at_1_max value: 35.1013 - type: nauc_mrr_at_1_std value: 4.116899999999999 - type: nauc_mrr_at_1_diff1 value: 54.3984 - type: nauc_mrr_at_3_max value: 35.489399999999996 - type: nauc_mrr_at_3_std value: 5.097700000000001 - type: nauc_mrr_at_3_diff1 value: 48.8783 - type: nauc_mrr_at_5_max value: 35.2093 - type: nauc_mrr_at_5_std value: 5.2317 - type: nauc_mrr_at_5_diff1 value: 47.3602 - type: nauc_mrr_at_10_max value: 34.731 - type: nauc_mrr_at_10_std value: 5.7762 - type: nauc_mrr_at_10_diff1 value: 46.495999999999995 - type: nauc_mrr_at_20_max value: 34.6509 - type: nauc_mrr_at_20_std value: 5.8511 - type: nauc_mrr_at_20_diff1 value: 46.386500000000005 - type: nauc_mrr_at_100_max value: 34.7761 - type: nauc_mrr_at_100_std value: 6.0355 - type: nauc_mrr_at_100_diff1 value: 46.2476 - type: nauc_mrr_at_1000_max value: 34.792699999999996 - type: nauc_mrr_at_1000_std value: 6.0607 - type: nauc_mrr_at_1000_diff1 value: 46.281800000000004 - type: main_score value: 25.661 task: type: Retrieval - dataset: config: default name: MTEB ClimateFEVERHardNegatives (default) revision: 3a309e201f3c2c4b13bd4a367a8f37eee2ec1d21 split: test type: mteb/ClimateFEVER_test_top_250_only_w_correct-v2 metrics: - type: ndcg_at_1 value: 16.8 - type: ndcg_at_3 value: 15.503 - type: ndcg_at_5 value: 17.5 - type: ndcg_at_10 value: 20.642 - type: ndcg_at_20 value: 23.07 - type: ndcg_at_100 value: 27.639000000000003 - type: ndcg_at_1000 value: 32.041 - type: map_at_1 value: 7.885000000000001 - type: map_at_3 value: 11.128 - type: map_at_5 value: 12.565999999999999 - type: map_at_10 value: 13.876 - type: map_at_20 value: 14.66 - type: map_at_100 value: 15.432000000000002 - type: map_at_1000 value: 15.655 - type: recall_at_1 value: 7.885000000000001 - type: recall_at_3 value: 14.957 - type: recall_at_5 value: 19.675 - type: recall_at_10 value: 26.868 - type: recall_at_20 value: 33.94 - type: recall_at_100 value: 51.833 - type: recall_at_1000 value: 76.822 - type: precision_at_1 value: 16.8 - type: precision_at_3 value: 11.533 - type: precision_at_5 value: 9.56 - type: precision_at_10 value: 6.83 - type: precision_at_20 value: 4.41 - type: precision_at_100 value: 1.432 - type: precision_at_1000 value: 0.22499999999999998 - type: mrr_at_1 value: 16.8 - type: mrr_at_3 value: 23.2333 - type: mrr_at_5 value: 25.2183 - type: mrr_at_10 value: 26.775 - type: mrr_at_20 value: 27.4121 - type: mrr_at_100 value: 27.882299999999997 - type: mrr_at_1000 value: 27.9472 - type: nauc_ndcg_at_1_max value: 28.3609 - type: nauc_ndcg_at_1_std value: 10.5951 - type: nauc_ndcg_at_1_diff1 value: 16.566 - type: nauc_ndcg_at_3_max value: 33.3794 - type: nauc_ndcg_at_3_std value: 14.645900000000001 - type: nauc_ndcg_at_3_diff1 value: 15.4617 - type: nauc_ndcg_at_5_max value: 33.5092 - type: nauc_ndcg_at_5_std value: 16.209699999999998 - type: nauc_ndcg_at_5_diff1 value: 16.7386 - type: nauc_ndcg_at_10_max value: 37.101299999999995 - type: nauc_ndcg_at_10_std value: 20.939 - type: nauc_ndcg_at_10_diff1 value: 15.1232 - type: nauc_ndcg_at_20_max value: 38.3563 - type: nauc_ndcg_at_20_std value: 22.3038 - type: nauc_ndcg_at_20_diff1 value: 14.613100000000001 - type: nauc_ndcg_at_100_max value: 39.5793 - type: nauc_ndcg_at_100_std value: 23.3348 - type: nauc_ndcg_at_100_diff1 value: 13.6571 - type: nauc_ndcg_at_1000_max value: 39.2582 - type: nauc_ndcg_at_1000_std value: 22.5989 - type: nauc_ndcg_at_1000_diff1 value: 12.6784 - type: nauc_map_at_1_max value: 36.9819 - type: nauc_map_at_1_std value: 11.5065 - type: nauc_map_at_1_diff1 value: 22.4791 - type: nauc_map_at_3_max value: 35.324299999999994 - type: nauc_map_at_3_std value: 13.572000000000001 - type: nauc_map_at_3_diff1 value: 19.3415 - type: nauc_map_at_5_max value: 35.0138 - type: nauc_map_at_5_std value: 14.857600000000001 - type: nauc_map_at_5_diff1 value: 19.5352 - type: nauc_map_at_10_max value: 36.8267 - type: nauc_map_at_10_std value: 17.6287 - type: nauc_map_at_10_diff1 value: 18.2802 - type: nauc_map_at_20_max value: 37.5214 - type: nauc_map_at_20_std value: 18.319399999999998 - type: nauc_map_at_20_diff1 value: 18.0343 - type: nauc_map_at_100_max value: 37.933499999999995 - type: nauc_map_at_100_std value: 18.6864 - type: nauc_map_at_100_diff1 value: 17.7119 - type: nauc_map_at_1000_max value: 37.9509 - type: nauc_map_at_1000_std value: 18.6975 - type: nauc_map_at_1000_diff1 value: 17.5997 - type: nauc_recall_at_1_max value: 36.9819 - type: nauc_recall_at_1_std value: 11.5065 - type: nauc_recall_at_1_diff1 value: 22.4791 - type: nauc_recall_at_3_max value: 33.0875 - type: nauc_recall_at_3_std value: 16.3976 - type: nauc_recall_at_3_diff1 value: 15.6164 - type: nauc_recall_at_5_max value: 30.604799999999997 - type: nauc_recall_at_5_std value: 17.1699 - type: nauc_recall_at_5_diff1 value: 15.639800000000001 - type: nauc_recall_at_10_max value: 35.342400000000005 - type: nauc_recall_at_10_std value: 24.665599999999998 - type: nauc_recall_at_10_diff1 value: 11.9499 - type: nauc_recall_at_20_max value: 35.956700000000005 - type: nauc_recall_at_20_std value: 26.556800000000003 - type: nauc_recall_at_20_diff1 value: 10.0239 - type: nauc_recall_at_100_max value: 36.1012 - type: nauc_recall_at_100_std value: 27.8055 - type: nauc_recall_at_100_diff1 value: 6.3591 - type: nauc_recall_at_1000_max value: 34.7202 - type: nauc_recall_at_1000_std value: 26.378 - type: nauc_recall_at_1000_diff1 value: -0.7171000000000001 - type: nauc_precision_at_1_max value: 28.3609 - type: nauc_precision_at_1_std value: 10.5951 - type: nauc_precision_at_1_diff1 value: 16.566 - type: nauc_precision_at_3_max value: 30.490000000000002 - type: nauc_precision_at_3_std value: 16.270899999999997 - type: nauc_precision_at_3_diff1 value: 9.7026 - type: nauc_precision_at_5_max value: 29.3491 - type: nauc_precision_at_5_std value: 19.084699999999998 - type: nauc_precision_at_5_diff1 value: 10.7809 - type: nauc_precision_at_10_max value: 34.753699999999995 - type: nauc_precision_at_10_std value: 28.155 - type: nauc_precision_at_10_diff1 value: 5.6554 - type: nauc_precision_at_20_max value: 33.3812 - type: nauc_precision_at_20_std value: 27.122400000000003 - type: nauc_precision_at_20_diff1 value: 3.6636 - type: nauc_precision_at_100_max value: 28.7799 - type: nauc_precision_at_100_std value: 23.9905 - type: nauc_precision_at_100_diff1 value: -0.5301 - type: nauc_precision_at_1000_max value: 13.068399999999999 - type: nauc_precision_at_1000_std value: 12.9133 - type: nauc_precision_at_1000_diff1 value: -8.8717 - type: nauc_mrr_at_1_max value: 28.3609 - type: nauc_mrr_at_1_std value: 10.5951 - type: nauc_mrr_at_1_diff1 value: 16.566 - type: nauc_mrr_at_3_max value: 30.9311 - type: nauc_mrr_at_3_std value: 13.9549 - type: nauc_mrr_at_3_diff1 value: 12.851399999999998 - type: nauc_mrr_at_5_max value: 30.893700000000003 - type: nauc_mrr_at_5_std value: 14.464599999999999 - type: nauc_mrr_at_5_diff1 value: 13.2001 - type: nauc_mrr_at_10_max value: 32.277499999999996 - type: nauc_mrr_at_10_std value: 15.9378 - type: nauc_mrr_at_10_diff1 value: 12.9887 - type: nauc_mrr_at_20_max value: 32.3817 - type: nauc_mrr_at_20_std value: 16.0469 - type: nauc_mrr_at_20_diff1 value: 13.039200000000001 - type: nauc_mrr_at_100_max value: 32.386900000000004 - type: nauc_mrr_at_100_std value: 15.966800000000001 - type: nauc_mrr_at_100_diff1 value: 12.982 - type: nauc_mrr_at_1000_max value: 32.347300000000004 - type: nauc_mrr_at_1000_std value: 15.9096 - type: nauc_mrr_at_1000_diff1 value: 12.9742 - type: main_score value: 20.642 task: type: Retrieval - dataset: config: default name: MTEB FEVERHardNegatives (default) revision: 080c9ed6267b65029207906e815d44a9240bafca split: test type: mteb/FEVER_test_top_250_only_w_correct-v2 metrics: - type: ndcg_at_1 value: 46.9 - type: ndcg_at_3 value: 57.825 - type: ndcg_at_5 value: 61.245000000000005 - type: ndcg_at_10 value: 63.836000000000006 - type: ndcg_at_20 value: 65.408 - type: ndcg_at_100 value: 66.796 - type: ndcg_at_1000 value: 67.216 - type: map_at_1 value: 43.999 - type: map_at_3 value: 53.813 - type: map_at_5 value: 55.741 - type: map_at_10 value: 56.852999999999994 - type: map_at_20 value: 57.30800000000001 - type: map_at_100 value: 57.54 - type: map_at_1000 value: 57.56099999999999 - type: recall_at_1 value: 43.999 - type: recall_at_3 value: 66.184 - type: recall_at_5 value: 74.557 - type: recall_at_10 value: 82.394 - type: recall_at_20 value: 88.51 - type: recall_at_100 value: 95.253 - type: recall_at_1000 value: 98.031 - type: precision_at_1 value: 46.9 - type: precision_at_3 value: 23.599999999999998 - type: precision_at_5 value: 15.98 - type: precision_at_10 value: 8.85 - type: precision_at_20 value: 4.760000000000001 - type: precision_at_100 value: 1.045 - type: precision_at_1000 value: 0.11 - type: mrr_at_1 value: 46.9 - type: mrr_at_3 value: 57.0167 - type: mrr_at_5 value: 59.046699999999994 - type: mrr_at_10 value: 60.1422 - type: mrr_at_20 value: 60.535799999999995 - type: mrr_at_100 value: 60.716 - type: mrr_at_1000 value: 60.7232 - type: nauc_ndcg_at_1_max value: 12.741900000000001 - type: nauc_ndcg_at_1_std value: -20.011000000000003 - type: nauc_ndcg_at_1_diff1 value: 51.02100000000001 - type: nauc_ndcg_at_3_max value: 17.416400000000003 - type: nauc_ndcg_at_3_std value: -20.9336 - type: nauc_ndcg_at_3_diff1 value: 46.3134 - type: nauc_ndcg_at_5_max value: 18.2369 - type: nauc_ndcg_at_5_std value: -21.5645 - type: nauc_ndcg_at_5_diff1 value: 46.261799999999994 - type: nauc_ndcg_at_10_max value: 18.8528 - type: nauc_ndcg_at_10_std value: -20.6893 - type: nauc_ndcg_at_10_diff1 value: 46.5862 - type: nauc_ndcg_at_20_max value: 18.0211 - type: nauc_ndcg_at_20_std value: -19.652 - type: nauc_ndcg_at_20_diff1 value: 46.5482 - type: nauc_ndcg_at_100_max value: 17.766000000000002 - type: nauc_ndcg_at_100_std value: -18.7245 - type: nauc_ndcg_at_100_diff1 value: 47.0345 - type: nauc_ndcg_at_1000_max value: 17.596500000000002 - type: nauc_ndcg_at_1000_std value: -19.0628 - type: nauc_ndcg_at_1000_diff1 value: 47.12 - type: nauc_map_at_1_max value: 13.017599999999998 - type: nauc_map_at_1_std value: -18.8296 - type: nauc_map_at_1_diff1 value: 49.8762 - type: nauc_map_at_3_max value: 16.2438 - type: nauc_map_at_3_std value: -20.1711 - type: nauc_map_at_3_diff1 value: 47.2236 - type: nauc_map_at_5_max value: 16.541 - type: nauc_map_at_5_std value: -20.4952 - type: nauc_map_at_5_diff1 value: 47.1971 - type: nauc_map_at_10_max value: 16.7266 - type: nauc_map_at_10_std value: -20.1189 - type: nauc_map_at_10_diff1 value: 47.2762 - type: nauc_map_at_20_max value: 16.5198 - type: nauc_map_at_20_std value: -19.8167 - type: nauc_map_at_20_diff1 value: 47.266799999999996 - type: nauc_map_at_100_max value: 16.467200000000002 - type: nauc_map_at_100_std value: -19.7016 - type: nauc_map_at_100_diff1 value: 47.3389 - type: nauc_map_at_1000_max value: 16.466900000000003 - type: nauc_map_at_1000_std value: -19.704 - type: nauc_map_at_1000_diff1 value: 47.341 - type: nauc_recall_at_1_max value: 13.017599999999998 - type: nauc_recall_at_1_std value: -18.8296 - type: nauc_recall_at_1_diff1 value: 49.8762 - type: nauc_recall_at_3_max value: 20.579700000000003 - type: nauc_recall_at_3_std value: -21.263399999999997 - type: nauc_recall_at_3_diff1 value: 40.7412 - type: nauc_recall_at_5_max value: 23.308799999999998 - type: nauc_recall_at_5_std value: -23.0915 - type: nauc_recall_at_5_diff1 value: 38.2001 - type: nauc_recall_at_10_max value: 27.296 - type: nauc_recall_at_10_std value: -19.2697 - type: nauc_recall_at_10_diff1 value: 35.9711 - type: nauc_recall_at_20_max value: 23.9957 - type: nauc_recall_at_20_std value: -10.1564 - type: nauc_recall_at_20_diff1 value: 30.5332 - type: nauc_recall_at_100_max value: 27.0148 - type: nauc_recall_at_100_std value: 25.655299999999997 - type: nauc_recall_at_100_diff1 value: 23.1136 - type: nauc_recall_at_1000_max value: 28.9392 - type: nauc_recall_at_1000_std value: 47.491 - type: nauc_recall_at_1000_diff1 value: 15.6225 - type: nauc_precision_at_1_max value: 12.741900000000001 - type: nauc_precision_at_1_std value: -20.011000000000003 - type: nauc_precision_at_1_diff1 value: 51.02100000000001 - type: nauc_precision_at_3_max value: 20.477999999999998 - type: nauc_precision_at_3_std value: -24.4646 - type: nauc_precision_at_3_diff1 value: 41.1551 - type: nauc_precision_at_5_max value: 24.364 - type: nauc_precision_at_5_std value: -27.1997 - type: nauc_precision_at_5_diff1 value: 38.9501 - type: nauc_precision_at_10_max value: 30.684299999999997 - type: nauc_precision_at_10_std value: -23.1531 - type: nauc_precision_at_10_diff1 value: 34.6829 - type: nauc_precision_at_20_max value: 24.1828 - type: nauc_precision_at_20_std value: -10.783800000000001 - type: nauc_precision_at_20_diff1 value: 22.662399999999998 - type: nauc_precision_at_100_max value: 12.189 - type: nauc_precision_at_100_std value: 10.600999999999999 - type: nauc_precision_at_100_diff1 value: -0.2197 - type: nauc_precision_at_1000_max value: 1.1533 - type: nauc_precision_at_1000_std value: 6.2423 - type: nauc_precision_at_1000_diff1 value: -10.4662 - type: nauc_mrr_at_1_max value: 12.741900000000001 - type: nauc_mrr_at_1_std value: -20.011000000000003 - type: nauc_mrr_at_1_diff1 value: 51.02100000000001 - type: nauc_mrr_at_3_max value: 16.4501 - type: nauc_mrr_at_3_std value: -21.337500000000002 - type: nauc_mrr_at_3_diff1 value: 48.4594 - type: nauc_mrr_at_5_max value: 16.8928 - type: nauc_mrr_at_5_std value: -21.7254 - type: nauc_mrr_at_5_diff1 value: 48.619299999999996 - type: nauc_mrr_at_10_max value: 17.0057 - type: nauc_mrr_at_10_std value: -21.465899999999998 - type: nauc_mrr_at_10_diff1 value: 48.848200000000006 - type: nauc_mrr_at_20_max value: 16.745099999999997 - type: nauc_mrr_at_20_std value: -21.2914 - type: nauc_mrr_at_20_diff1 value: 48.861900000000006 - type: nauc_mrr_at_100_max value: 16.653399999999998 - type: nauc_mrr_at_100_std value: -21.1954 - type: nauc_mrr_at_100_diff1 value: 48.9097 - type: nauc_mrr_at_1000_max value: 16.650000000000002 - type: nauc_mrr_at_1000_std value: -21.2048 - type: nauc_mrr_at_1000_diff1 value: 48.911500000000004 - type: main_score value: 63.836000000000006 task: type: Retrieval - dataset: config: default name: MTEB FiQA2018 (default) revision: 27a168819829fe9bcd655c2df245fb19452e8e06 split: test type: mteb/fiqa metrics: - type: ndcg_at_1 value: 25.154 - type: ndcg_at_3 value: 22.85 - type: ndcg_at_5 value: 23.788999999999998 - type: ndcg_at_10 value: 25.657000000000004 - type: ndcg_at_20 value: 28.058 - type: ndcg_at_100 value: 32.019999999999996 - type: ndcg_at_1000 value: 36.124 - type: map_at_1 value: 12.594 - type: map_at_3 value: 17.345 - type: map_at_5 value: 18.740000000000002 - type: map_at_10 value: 19.871 - type: map_at_20 value: 20.71 - type: map_at_100 value: 21.404 - type: map_at_1000 value: 21.616 - type: recall_at_1 value: 12.594 - type: recall_at_3 value: 20.682000000000002 - type: recall_at_5 value: 24.735 - type: recall_at_10 value: 30.217 - type: recall_at_20 value: 37.714999999999996 - type: recall_at_100 value: 54.364000000000004 - type: recall_at_1000 value: 79.487 - type: precision_at_1 value: 25.154 - type: precision_at_3 value: 15.174999999999999 - type: precision_at_5 value: 11.235000000000001 - type: precision_at_10 value: 7.13 - type: precision_at_20 value: 4.522 - type: precision_at_100 value: 1.341 - type: precision_at_1000 value: 0.20500000000000002 - type: mrr_at_1 value: 25.154300000000003 - type: mrr_at_3 value: 30.324099999999998 - type: mrr_at_5 value: 31.581799999999998 - type: mrr_at_10 value: 32.5208 - type: mrr_at_20 value: 33.055 - type: mrr_at_100 value: 33.4738 - type: mrr_at_1000 value: 33.5533 - type: nauc_ndcg_at_1_max value: 20.836199999999998 - type: nauc_ndcg_at_1_std value: -2.4346 - type: nauc_ndcg_at_1_diff1 value: 41.3264 - type: nauc_ndcg_at_3_max value: 21.4673 - type: nauc_ndcg_at_3_std value: -0.35760000000000003 - type: nauc_ndcg_at_3_diff1 value: 36.5457 - type: nauc_ndcg_at_5_max value: 21.0022 - type: nauc_ndcg_at_5_std value: 0.30079999999999996 - type: nauc_ndcg_at_5_diff1 value: 35.1377 - type: nauc_ndcg_at_10_max value: 21.4511 - type: nauc_ndcg_at_10_std value: 1.9931 - type: nauc_ndcg_at_10_diff1 value: 35.367599999999996 - type: nauc_ndcg_at_20_max value: 21.9794 - type: nauc_ndcg_at_20_std value: 3.2666 - type: nauc_ndcg_at_20_diff1 value: 33.9954 - type: nauc_ndcg_at_100_max value: 22.666900000000002 - type: nauc_ndcg_at_100_std value: 6.1648000000000005 - type: nauc_ndcg_at_100_diff1 value: 32.5715 - type: nauc_ndcg_at_1000_max value: 23.9645 - type: nauc_ndcg_at_1000_std value: 7.031 - type: nauc_ndcg_at_1000_diff1 value: 32.6535 - type: nauc_map_at_1_max value: 13.436699999999998 - type: nauc_map_at_1_std value: -6.1377 - type: nauc_map_at_1_diff1 value: 46.1518 - type: nauc_map_at_3_max value: 17.6491 - type: nauc_map_at_3_std value: -3.3383000000000003 - type: nauc_map_at_3_diff1 value: 39.909800000000004 - type: nauc_map_at_5_max value: 18.4969 - type: nauc_map_at_5_std value: -1.8129 - type: nauc_map_at_5_diff1 value: 38.4072 - type: nauc_map_at_10_max value: 19.4823 - type: nauc_map_at_10_std value: -0.2211 - type: nauc_map_at_10_diff1 value: 38.1346 - type: nauc_map_at_20_max value: 19.9898 - type: nauc_map_at_20_std value: 0.6002000000000001 - type: nauc_map_at_20_diff1 value: 37.755100000000006 - type: nauc_map_at_100_max value: 20.2321 - type: nauc_map_at_100_std value: 1.2189999999999999 - type: nauc_map_at_100_diff1 value: 37.379 - type: nauc_map_at_1000_max value: 20.3676 - type: nauc_map_at_1000_std value: 1.3561999999999999 - type: nauc_map_at_1000_diff1 value: 37.3216 - type: nauc_recall_at_1_max value: 13.436699999999998 - type: nauc_recall_at_1_std value: -6.1377 - type: nauc_recall_at_1_diff1 value: 46.1518 - type: nauc_recall_at_3_max value: 17.4283 - type: nauc_recall_at_3_std value: -2.0456 - type: nauc_recall_at_3_diff1 value: 34.5422 - type: nauc_recall_at_5_max value: 18.2169 - type: nauc_recall_at_5_std value: 0.7002 - type: nauc_recall_at_5_diff1 value: 29.7798 - type: nauc_recall_at_10_max value: 19.6832 - type: nauc_recall_at_10_std value: 4.6769 - type: nauc_recall_at_10_diff1 value: 27.8829 - type: nauc_recall_at_20_max value: 20.095 - type: nauc_recall_at_20_std value: 6.884899999999999 - type: nauc_recall_at_20_diff1 value: 22.7741 - type: nauc_recall_at_100_max value: 20.5351 - type: nauc_recall_at_100_std value: 19.2636 - type: nauc_recall_at_100_diff1 value: 16.2238 - type: nauc_recall_at_1000_max value: 27.9838 - type: nauc_recall_at_1000_std value: 33.3099 - type: nauc_recall_at_1000_diff1 value: 12.701699999999999 - type: nauc_precision_at_1_max value: 20.836199999999998 - type: nauc_precision_at_1_std value: -2.4346 - type: nauc_precision_at_1_diff1 value: 41.3264 - type: nauc_precision_at_3_max value: 26.558500000000002 - type: nauc_precision_at_3_std value: 3.6578 - type: nauc_precision_at_3_diff1 value: 27.0323 - type: nauc_precision_at_5_max value: 28.794199999999996 - type: nauc_precision_at_5_std value: 8.6533 - type: nauc_precision_at_5_diff1 value: 21.9488 - type: nauc_precision_at_10_max value: 29.7713 - type: nauc_precision_at_10_std value: 13.645399999999999 - type: nauc_precision_at_10_diff1 value: 20.1386 - type: nauc_precision_at_20_max value: 28.0465 - type: nauc_precision_at_20_std value: 16.3569 - type: nauc_precision_at_20_diff1 value: 14.969299999999999 - type: nauc_precision_at_100_max value: 26.7123 - type: nauc_precision_at_100_std value: 19.1407 - type: nauc_precision_at_100_diff1 value: 5.7822 - type: nauc_precision_at_1000_max value: 23.6681 - type: nauc_precision_at_1000_std value: 16.3438 - type: nauc_precision_at_1000_diff1 value: -3.3699 - type: nauc_mrr_at_1_max value: 20.836199999999998 - type: nauc_mrr_at_1_std value: -2.4346 - type: nauc_mrr_at_1_diff1 value: 41.3264 - type: nauc_mrr_at_3_max value: 22.4267 - type: nauc_mrr_at_3_std value: -0.1948 - type: nauc_mrr_at_3_diff1 value: 36.9255 - type: nauc_mrr_at_5_max value: 22.6662 - type: nauc_mrr_at_5_std value: 0.4444 - type: nauc_mrr_at_5_diff1 value: 35.957 - type: nauc_mrr_at_10_max value: 22.5111 - type: nauc_mrr_at_10_std value: 0.7020000000000001 - type: nauc_mrr_at_10_diff1 value: 35.6976 - type: nauc_mrr_at_20_max value: 22.4416 - type: nauc_mrr_at_20_std value: 0.8706999999999999 - type: nauc_mrr_at_20_diff1 value: 35.2034 - type: nauc_mrr_at_100_max value: 22.4571 - type: nauc_mrr_at_100_std value: 1.0563 - type: nauc_mrr_at_100_diff1 value: 35.177 - type: nauc_mrr_at_1000_max value: 22.4743 - type: nauc_mrr_at_1000_std value: 1.0505 - type: nauc_mrr_at_1000_diff1 value: 35.2186 - type: main_score value: 25.657000000000004 task: type: Retrieval - dataset: config: default name: MTEB HotpotQAHardNegatives (default) revision: 617612fa63afcb60e3b134bed8b7216a99707c37 split: test type: mteb/HotpotQA_test_top_250_only_w_correct-v2 metrics: - type: ndcg_at_1 value: 58.9 - type: ndcg_at_3 value: 45.092999999999996 - type: ndcg_at_5 value: 47.806 - type: ndcg_at_10 value: 50.666 - type: ndcg_at_20 value: 52.644000000000005 - type: ndcg_at_100 value: 56.071000000000005 - type: ndcg_at_1000 value: 58.262 - type: map_at_1 value: 29.45 - type: map_at_3 value: 37.675 - type: map_at_5 value: 39.562999999999995 - type: map_at_10 value: 41.056 - type: map_at_20 value: 41.765 - type: map_at_100 value: 42.425000000000004 - type: map_at_1000 value: 42.54 - type: recall_at_1 value: 29.45 - type: recall_at_3 value: 41.75 - type: recall_at_5 value: 47.099999999999994 - type: recall_at_10 value: 54.300000000000004 - type: recall_at_20 value: 60.699999999999996 - type: recall_at_100 value: 75.9 - type: recall_at_1000 value: 90.3 - type: precision_at_1 value: 58.9 - type: precision_at_3 value: 27.833000000000002 - type: precision_at_5 value: 18.84 - type: precision_at_10 value: 10.86 - type: precision_at_20 value: 6.069999999999999 - type: precision_at_100 value: 1.518 - type: precision_at_1000 value: 0.181 - type: mrr_at_1 value: 58.9 - type: mrr_at_3 value: 64.81670000000001 - type: mrr_at_5 value: 65.9717 - type: mrr_at_10 value: 66.84750000000001 - type: mrr_at_20 value: 67.1864 - type: mrr_at_100 value: 67.3796 - type: mrr_at_1000 value: 67.3962 - type: nauc_ndcg_at_1_max value: 40.6699 - type: nauc_ndcg_at_1_std value: -6.4051 - type: nauc_ndcg_at_1_diff1 value: 61.4074 - type: nauc_ndcg_at_3_max value: 36.086200000000005 - type: nauc_ndcg_at_3_std value: -3.8372 - type: nauc_ndcg_at_3_diff1 value: 44.0991 - type: nauc_ndcg_at_5_max value: 35.1661 - type: nauc_ndcg_at_5_std value: -3.4778000000000002 - type: nauc_ndcg_at_5_diff1 value: 41.2298 - type: nauc_ndcg_at_10_max value: 34.5689 - type: nauc_ndcg_at_10_std value: -0.7254 - type: nauc_ndcg_at_10_diff1 value: 38.9824 - type: nauc_ndcg_at_20_max value: 35.4153 - type: nauc_ndcg_at_20_std value: 0.9502999999999999 - type: nauc_ndcg_at_20_diff1 value: 38.5558 - type: nauc_ndcg_at_100_max value: 36.187799999999996 - type: nauc_ndcg_at_100_std value: 3.3059 - type: nauc_ndcg_at_100_diff1 value: 37.775 - type: nauc_ndcg_at_1000_max value: 36.9076 - type: nauc_ndcg_at_1000_std value: 3.2030000000000003 - type: nauc_ndcg_at_1000_diff1 value: 39.6691 - type: nauc_map_at_1_max value: 40.6699 - type: nauc_map_at_1_std value: -6.4051 - type: nauc_map_at_1_diff1 value: 61.4074 - type: nauc_map_at_3_max value: 34.8654 - type: nauc_map_at_3_std value: -1.9401000000000002 - type: nauc_map_at_3_diff1 value: 40.4559 - type: nauc_map_at_5_max value: 34.0362 - type: nauc_map_at_5_std value: -1.677 - type: nauc_map_at_5_diff1 value: 38.384 - type: nauc_map_at_10_max value: 33.8136 - type: nauc_map_at_10_std value: -0.2753 - type: nauc_map_at_10_diff1 value: 37.1326 - type: nauc_map_at_20_max value: 34.1981 - type: nauc_map_at_20_std value: 0.2882 - type: nauc_map_at_20_diff1 value: 36.996 - type: nauc_map_at_100_max value: 34.2694 - type: nauc_map_at_100_std value: 0.596 - type: nauc_map_at_100_diff1 value: 36.858200000000004 - type: nauc_map_at_1000_max value: 34.3301 - type: nauc_map_at_1000_std value: 0.6459 - type: nauc_map_at_1000_diff1 value: 36.9437 - type: nauc_recall_at_1_max value: 40.6699 - type: nauc_recall_at_1_std value: -6.4051 - type: nauc_recall_at_1_diff1 value: 61.4074 - type: nauc_recall_at_3_max value: 33.4227 - type: nauc_recall_at_3_std value: -2.6978 - type: nauc_recall_at_3_diff1 value: 35.5329 - type: nauc_recall_at_5_max value: 29.759900000000002 - type: nauc_recall_at_5_std value: -1.7928 - type: nauc_recall_at_5_diff1 value: 27.8553 - type: nauc_recall_at_10_max value: 27.2765 - type: nauc_recall_at_10_std value: 5.0284 - type: nauc_recall_at_10_diff1 value: 21.5188 - type: nauc_recall_at_20_max value: 27.456500000000002 - type: nauc_recall_at_20_std value: 10.4452 - type: nauc_recall_at_20_diff1 value: 17.377100000000002 - type: nauc_recall_at_100_max value: 27.960400000000003 - type: nauc_recall_at_100_std value: 26.0653 - type: nauc_recall_at_100_diff1 value: 5.9226 - type: nauc_recall_at_1000_max value: 33.996700000000004 - type: nauc_recall_at_1000_std value: 44.291199999999996 - type: nauc_recall_at_1000_diff1 value: 7.6986 - type: nauc_precision_at_1_max value: 40.6699 - type: nauc_precision_at_1_std value: -6.4051 - type: nauc_precision_at_1_diff1 value: 61.4074 - type: nauc_precision_at_3_max value: 33.4227 - type: nauc_precision_at_3_std value: -2.6978 - type: nauc_precision_at_3_diff1 value: 35.5329 - type: nauc_precision_at_5_max value: 29.759900000000002 - type: nauc_precision_at_5_std value: -1.7928 - type: nauc_precision_at_5_diff1 value: 27.8553 - type: nauc_precision_at_10_max value: 27.2765 - type: nauc_precision_at_10_std value: 5.0284 - type: nauc_precision_at_10_diff1 value: 21.5188 - type: nauc_precision_at_20_max value: 27.456500000000002 - type: nauc_precision_at_20_std value: 10.4452 - type: nauc_precision_at_20_diff1 value: 17.377100000000002 - type: nauc_precision_at_100_max value: 27.960400000000003 - type: nauc_precision_at_100_std value: 26.0653 - type: nauc_precision_at_100_diff1 value: 5.9226 - type: nauc_precision_at_1000_max value: 33.996700000000004 - type: nauc_precision_at_1000_std value: 44.291199999999996 - type: nauc_precision_at_1000_diff1 value: 7.6986 - type: nauc_mrr_at_1_max value: 40.6699 - type: nauc_mrr_at_1_std value: -6.4051 - type: nauc_mrr_at_1_diff1 value: 61.4074 - type: nauc_mrr_at_3_max value: 40.4193 - type: nauc_mrr_at_3_std value: -8.072899999999999 - type: nauc_mrr_at_3_diff1 value: 58.589400000000005 - type: nauc_mrr_at_5_max value: 40.6559 - type: nauc_mrr_at_5_std value: -8.1937 - type: nauc_mrr_at_5_diff1 value: 58.30650000000001 - type: nauc_mrr_at_10_max value: 40.515699999999995 - type: nauc_mrr_at_10_std value: -7.4325 - type: nauc_mrr_at_10_diff1 value: 58.1284 - type: nauc_mrr_at_20_max value: 40.63 - type: nauc_mrr_at_20_std value: -7.1578 - type: nauc_mrr_at_20_diff1 value: 58.215799999999994 - type: nauc_mrr_at_100_max value: 40.693 - type: nauc_mrr_at_100_std value: -7.0889 - type: nauc_mrr_at_100_diff1 value: 58.22389999999999 - type: nauc_mrr_at_1000_max value: 40.700900000000004 - type: nauc_mrr_at_1000_std value: -7.098400000000001 - type: nauc_mrr_at_1000_diff1 value: 58.2458 - type: main_score value: 50.666 task: type: Retrieval - dataset: config: default name: MTEB ImdbClassification (default) revision: 3d86128a09e091d6018b6d26cad27f2739fc2db7 split: test type: mteb/imdb metrics: - type: accuracy value: 68.1712 - type: f1 value: 67.982 - type: f1_weighted value: 67.982 - type: ap value: 62.572799999999994 - type: ap_weighted value: 62.572799999999994 - type: main_score value: 68.1712 task: type: Classification - dataset: config: en name: MTEB MTOPDomainClassification (en) revision: d80d48c1eb48d3562165c59d59d0034df9fff0bf split: test type: mteb/mtop_domain metrics: - type: accuracy value: 90.4423 - type: f1 value: 90.08840000000001 - type: f1_weighted value: 90.44919999999999 - type: main_score value: 90.4423 task: type: Classification - dataset: config: en name: MTEB MassiveIntentClassification (en) revision: 4672e20407010da34463acc759c162ca9734bca6 split: test type: mteb/amazon_massive_intent metrics: - type: accuracy value: 65.4371 - type: f1 value: 62.8737 - type: f1_weighted value: 64.2218 - type: main_score value: 65.4371 task: type: Classification - dataset: config: en name: MTEB MassiveScenarioClassification (en) revision: fad2c6e8459f9e1c45d9315f4953d921437d70f8 split: test type: mteb/amazon_massive_scenario metrics: - type: accuracy value: 70.4371 - type: f1 value: 69.75200000000001 - type: f1_weighted value: 69.7839 - type: main_score value: 70.4371 task: type: Classification - dataset: config: default name: MTEB MedrxivClusteringP2P.v2 (default) revision: e7a26af6f3ae46b30dde8737f02c07b1505bcc73 split: test type: mteb/medrxiv-clustering-p2p metrics: - type: v_measure value: 35.1864 - type: v_measure_std value: 0.7835 - type: main_score value: 35.1864 task: type: Clustering - dataset: config: default name: MTEB MedrxivClusteringS2S.v2 (default) revision: 35191c8c0dca72d8ff3efcd72aa802307d469663 split: test type: mteb/medrxiv-clustering-s2s metrics: - type: v_measure value: 31.8693 - type: v_measure_std value: 0.662 - type: main_score value: 31.8693 task: type: Clustering - dataset: config: default name: MTEB MindSmallReranking (default) revision: 59042f120c80e8afa9cdbb224f67076cec0fc9a7 split: test type: mteb/mind_small metrics: - type: map value: 29.4367 - type: mrr value: 30.318299999999997 - type: nAUC_map_max value: -21.5343 - type: nAUC_map_std value: -6.4848 - type: nAUC_map_diff1 value: 12.8559 - type: nAUC_mrr_max value: -15.981200000000001 - type: nAUC_mrr_std value: -4.2437000000000005 - type: nAUC_mrr_diff1 value: 12.4087 - type: main_score value: 29.4367 task: type: Reranking - dataset: config: default name: MTEB SCIDOCS (default) revision: f8c2fcf00f625baaa80f62ec5bd9e1fff3b8ae88 split: test type: mteb/scidocs metrics: - type: ndcg_at_1 value: 19.5 - type: ndcg_at_3 value: 15.673 - type: ndcg_at_5 value: 13.389000000000001 - type: ndcg_at_10 value: 16.179 - type: ndcg_at_20 value: 18.88 - type: ndcg_at_100 value: 23.812 - type: ndcg_at_1000 value: 29.833 - type: map_at_1 value: 3.963 - type: map_at_3 value: 6.93 - type: map_at_5 value: 8.062 - type: map_at_10 value: 9.328 - type: map_at_20 value: 10.283000000000001 - type: map_at_100 value: 11.197 - type: map_at_1000 value: 11.522 - type: recall_at_1 value: 3.963 - type: recall_at_3 value: 8.813 - type: recall_at_5 value: 11.658 - type: recall_at_10 value: 16.803 - type: recall_at_20 value: 23.169999999999998 - type: recall_at_100 value: 39.163 - type: recall_at_1000 value: 68.572 - type: precision_at_1 value: 19.5 - type: precision_at_3 value: 14.499999999999998 - type: precision_at_5 value: 11.5 - type: precision_at_10 value: 8.3 - type: precision_at_20 value: 5.71 - type: precision_at_100 value: 1.9300000000000002 - type: precision_at_1000 value: 0.338 - type: mrr_at_1 value: 19.5 - type: mrr_at_3 value: 26.016699999999997 - type: mrr_at_5 value: 27.526699999999998 - type: mrr_at_10 value: 28.9305 - type: mrr_at_20 value: 29.628100000000003 - type: mrr_at_100 value: 30.131400000000003 - type: mrr_at_1000 value: 30.201800000000002 - type: nauc_ndcg_at_1_max value: 25.1197 - type: nauc_ndcg_at_1_std value: 4.7176 - type: nauc_ndcg_at_1_diff1 value: 24.2336 - type: nauc_ndcg_at_3_max value: 30.050900000000002 - type: nauc_ndcg_at_3_std value: 11.4719 - type: nauc_ndcg_at_3_diff1 value: 20.4572 - type: nauc_ndcg_at_5_max value: 32.224399999999996 - type: nauc_ndcg_at_5_std value: 15.0585 - type: nauc_ndcg_at_5_diff1 value: 19.991600000000002 - type: nauc_ndcg_at_10_max value: 33.7156 - type: nauc_ndcg_at_10_std value: 19.2797 - type: nauc_ndcg_at_10_diff1 value: 20.3735 - type: nauc_ndcg_at_20_max value: 34.7518 - type: nauc_ndcg_at_20_std value: 23.227600000000002 - type: nauc_ndcg_at_20_diff1 value: 19.2851 - type: nauc_ndcg_at_100_max value: 36.6006 - type: nauc_ndcg_at_100_std value: 28.511599999999998 - type: nauc_ndcg_at_100_diff1 value: 18.0315 - type: nauc_ndcg_at_1000_max value: 36.3651 - type: nauc_ndcg_at_1000_std value: 29.7201 - type: nauc_ndcg_at_1000_diff1 value: 16.5988 - type: nauc_map_at_1_max value: 24.954 - type: nauc_map_at_1_std value: 4.7878 - type: nauc_map_at_1_diff1 value: 24.7611 - type: nauc_map_at_3_max value: 30.0634 - type: nauc_map_at_3_std value: 9.9217 - type: nauc_map_at_3_diff1 value: 21.9063 - type: nauc_map_at_5_max value: 32.1685 - type: nauc_map_at_5_std value: 12.8527 - type: nauc_map_at_5_diff1 value: 21.033099999999997 - type: nauc_map_at_10_max value: 33.840199999999996 - type: nauc_map_at_10_std value: 16.304299999999998 - type: nauc_map_at_10_diff1 value: 21.9142 - type: nauc_map_at_20_max value: 34.2084 - type: nauc_map_at_20_std value: 18.709799999999998 - type: nauc_map_at_20_diff1 value: 21.2113 - type: nauc_map_at_100_max value: 35.1304 - type: nauc_map_at_100_std value: 20.8559 - type: nauc_map_at_100_diff1 value: 20.8642 - type: nauc_map_at_1000_max value: 35.1972 - type: nauc_map_at_1000_std value: 21.2306 - type: nauc_map_at_1000_diff1 value: 20.7425 - type: nauc_recall_at_1_max value: 24.954 - type: nauc_recall_at_1_std value: 4.7878 - type: nauc_recall_at_1_diff1 value: 24.7611 - type: nauc_recall_at_3_max value: 31.1016 - type: nauc_recall_at_3_std value: 14.1642 - type: nauc_recall_at_3_diff1 value: 18.676000000000002 - type: nauc_recall_at_5_max value: 33.8509 - type: nauc_recall_at_5_std value: 19.503899999999998 - type: nauc_recall_at_5_diff1 value: 17.1764 - type: nauc_recall_at_10_max value: 34.085300000000004 - type: nauc_recall_at_10_std value: 25.536199999999997 - type: nauc_recall_at_10_diff1 value: 16.8913 - type: nauc_recall_at_20_max value: 34.1879 - type: nauc_recall_at_20_std value: 31.5486 - type: nauc_recall_at_20_diff1 value: 13.852300000000001 - type: nauc_recall_at_100_max value: 34.313700000000004 - type: nauc_recall_at_100_std value: 40.6137 - type: nauc_recall_at_100_diff1 value: 9.043800000000001 - type: nauc_recall_at_1000_max value: 27.090500000000002 - type: nauc_recall_at_1000_std value: 42.398799999999994 - type: nauc_recall_at_1000_diff1 value: -0.9452999999999999 - type: nauc_precision_at_1_max value: 25.1197 - type: nauc_precision_at_1_std value: 4.7176 - type: nauc_precision_at_1_diff1 value: 24.2336 - type: nauc_precision_at_3_max value: 31.4429 - type: nauc_precision_at_3_std value: 14.1941 - type: nauc_precision_at_3_diff1 value: 18.4824 - type: nauc_precision_at_5_max value: 34.2219 - type: nauc_precision_at_5_std value: 19.703699999999998 - type: nauc_precision_at_5_diff1 value: 17.0964 - type: nauc_precision_at_10_max value: 34.380300000000005 - type: nauc_precision_at_10_std value: 25.6554 - type: nauc_precision_at_10_diff1 value: 16.8487 - type: nauc_precision_at_20_max value: 34.462199999999996 - type: nauc_precision_at_20_std value: 31.465500000000002 - type: nauc_precision_at_20_diff1 value: 13.9038 - type: nauc_precision_at_100_max value: 34.7074 - type: nauc_precision_at_100_std value: 40.3278 - type: nauc_precision_at_100_diff1 value: 9.2637 - type: nauc_precision_at_1000_max value: 27.213900000000002 - type: nauc_precision_at_1000_std value: 40.8382 - type: nauc_precision_at_1000_diff1 value: -0.5306 - type: nauc_mrr_at_1_max value: 25.1197 - type: nauc_mrr_at_1_std value: 4.7176 - type: nauc_mrr_at_1_diff1 value: 24.2336 - type: nauc_mrr_at_3_max value: 27.9362 - type: nauc_mrr_at_3_std value: 9.9578 - type: nauc_mrr_at_3_diff1 value: 20.809 - type: nauc_mrr_at_5_max value: 29.0381 - type: nauc_mrr_at_5_std value: 11.7807 - type: nauc_mrr_at_5_diff1 value: 20.8787 - type: nauc_mrr_at_10_max value: 28.860799999999998 - type: nauc_mrr_at_10_std value: 12.269 - type: nauc_mrr_at_10_diff1 value: 20.7762 - type: nauc_mrr_at_20_max value: 29.2051 - type: nauc_mrr_at_20_std value: 12.7588 - type: nauc_mrr_at_20_diff1 value: 20.9176 - type: nauc_mrr_at_100_max value: 29.2288 - type: nauc_mrr_at_100_std value: 12.7523 - type: nauc_mrr_at_100_diff1 value: 20.9235 - type: nauc_mrr_at_1000_max value: 29.2119 - type: nauc_mrr_at_1000_std value: 12.697600000000001 - type: nauc_mrr_at_1000_diff1 value: 20.9131 - type: main_score value: 16.179 task: type: Retrieval - dataset: config: default name: MTEB SICK-R (default) revision: 20a6d6f312dd54037fe07a32d58e5e168867909d split: test type: mteb/sickr-sts metrics: - type: pearson value: 84.5347 - type: spearman value: 79.80850000000001 - type: cosine_pearson value: 84.5347 - type: cosine_spearman value: 79.80850000000001 - type: manhattan_pearson value: 81.0701 - type: manhattan_spearman value: 79.6721 - type: euclidean_pearson value: 81.20349999999999 - type: euclidean_spearman value: 79.80850000000001 - type: main_score value: 79.80850000000001 task: type: STS - dataset: config: default name: MTEB STS12 (default) revision: a0d554a64d88156834ff5ae9920b964011b16384 split: test type: mteb/sts12-sts metrics: - type: pearson value: 86.88 - type: spearman value: 78.1076 - type: cosine_pearson value: 86.88 - type: cosine_spearman value: 78.1052 - type: manhattan_pearson value: 83.3712 - type: manhattan_spearman value: 78.0898 - type: euclidean_pearson value: 83.3731 - type: euclidean_spearman value: 78.1052 - type: main_score value: 78.1052 task: type: STS - dataset: config: default name: MTEB STS13 (default) revision: 7e90230a92c190f1bf69ae9002b8cea547a64cca split: test type: mteb/sts13-sts metrics: - type: pearson value: 83.5938 - type: spearman value: 84.2951 - type: cosine_pearson value: 83.5938 - type: cosine_spearman value: 84.2951 - type: manhattan_pearson value: 83.2541 - type: manhattan_spearman value: 83.8292 - type: euclidean_pearson value: 83.69640000000001 - type: euclidean_spearman value: 84.2951 - type: main_score value: 84.2951 task: type: STS - dataset: config: default name: MTEB STS14 (default) revision: 6031580fec1f6af667f0bd2da0a551cf4f0b2375 split: test type: mteb/sts14-sts metrics: - type: pearson value: 82.6003 - type: spearman value: 81.3569 - type: cosine_pearson value: 82.6003 - type: cosine_spearman value: 81.357 - type: manhattan_pearson value: 81.5087 - type: manhattan_spearman value: 81.17229999999999 - type: euclidean_pearson value: 81.7147 - type: euclidean_spearman value: 81.3569 - type: main_score value: 81.357 task: type: STS - dataset: config: default name: MTEB STS15 (default) revision: ae752c7c21bf194d8b67fd573edf7ae58183cbe3 split: test type: mteb/sts15-sts metrics: - type: pearson value: 86.4161 - type: spearman value: 87.0039 - type: cosine_pearson value: 86.4161 - type: cosine_spearman value: 87.0039 - type: manhattan_pearson value: 86.2482 - type: manhattan_spearman value: 86.934 - type: euclidean_pearson value: 86.3344 - type: euclidean_spearman value: 87.0039 - type: main_score value: 87.0039 task: type: STS - dataset: config: en-en name: MTEB STS17 (en-en) revision: faeb762787bd10488a50c8b5be4a3b82e411949c split: test type: mteb/sts17-crosslingual-sts metrics: - type: pearson value: 88.6011 - type: spearman value: 88.1023 - type: cosine_pearson value: 88.6011 - type: cosine_spearman value: 88.1023 - type: manhattan_pearson value: 88.18639999999999 - type: manhattan_spearman value: 88.55380000000001 - type: euclidean_pearson value: 88.011 - type: euclidean_spearman value: 88.1023 - type: main_score value: 88.1023 task: type: STS - dataset: config: en name: MTEB STS22.v2 (en) revision: d31f33a128469b20e357535c39b82fb3c3f6f2bd split: test type: mteb/sts22-crosslingual-sts metrics: - type: pearson value: 65.7746 - type: spearman value: 64.7997 - type: cosine_pearson value: 65.7746 - type: cosine_spearman value: 64.7997 - type: manhattan_pearson value: 67.5417 - type: manhattan_spearman value: 65.27629999999999 - type: euclidean_pearson value: 67.2574 - type: euclidean_spearman value: 64.7997 - type: main_score value: 64.7997 task: type: STS - dataset: config: default name: MTEB STSBenchmark (default) revision: b0fddb56ed78048fa8b90373c8a3cfc37b684831 split: test type: mteb/stsbenchmark-sts metrics: - type: pearson value: 84.4276 - type: spearman value: 84.9631 - type: cosine_pearson value: 84.4276 - type: cosine_spearman value: 84.9631 - type: manhattan_pearson value: 84.4743 - type: manhattan_spearman value: 84.7686 - type: euclidean_pearson value: 84.6058 - type: euclidean_spearman value: 84.9631 - type: main_score value: 84.9631 task: type: STS - dataset: config: default name: MTEB SprintDuplicateQuestions (default) revision: d66bd1f72af766a5cc4b0ca5e00c162f89e8cc46 split: test type: mteb/sprintduplicatequestions-pairclassification metrics: - type: similarity_accuracy value: 99.7931 - type: similarity_accuracy_threshold value: 69.6798 - type: similarity_f1 value: 89.4293 - type: similarity_f1_threshold value: 68.3132 - type: similarity_precision value: 88.76849999999999 - type: similarity_recall value: 90.10000000000001 - type: similarity_ap value: 94.3099 - type: cosine_accuracy value: 99.7931 - type: cosine_accuracy_threshold value: 69.6798 - type: cosine_f1 value: 89.4293 - type: cosine_f1_threshold value: 68.3132 - type: cosine_precision value: 88.76849999999999 - type: cosine_recall value: 90.10000000000001 - type: cosine_ap value: 94.3099 - type: manhattan_accuracy value: 99.7792 - type: manhattan_accuracy_threshold value: 1354.3922 - type: manhattan_f1 value: 88.71289999999999 - type: manhattan_f1_threshold value: 1389.3319999999999 - type: manhattan_precision value: 87.84309999999999 - type: manhattan_recall value: 89.60000000000001 - type: manhattan_ap value: 93.8459 - type: euclidean_accuracy value: 99.7931 - type: euclidean_accuracy_threshold value: 77.872 - type: euclidean_f1 value: 89.4293 - type: euclidean_f1_threshold value: 79.6075 - type: euclidean_precision value: 88.76849999999999 - type: euclidean_recall value: 90.10000000000001 - type: euclidean_ap value: 94.3099 - type: dot_accuracy value: 99.7931 - type: dot_accuracy_threshold value: 69.6798 - type: dot_f1 value: 89.4293 - type: dot_f1_threshold value: 68.3132 - type: dot_precision value: 88.76849999999999 - type: dot_recall value: 90.10000000000001 - type: dot_ap value: 94.3099 - type: max_accuracy value: 99.7931 - type: max_f1 value: 89.4293 - type: max_precision value: 88.76849999999999 - type: max_recall value: 90.10000000000001 - type: max_ap value: 94.3099 - type: main_score value: 94.3099 task: type: PairClassification - dataset: config: default name: MTEB StackExchangeClustering.v2 (default) revision: 6cbc1f7b2bc0622f2e39d2c77fa502909748c259 split: test type: mteb/stackexchange-clustering metrics: - type: v_measure value: 53.9397 - type: v_measure_std value: 0.7764 - type: main_score value: 53.9397 task: type: Clustering - dataset: config: default name: MTEB StackExchangeClusteringP2P.v2 (default) revision: 815ca46b2622cec33ccafc3735d572c266efdb44 split: test type: mteb/stackexchange-clustering-p2p metrics: - type: v_measure value: 40.6498 - type: v_measure_std value: 0.439 - type: main_score value: 40.6498 task: type: Clustering - dataset: config: default name: MTEB SummEvalSummarization.v2 (default) revision: cda12ad7615edc362dbf25a00fdd61d3b1eaf93c split: test type: mteb/summeval metrics: - type: pearson value: 28.6283 - type: spearman value: 26.0828 - type: cosine_spearman value: 26.0828 - type: cosine_pearson value: 28.6283 - type: dot_spearman value: 26.0828 - type: dot_pearson value: 28.6283 - type: main_score value: 26.0828 task: type: Summarization - dataset: config: default name: MTEB TRECCOVID (default) revision: bb9466bac8153a0349341eb1b22e06409e78ef4e split: test type: mteb/trec-covid metrics: - type: ndcg_at_1 value: 66 - type: ndcg_at_3 value: 64.592 - type: ndcg_at_5 value: 63.405 - type: ndcg_at_10 value: 60.077999999999996 - type: ndcg_at_20 value: 57.202 - type: ndcg_at_100 value: 44.643 - type: ndcg_at_1000 value: 42.104 - type: map_at_1 value: 0.193 - type: map_at_3 value: 0.514 - type: map_at_5 value: 0.783 - type: map_at_10 value: 1.3719999999999999 - type: map_at_20 value: 2.371 - type: map_at_100 value: 7.353 - type: map_at_1000 value: 17.855999999999998 - type: recall_at_1 value: 0.193 - type: recall_at_3 value: 0.563 - type: recall_at_5 value: 0.907 - type: recall_at_10 value: 1.683 - type: recall_at_20 value: 3.118 - type: recall_at_100 value: 11.051 - type: recall_at_1000 value: 39.973 - type: precision_at_1 value: 74 - type: precision_at_3 value: 71.333 - type: precision_at_5 value: 68.8 - type: precision_at_10 value: 63.800000000000004 - type: precision_at_20 value: 60.5 - type: precision_at_100 value: 45.519999999999996 - type: precision_at_1000 value: 18.451999999999998 - type: mrr_at_1 value: 74 - type: mrr_at_3 value: 83.3333 - type: mrr_at_5 value: 83.7333 - type: mrr_at_10 value: 84.3524 - type: mrr_at_20 value: 84.3524 - type: mrr_at_100 value: 84.3524 - type: mrr_at_1000 value: 84.3524 - type: nauc_ndcg_at_1_max value: 11.527800000000001 - type: nauc_ndcg_at_1_std value: 17.1352 - type: nauc_ndcg_at_1_diff1 value: 24.955199999999998 - type: nauc_ndcg_at_3_max value: 11.7829 - type: nauc_ndcg_at_3_std value: 23.1421 - type: nauc_ndcg_at_3_diff1 value: 20.884 - type: nauc_ndcg_at_5_max value: 8.8058 - type: nauc_ndcg_at_5_std value: 27.9156 - type: nauc_ndcg_at_5_diff1 value: 7.002 - type: nauc_ndcg_at_10_max value: 16.561 - type: nauc_ndcg_at_10_std value: 40.528999999999996 - type: nauc_ndcg_at_10_diff1 value: -6.1467 - type: nauc_ndcg_at_20_max value: 25.0792 - type: nauc_ndcg_at_20_std value: 54.0689 - type: nauc_ndcg_at_20_diff1 value: -9.6224 - type: nauc_ndcg_at_100_max value: 43.2818 - type: nauc_ndcg_at_100_std value: 75.4432 - type: nauc_ndcg_at_100_diff1 value: -11.4618 - type: nauc_ndcg_at_1000_max value: 50.360099999999996 - type: nauc_ndcg_at_1000_std value: 76.03999999999999 - type: nauc_ndcg_at_1000_diff1 value: -12.5796 - type: nauc_map_at_1_max value: 4.3809000000000005 - type: nauc_map_at_1_std value: -17.5338 - type: nauc_map_at_1_diff1 value: 24.837 - type: nauc_map_at_3_max value: 4.7842 - type: nauc_map_at_3_std value: -8.9273 - type: nauc_map_at_3_diff1 value: 19.7729 - type: nauc_map_at_5_max value: 3.6865 - type: nauc_map_at_5_std value: -1.1584 - type: nauc_map_at_5_diff1 value: 7.3548 - type: nauc_map_at_10_max value: 7.556400000000001 - type: nauc_map_at_10_std value: 11.2599 - type: nauc_map_at_10_diff1 value: -3.4863999999999997 - type: nauc_map_at_20_max value: 12.6951 - type: nauc_map_at_20_std value: 27.3531 - type: nauc_map_at_20_diff1 value: -11.968 - type: nauc_map_at_100_max value: 41.625099999999996 - type: nauc_map_at_100_std value: 66.5204 - type: nauc_map_at_100_diff1 value: -12.020999999999999 - type: nauc_map_at_1000_max value: 56.6014 - type: nauc_map_at_1000_std value: 80.6523 - type: nauc_map_at_1000_diff1 value: -11.9876 - type: nauc_recall_at_1_max value: 4.3809000000000005 - type: nauc_recall_at_1_std value: -17.5338 - type: nauc_recall_at_1_diff1 value: 24.837 - type: nauc_recall_at_3_max value: -0.8904000000000001 - type: nauc_recall_at_3_std value: -11.2455 - type: nauc_recall_at_3_diff1 value: 17.6352 - type: nauc_recall_at_5_max value: -4.6216 - type: nauc_recall_at_5_std value: -3.5367999999999995 - type: nauc_recall_at_5_diff1 value: 3.3192 - type: nauc_recall_at_10_max value: 1.8993 - type: nauc_recall_at_10_std value: 6.844600000000001 - type: nauc_recall_at_10_diff1 value: -6.0693 - type: nauc_recall_at_20_max value: 5.733 - type: nauc_recall_at_20_std value: 20.6114 - type: nauc_recall_at_20_diff1 value: -11.631 - type: nauc_recall_at_100_max value: 32.7146 - type: nauc_recall_at_100_std value: 55.6053 - type: nauc_recall_at_100_diff1 value: -10.7219 - type: nauc_recall_at_1000_max value: 50.7544 - type: nauc_recall_at_1000_std value: 68.4639 - type: nauc_recall_at_1000_diff1 value: -10.431600000000001 - type: nauc_precision_at_1_max value: 13.8681 - type: nauc_precision_at_1_std value: -3.4711 - type: nauc_precision_at_1_diff1 value: 36.945 - type: nauc_precision_at_3_max value: 11.6309 - type: nauc_precision_at_3_std value: 5.0299000000000005 - type: nauc_precision_at_3_diff1 value: 28.5186 - type: nauc_precision_at_5_max value: 10.1297 - type: nauc_precision_at_5_std value: 19.049599999999998 - type: nauc_precision_at_5_diff1 value: 7.918500000000001 - type: nauc_precision_at_10_max value: 21.3492 - type: nauc_precision_at_10_std value: 39.6679 - type: nauc_precision_at_10_diff1 value: -10.7691 - type: nauc_precision_at_20_max value: 32.4627 - type: nauc_precision_at_20_std value: 57.2564 - type: nauc_precision_at_20_diff1 value: -12.0336 - type: nauc_precision_at_100_max value: 47.7277 - type: nauc_precision_at_100_std value: 77.0329 - type: nauc_precision_at_100_diff1 value: -9.2173 - type: nauc_precision_at_1000_max value: 47.6622 - type: nauc_precision_at_1000_std value: 62.8329 - type: nauc_precision_at_1000_diff1 value: -5.9713 - type: nauc_mrr_at_1_max value: 13.8681 - type: nauc_mrr_at_1_std value: -3.4711 - type: nauc_mrr_at_1_diff1 value: 36.945 - type: nauc_mrr_at_3_max value: 9.6673 - type: nauc_mrr_at_3_std value: -4.3877 - type: nauc_mrr_at_3_diff1 value: 39.2075 - type: nauc_mrr_at_5_max value: 7.9742999999999995 - type: nauc_mrr_at_5_std value: -4.8388 - type: nauc_mrr_at_5_diff1 value: 38.314 - type: nauc_mrr_at_10_max value: 11.6962 - type: nauc_mrr_at_10_std value: -2.7085000000000004 - type: nauc_mrr_at_10_diff1 value: 37.695 - type: nauc_mrr_at_20_max value: 11.6962 - type: nauc_mrr_at_20_std value: -2.7085000000000004 - type: nauc_mrr_at_20_diff1 value: 37.695 - type: nauc_mrr_at_100_max value: 11.6962 - type: nauc_mrr_at_100_std value: -2.7085000000000004 - type: nauc_mrr_at_100_diff1 value: 37.695 - type: nauc_mrr_at_1000_max value: 11.6962 - type: nauc_mrr_at_1000_std value: -2.7085000000000004 - type: nauc_mrr_at_1000_diff1 value: 37.695 - type: main_score value: 60.077999999999996 task: type: Retrieval - dataset: config: default name: MTEB Touche2020Retrieval.v3 (default) revision: 431886eaecc48f067a3975b70d0949ea2862463c split: test type: mteb/webis-touche2020-v3 metrics: - type: ndcg_at_1 value: 58.163 - type: ndcg_at_3 value: 58.884 - type: ndcg_at_5 value: 53.062 - type: ndcg_at_10 value: 47.571999999999996 - type: ndcg_at_20 value: 43.984 - type: ndcg_at_100 value: 51.559999999999995 - type: ndcg_at_1000 value: 64.25800000000001 - type: map_at_1 value: 2.759 - type: map_at_3 value: 7.310999999999999 - type: map_at_5 value: 10.077 - type: map_at_10 value: 15.722 - type: map_at_20 value: 21.917 - type: map_at_100 value: 29.582000000000004 - type: map_at_1000 value: 32.608 - type: recall_at_1 value: 2.759 - type: recall_at_3 value: 7.870000000000001 - type: recall_at_5 value: 11.26 - type: recall_at_10 value: 19.211 - type: recall_at_20 value: 30.134 - type: recall_at_100 value: 54.96 - type: recall_at_1000 value: 85.78099999999999 - type: precision_at_1 value: 67.34700000000001 - type: precision_at_3 value: 68.027 - type: precision_at_5 value: 59.184000000000005 - type: precision_at_10 value: 50.815999999999995 - type: precision_at_20 value: 41.939 - type: precision_at_100 value: 17.041 - type: precision_at_1000 value: 2.963 - type: mrr_at_1 value: 67.3469 - type: mrr_at_3 value: 80.6122 - type: mrr_at_5 value: 80.6122 - type: mrr_at_10 value: 80.9524 - type: mrr_at_20 value: 80.9524 - type: mrr_at_100 value: 80.9524 - type: mrr_at_1000 value: 80.9524 - type: nauc_ndcg_at_1_max value: -18.7982 - type: nauc_ndcg_at_1_std value: 13.605500000000001 - type: nauc_ndcg_at_1_diff1 value: 21.2588 - type: nauc_ndcg_at_3_max value: -9.0937 - type: nauc_ndcg_at_3_std value: 23.259900000000002 - type: nauc_ndcg_at_3_diff1 value: 24.2989 - type: nauc_ndcg_at_5_max value: -13.242300000000002 - type: nauc_ndcg_at_5_std value: 9.7464 - type: nauc_ndcg_at_5_diff1 value: 18.601799999999997 - type: nauc_ndcg_at_10_max value: -12.045599999999999 - type: nauc_ndcg_at_10_std value: 7.5604000000000005 - type: nauc_ndcg_at_10_diff1 value: 20.1203 - type: nauc_ndcg_at_20_max value: -13.2776 - type: nauc_ndcg_at_20_std value: 8.2692 - type: nauc_ndcg_at_20_diff1 value: 21.38 - type: nauc_ndcg_at_100_max value: -21.1315 - type: nauc_ndcg_at_100_std value: 8.4079 - type: nauc_ndcg_at_100_diff1 value: 29.3124 - type: nauc_ndcg_at_1000_max value: -3.7026999999999997 - type: nauc_ndcg_at_1000_std value: 34.970600000000005 - type: nauc_ndcg_at_1000_diff1 value: 22.3636 - type: nauc_map_at_1_max value: -36.432500000000005 - type: nauc_map_at_1_std value: -23.9669 - type: nauc_map_at_1_diff1 value: 37.2073 - type: nauc_map_at_3_max value: -32.8613 - type: nauc_map_at_3_std value: -18.0951 - type: nauc_map_at_3_diff1 value: 36.3228 - type: nauc_map_at_5_max value: -31.355 - type: nauc_map_at_5_std value: -21.148500000000002 - type: nauc_map_at_5_diff1 value: 27.999200000000002 - type: nauc_map_at_10_max value: -25.3787 - type: nauc_map_at_10_std value: -18.564700000000002 - type: nauc_map_at_10_diff1 value: 24.076800000000002 - type: nauc_map_at_20_max value: -20.954 - type: nauc_map_at_20_std value: -12.6847 - type: nauc_map_at_20_diff1 value: 24.3842 - type: nauc_map_at_100_max value: -15.7801 - type: nauc_map_at_100_std value: -2.823 - type: nauc_map_at_100_diff1 value: 24.8472 - type: nauc_map_at_1000_max value: -11.8023 - type: nauc_map_at_1000_std value: 3.9041 - type: nauc_map_at_1000_diff1 value: 23.3312 - type: nauc_recall_at_1_max value: -36.432500000000005 - type: nauc_recall_at_1_std value: -23.9669 - type: nauc_recall_at_1_diff1 value: 37.2073 - type: nauc_recall_at_3_max value: -36.3448 - type: nauc_recall_at_3_std value: -18.4742 - type: nauc_recall_at_3_diff1 value: 38.4857 - type: nauc_recall_at_5_max value: -35.4207 - type: nauc_recall_at_5_std value: -23.7906 - type: nauc_recall_at_5_diff1 value: 28.3854 - type: nauc_recall_at_10_max value: -28.4266 - type: nauc_recall_at_10_std value: -21.3224 - type: nauc_recall_at_10_diff1 value: 27.0746 - type: nauc_recall_at_20_max value: -23.1205 - type: nauc_recall_at_20_std value: -12.3539 - type: nauc_recall_at_20_diff1 value: 27.127499999999998 - type: nauc_recall_at_100_max value: -22.0703 - type: nauc_recall_at_100_std value: 10.1339 - type: nauc_recall_at_100_diff1 value: 29.759900000000002 - type: nauc_recall_at_1000_max value: 13.5147 - type: nauc_recall_at_1000_std value: 78.4907 - type: nauc_recall_at_1000_diff1 value: 12.151 - type: nauc_precision_at_1_max value: -20.1082 - type: nauc_precision_at_1_std value: 13.5123 - type: nauc_precision_at_1_diff1 value: 16.7562 - type: nauc_precision_at_3_max value: -11.2979 - type: nauc_precision_at_3_std value: 23.0876 - type: nauc_precision_at_3_diff1 value: 20.738 - type: nauc_precision_at_5_max value: -18.1198 - type: nauc_precision_at_5_std value: -2.4168 - type: nauc_precision_at_5_diff1 value: 5.1223 - type: nauc_precision_at_10_max value: -4.7656 - type: nauc_precision_at_10_std value: 1.5377 - type: nauc_precision_at_10_diff1 value: 8.2175 - type: nauc_precision_at_20_max value: 7.571999999999999 - type: nauc_precision_at_20_std value: 17.309 - type: nauc_precision_at_20_diff1 value: 5.2156 - type: nauc_precision_at_100_max value: 35.02 - type: nauc_precision_at_100_std value: 57.2867 - type: nauc_precision_at_100_diff1 value: -12.814200000000001 - type: nauc_precision_at_1000_max value: 54.8988 - type: nauc_precision_at_1000_std value: 55.970699999999994 - type: nauc_precision_at_1000_diff1 value: -36.8074 - type: nauc_mrr_at_1_max value: -20.1082 - type: nauc_mrr_at_1_std value: 13.5123 - type: nauc_mrr_at_1_diff1 value: 16.7562 - type: nauc_mrr_at_3_max value: -23.668300000000002 - type: nauc_mrr_at_3_std value: 16.883699999999997 - type: nauc_mrr_at_3_diff1 value: 20.6687 - type: nauc_mrr_at_5_max value: -23.668300000000002 - type: nauc_mrr_at_5_std value: 16.883699999999997 - type: nauc_mrr_at_5_diff1 value: 20.6687 - type: nauc_mrr_at_10_max value: -21.8234 - type: nauc_mrr_at_10_std value: 15.1609 - type: nauc_mrr_at_10_diff1 value: 19.6023 - type: nauc_mrr_at_20_max value: -21.8234 - type: nauc_mrr_at_20_std value: 15.1609 - type: nauc_mrr_at_20_diff1 value: 19.6023 - type: nauc_mrr_at_100_max value: -21.8234 - type: nauc_mrr_at_100_std value: 15.1609 - type: nauc_mrr_at_100_diff1 value: 19.6023 - type: nauc_mrr_at_1000_max value: -21.8234 - type: nauc_mrr_at_1000_std value: 15.1609 - type: nauc_mrr_at_1000_diff1 value: 19.6023 - type: main_score value: 47.571999999999996 task: type: Retrieval - dataset: config: default name: MTEB ToxicConversationsClassification (default) revision: edfaf9da55d3dd50d43143d90c1ac476895ae6de split: test type: mteb/toxic_conversations_50k metrics: - type: accuracy value: 63.608399999999996 - type: f1 value: 48.6248 - type: f1_weighted value: 71.6158 - type: ap value: 10.9541 - type: ap_weighted value: 10.9541 - type: main_score value: 63.608399999999996 task: type: Classification - dataset: config: default name: MTEB TweetSentimentExtractionClassification (default) revision: d604517c81ca91fe16a244d1248fc021f9ecee7a split: test type: mteb/tweet_sentiment_extraction metrics: - type: accuracy value: 60.506499999999996 - type: f1 value: 60.711499999999994 - type: f1_weighted value: 59.695699999999995 - type: main_score value: 60.506499999999996 task: type: Classification - dataset: config: default name: MTEB TwentyNewsgroupsClustering.v2 (default) revision: 6125ec4e24fa026cec8a478383ee943acfbd5449 split: test type: mteb/twentynewsgroups-clustering metrics: - type: v_measure value: 33.5462 - type: v_measure_std value: 1.3361 - type: main_score value: 33.5462 task: type: Clustering - dataset: config: default name: MTEB TwitterSemEval2015 (default) revision: 70970daeab8776df92f5ea462b6173c0b46fd2d1 split: test type: mteb/twittersemeval2015-pairclassification metrics: - type: similarity_accuracy value: 82.51180000000001 - type: similarity_accuracy_threshold value: 69.4516 - type: similarity_f1 value: 58.483399999999996 - type: similarity_f1_threshold value: 61.3852 - type: similarity_precision value: 56.29880000000001 - type: similarity_recall value: 60.8443 - type: similarity_ap value: 61.8784 - type: cosine_accuracy value: 82.51180000000001 - type: cosine_accuracy_threshold value: 69.4516 - type: cosine_f1 value: 58.483399999999996 - type: cosine_f1_threshold value: 61.3852 - type: cosine_precision value: 56.29880000000001 - type: cosine_recall value: 60.8443 - type: cosine_ap value: 61.8784 - type: manhattan_accuracy value: 82.60119999999999 - type: manhattan_accuracy_threshold value: 1395.2354 - type: manhattan_f1 value: 59.3387 - type: manhattan_f1_threshold value: 1544.4108 - type: manhattan_precision value: 56.284 - type: manhattan_recall value: 62.7441 - type: manhattan_ap value: 62.407999999999994 - type: euclidean_accuracy value: 82.51180000000001 - type: euclidean_accuracy_threshold value: 78.1645 - type: euclidean_f1 value: 58.483399999999996 - type: euclidean_f1_threshold value: 87.88040000000001 - type: euclidean_precision value: 56.29880000000001 - type: euclidean_recall value: 60.8443 - type: euclidean_ap value: 61.8784 - type: dot_accuracy value: 82.51180000000001 - type: dot_accuracy_threshold value: 69.4516 - type: dot_f1 value: 58.483399999999996 - type: dot_f1_threshold value: 61.3852 - type: dot_precision value: 56.29880000000001 - type: dot_recall value: 60.8443 - type: dot_ap value: 61.8784 - type: max_accuracy value: 82.60119999999999 - type: max_f1 value: 59.3387 - type: max_precision value: 56.29880000000001 - type: max_recall value: 62.7441 - type: max_ap value: 62.407999999999994 - type: main_score value: 62.407999999999994 task: type: PairClassification - dataset: config: default name: MTEB TwitterURLCorpus (default) revision: 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf split: test type: mteb/twitterurlcorpus-pairclassification metrics: - type: similarity_accuracy value: 87.84880000000001 - type: similarity_accuracy_threshold value: 62.77890000000001 - type: similarity_f1 value: 75.968 - type: similarity_f1_threshold value: 57.5925 - type: similarity_precision value: 71.909 - type: similarity_recall value: 80.5128 - type: similarity_ap value: 83.6557 - type: cosine_accuracy value: 87.84880000000001 - type: cosine_accuracy_threshold value: 62.77890000000001 - type: cosine_f1 value: 75.968 - type: cosine_f1_threshold value: 57.5925 - type: cosine_precision value: 71.909 - type: cosine_recall value: 80.5128 - type: cosine_ap value: 83.6557 - type: manhattan_accuracy value: 87.69940000000001 - type: manhattan_accuracy_threshold value: 1524.1733 - type: manhattan_f1 value: 76.01830000000001 - type: manhattan_f1_threshold value: 1597.1845 - type: manhattan_precision value: 72.981 - type: manhattan_recall value: 79.3194 - type: manhattan_ap value: 83.63629999999999 - type: euclidean_accuracy value: 87.84880000000001 - type: euclidean_accuracy_threshold value: 86.2799 - type: euclidean_f1 value: 75.968 - type: euclidean_f1_threshold value: 92.0951 - type: euclidean_precision value: 71.909 - type: euclidean_recall value: 80.5128 - type: euclidean_ap value: 83.6557 - type: dot_accuracy value: 87.84880000000001 - type: dot_accuracy_threshold value: 62.77890000000001 - type: dot_f1 value: 75.968 - type: dot_f1_threshold value: 57.5925 - type: dot_precision value: 71.909 - type: dot_recall value: 80.5128 - type: dot_ap value: 83.6557 - type: max_accuracy value: 87.84880000000001 - type: max_f1 value: 76.01830000000001 - type: max_precision value: 72.981 - type: max_recall value: 80.5128 - type: max_ap value: 83.6557 - type: main_score value: 83.6557 task: type: PairClassification license: apache-2.0 --- # RetrievaEmbedding-01: AMBER The **AMBER (Adaptive Multitask Bilingual Embedding Representations)** is a text embedding model trained by Retrieva, Inc. This model is primarily designed for Japanese, but it also supports English. We trained this model on various datasets related to Japanese and English. This model size is 132M parameters (base size). ## Model Details ### Model Description The AMBER model is a text embedding model based on the [sbintuitions/modernbert-ja-130m](https://huggingface.co/sbintuitions/modernbert-ja-130m) architecture, designed for Japanese text. This model was trained on a variety of datasets related to Japanese, and also includes English datasets. The model can be used for English text as well. During training, prompts (instructions) in natural language were included, allowing the model to generate embeddings tailored to specific tasks. - **Developed by:** Retrieva, Inc. - **Model type:** Based on the [ModernBERT](https://arxiv.org/abs/2412.13663) Architecture. - **Language(s) (NLP):** Primarily Japanese (optional support for English). - **License:** Apache 2.0 - **Finetuned from model:** `sbintuitions/modernbert-ja-130m` - **Model Type:** Sentence Transformer - **Maximum Sequence Length:** 512 tokens - **Output Dimensionality:** 512 dimensions - **Similarity Function:** Cosine Similarity ## Uses ## How to Get Started with the Model ### Install Library First install the python library using pip: ```bash pip install sentence-transformers sentencepiece ``` ### Run Inference Then you can load this model and run inference. You can specify the prompt at inference time by adding an argument called `prompt` to `model.encode`. The prompts used in the Japanese benchmark are described in `jmteb/tasks`, and the prompts used in the English benchmark are described in `mteb/models/retrieva_en.py`. ```python from sentence_transformers import SentenceTransformer # Download from the 🤗 Hub model = SentenceTransformer("retrieva-jp/amber-base") # Run inference queries = [ "自然言語処理とはなんですか?", "株式会社レトリバについて教えて", ] documents = [ "自然言語処理(しぜんげんごしょり、英語: Natural language processing、略称:NLP)は、人間が日常的に使っている自然言語をコンピュータに処理させる一連の技術であり、人工知能と言語学の一分野である。", "株式会社レトリバは、自然言語処理と機械学習を核としたAI技術で組織の課題解決を支援するテクノロジー企業である。", ] queries_embeddings = model.encode(queries, prompt_name="Retrieval-query") documents_embeddings = model.encode(documents, prompt_name="Retrieval-passage") similarities = model.similarity(queries_embeddings, documents_embeddings) print(similarities.shape) ``` ## Training Details ### Training Data We used multiple datasets to train this model. We selected datasets from [llm-jp-eval](https://github.com/llm-jp/llm-jp-eval), [llm-japanese-dataset](https://github.com/masanorihirano/llm-japanese-dataset), and [hpprc/emb](https://huggingface.co/datasets/hpprc/emb) for Japanese datasets. For English datasets, we mainly used some of the datasets utilized in [Asai et al. (2023)](https://arxiv.org/abs/2211.09260). Additionally, we partially used the English datasets at [the sentence-transformers repository](https://huggingface.co/sentence-transformers) and [kilt-tasks](https://huggingface.co/datasets/facebook/kilt_tasks). To consider cross-lingual between Japanese and English, we also used translation datasets between Japanese and English. For Japanese, we used synthetic data created by LLM to prepare a sufficient amount of training data. ## Evaluation We evaluated the model on the following benchmarks: - Japanese Benchmark: [JMTEB](https://github.com/sbintuitions/JMTEB) - Japanese Retrieval Tasks: [JQaRA](https://github.com/hotchpotch/JQaRA/), [JaCWIR](https://github.com/hotchpotch/JaCWIR/), [MLDR Japanese Subset](https://huggingface.co/datasets/Shitao/MLDR) - English Benchmark: [MTEB(eng, v2)](https://github.com/embeddings-benchmark/mteb). The scores in the table are all calculated by us unless otherwise noted. ### Japanese Benchmark: JMTEB Note that the `Mean (TaskType)` in the following leaderboard is the same as the `Avg.` in the original JMTEB leaderboard. The files used for evaluation are stored in the `jmteb` directory. | Model | # Parameters | Mean (TaskType) | Mean (Task) | Retrieval | STS | Classification | Reranking | Clustering | PairClassification | | :--- | --- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | | base models | < 300M | | | | | | | | | | [cl-nagoya/ruri-base](https://huggingface.co/cl-nagoya/ruri-base) | 111M | 72.60 | 71.56 | 69.53 | 82.87 | 75.49 | 92.91 | 52.40 | 62.38 | | AMBER-base
(this model) | 130M | 72.12 | 72.12 | **73.40** | 77.81 | **76.14** | **93.27** | 48.05 | **64.03** | | [pkshatech/GLuCoSE-base-ja-v2](https://huggingface.co/pkshatech/GLuCoSE-base-ja-v2) | 133M | **72.89** | **72.47** | 73.03 | **82.96** | 74.02 | 93.01 | 51.96 | 62.37 | | [pkshatech/RoSEtta-base-ja](https://huggingface.co/pkshatech/RoSEtta-base-ja) | 190M | 72.49 | 72.05 | 73.14 | 81.39 | 72.37 | 92.69 | **53.60** | 61.74 | | [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) | 278M | 71.11 | 69.72 | 69.45 | 80.45 | 69.86 | 92.90 | 51.62 | 62.35 | | large models | 300M < | | | | | | | | | | [AMBER-large](https://huggingface.co/retrieva-jp/amber-large) | 315M | 72.52 | **73.22** | **75.40** | 79.32 | 77.14 | **93.54** | 48.73 | 60.97 | | [cl-nagoya/ruri-large](https://huggingface.co/cl-nagoya/ruri-large) | 337M | **73.20** | 73.06 | 72.86 | **83.14** | **77.15** | 93.00 | 50.78 | 62.29 | | [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) | 560M | 72.06 | 71.29 | 71.71 | 80.87 | 72.45 | 93.29 | **51.59** | **62.42** | ### Japanese Retrieval Tasks: JQaRA, JaCWIR, MLDR Japanese Subset The files used for MLDR are stored in the `mldr` directory. The prompts used in JQaRA and JaCWIR are `Retrieval-query` and `Retrieval-passage` described in `config_sentence_transformers.json`. | Model | # Parameters | JQaRA (nDCG@10) | JaCWIR (MAP@10) | MLDR Japanese Subset (nDCG@10) | | :--- | --- | ---: | ---: | ---: | | base models | < 300M | | | | | [cl-nagoya/ruri-base](https://huggingface.co/cl-nagoya/ruri-base) | 111M | 58.4 | 83.3 | 32.77 | | AMBER-base
(this model) | 130M | 57.1 | 81.6 | **35.69** | | [pkshatech/GLuCoSE-base-ja-v2](https://huggingface.co/pkshatech/GLuCoSE-base-ja-v2) | 133M | **60.6** | **85.3** | 33.99 | | [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) | 278M | 47.1 | **85.3** | 25.46 | | large models | 300M < | | | | | [AMBER-large](https://huggingface.co/retrieva-jp/amber-large) | 315M | 62.5 | 82.4 | 34.57 | | [cl-nagoya/ruri-large](https://huggingface.co/cl-nagoya/ruri-large) | 337M | **62.8** | 82.5 | **34.78** | | [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) | 560M | 55.4| **87.3** | 29.95 | ### English Benchmark: MTEB(eng, v2) The files used for evaluation are stored in the `mteb` directory. | Model | # Parameters | Mean (TaskType) | Mean (Task) | Retrieval | STS | Classification | Reranking | Clustering | PairClassification | Summarization | | :--- | --- | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | ---: | | base models | < 300M | | | | | | | | | | | AMBER-base
(this model) | 130M | 54.75 | 58.20 | 40.11 | **81.29** | 70.39 | 42.98 | **42.27** | 80.12 | 26.08 | | [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) | 278M | **56.21** | **59.75** | **43.22** | 80.50 | **73.84** | **43.87** | 42.19 | **83.74** | **26.10** | | large models | 300M < | | | | | | | | | | | [AMBER-large](https://huggingface.co/retrieva-jp/amber-large) | 315M | 56.08 | 59.13 | 41.04 | **81.52** | 72.23 | 43.83 | **42.71** | 81.00 | **30.21** | | [intfloat/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) | 560M | **57.06** | **60.84** | **46.17** | 81.11 | **74.88** | **44.31** | 41.91 | **84.33** | 26.67 | ## More Information TBA ## Model Card Authors Satoru Katsumata, Daisuke Kimura, Jiro Nishitoba ## Model Card Contact pr[at]retrieva.jp