Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.5/bbh/results_2024-03-19T09-27-11.722655.json with huggingface_hub acad715 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.4/bbh/results_2024-03-19T09-26-27.067508.json with huggingface_hub d64ecd4 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.0/bbh/results_2024-03-19T09-26-07.598400.json with huggingface_hub fdb6355 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.2/bbh/results_2024-03-19T09-26-04.565494.json with huggingface_hub 92d255d verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.3/bbh/results_2024-03-19T09-25-53.215602.json with huggingface_hub fe6a135 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-18T20-58-12.014656.json with huggingface_hub 1956f9c verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/gsm8k/results_2024-03-18T20-52-36.143317.json with huggingface_hub 406cb35 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-18T20-51-01.093533.json with huggingface_hub a866518 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/gsm8k/results_2024-03-18T20-49-24.740849.json with huggingface_hub 4ae95c5 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub e7751e6 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/bbh/results_2024-03-18T20-43-57.392460.json with huggingface_hub ae13434 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/bbh/results_2024-03-18T20-40-36.554904.json with huggingface_hub af53d82 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-2b-it/main/gsm8k/results_2024-03-18T20-39-56.154693.json with huggingface_hub 40e84db verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/main/bbh/results_2024-03-18T20-39-32.747348.json with huggingface_hub d30acf6 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-7b-it/main/bbh/results_2024-03-18T20-38-30.588009.json with huggingface_hub 4c6f00d verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-2b-it/main/bbh/results_2024-03-18T20-36-40.384973.json with huggingface_hub 589bc06 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/bbh/results_2024-03-18T20-36-18.052319.json with huggingface_hub 6cb7ce4 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/bbh/results_2024-03-18T20-33-46.216888.json with huggingface_hub ba09d8b verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/bbh/results_2024-03-18T20-23-46.729702.json with huggingface_hub 737bf1f verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/bbh/results_2024-03-18T20-20-24.322898.json with huggingface_hub 1b733bf verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-18T20-11-24.511185.json with huggingface_hub 754f9ed verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/bbh/results_2024-03-18T19-49-31.908303.json with huggingface_hub 40f3905 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/bbh/results_2024-03-18T19-43-00.075213.json with huggingface_hub 7add925 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/ifeval/results_2024-03-18T19-36-03.767073.json with huggingface_hub 73b2107 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/mmlu/results_2024-03-18T17-01-47.545823.json with huggingface_hub b9d64e1 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/gsm8k/results_2024-03-18T17-01-30.856762.json with huggingface_hub 29394c9 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/hellaswag/results_2024-03-18T16-55-05.213121.json with huggingface_hub 8e02df0 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/arc/results_2024-03-18T16-50-06.159327.json with huggingface_hub 1a1c5de verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/truthfulqa/results_2024-03-18T16-49-56.225132.json with huggingface_hub 3c8fa5b verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/winogrande/results_2024-03-18T16-49-32.523482.json with huggingface_hub 64a7a0d verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.13/gsm8k/results_2024-03-18T16-42-34.222418.json with huggingface_hub 221b580 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.13/ifeval/results_2024-03-18T16-40-04.435577.json with huggingface_hub e9088c8 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.9/gsm8k/results_2024-03-18T15-21-24.552464.json with huggingface_hub 437b3d5 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.9/ifeval/results_2024-03-18T15-18-41.904972.json with huggingface_hub 06c723e verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/gsm8k/results_2024-03-16T22-45-24.270002.json with huggingface_hub 47e1f74 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.7/gsm8k/results_2024-03-16T22-43-25.759701.json with huggingface_hub 63ae265 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.8/gsm8k/results_2024-03-16T22-43-20.193012.json with huggingface_hub f7627c3 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/ifeval/results_2024-03-16T22-42-33.074718.json with huggingface_hub 2836f47 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.7/ifeval/results_2024-03-16T22-41-15.459095.json with huggingface_hub 3374a12 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.8/ifeval/results_2024-03-16T22-40-25.432296.json with huggingface_hub d2783be verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.3/gsm8k/results_2024-03-14T23-26-33.789538.json with huggingface_hub 5136f11 verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.3/ifeval/results_2024-03-14T23-22-03.186605.json with huggingface_hub 4bf4caa verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.7/gsm8k/results_2024-03-14T22-48-09.910141.json with huggingface_hub d67ef71 verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.7/ifeval/results_2024-03-14T22-41-25.248142.json with huggingface_hub 781f88b verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.6/gsm8k/results_2024-03-14T20-44-20.773569.json with huggingface_hub b60d877 verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-dpo/v0.2.6/ifeval/results_2024-03-14T20-36-49.238793.json with huggingface_hub 46ddb36 verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/gsm8k/results_2024-03-14T18-39-43.255609.json with huggingface_hub dd32f1c verified lewtun HF staff commited on Mar 14, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/ifeval/results_2024-03-14T18-32-27.610398.json with huggingface_hub 71c0675 verified lewtun HF staff commited on Mar 14, 2024