Commit History

Upload eval_results/HuggingFaceH4/zephyr-7b-alpha/main/agieval/results_2024-03-28T16-41-57.836994.json with huggingface_hub
7dca1cb
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/agieval/results_2024-03-28T16-41-08.142040.json with huggingface_hub
2961e00
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-gemma-v0.1/main/agieval/results_2024-03-28T16-40-43.592094.json with huggingface_hub
d4240c5
verified

lewtun HF staff committed on

Fix round
a96d97e

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-gemma-v0.1/main/bbh/results_2024-03-28T16-39-37.888825.json with huggingface_hub
5834ead
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/bbh/results_2024-03-28T16-39-32.239458.json with huggingface_hub
89c611e
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/agieval/results_2024-03-28T16-38-49.297471.json with huggingface_hub
e63c614
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/bbh/results_2024-03-28T16-37-54.545961.json with huggingface_hub
3945ac9
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/agieval/results_2024-03-28T16-37-40.171062.json with huggingface_hub
e91ca1a
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/agieval/results_2024-03-28T16-36-45.773188.json with huggingface_hub
7fc097e
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/bbh/results_2024-03-28T16-36-49.368383.json with huggingface_hub
f1358c6
verified

lewtun HF staff committed on

Fix search
6e537e5

lewtun HF staff committed on

Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-28T16-35-56.180836.json with huggingface_hub
eae468e
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/agieval/results_2024-03-28T16-35-39.599864.json with huggingface_hub
101c175
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/bbh/results_2024-03-28T16-35-21.201380.json with huggingface_hub
8a5a47d
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-28T16-35-20.182652.json with huggingface_hub
0413bdb
verified

lewtun HF staff committed on

Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-28T16-34-40.956556.json with huggingface_hub
3d2d1e5
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/agieval/results_2024-03-28T16-34-28.540955.json with huggingface_hub
156b699
verified

lewtun HF staff committed on

Upload eval_results/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/main/bbh/results_2024-03-28T16-33-31.664648.json with huggingface_hub
8b1bdca
verified

lewtun HF staff committed on

Bump gradio
86f22b4

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/bbh/results_2024-03-28T16-32-02.577573.json with huggingface_hub
010fbea
verified

lewtun HF staff committed on

Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/bbh/results_2024-03-28T16-31-39.052861.json with huggingface_hub
66fe7cb
verified

lewtun HF staff committed on

Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/bbh/results_2024-03-28T16-30-43.156201.json with huggingface_hub
089496a
verified

lewtun HF staff committed on

Upload eval_results/openchat/openchat-3.5-0106/main/agieval/results_2024-03-28T16-28-08.688920.json with huggingface_hub
a9251bd
verified

lewtun HF staff committed on

Upload eval_results/openchat/openchat-3.5-0106/main/bbh/results_2024-03-28T16-27-09.965319.json with huggingface_hub
cb4e203
verified

lewtun HF staff committed on

Reorder columns
c259781

lewtun HF staff committed on

Update evals
88fd41c

lewtun HF staff committed on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/agieval/results_2024-03-28T15-53-33.021821.json with huggingface_hub
486aa44
verified

lewtun HF staff committed on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/bbh/results_2024-03-28T15-51-57.294715.json with huggingface_hub
11555e5
verified

lewtun HF staff committed on

Use qem for BBH
6d771b5

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-alpha/main/bbh/results_2024-03-28T15-04-26.956255.json with huggingface_hub
561f184
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.12/gsm8k/results_2024-03-28T14-28-14.339992.json with huggingface_hub
7f10305
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/mmlu/results_2024-03-28T13-58-59.590038.json with huggingface_hub
b03f72d
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/mmlu/results_2024-03-28T14-25-38.508285.json with huggingface_hub
79de9a3
verified

edbeeching HF staff committed on

Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-28T14-24-16.417420.json with huggingface_hub
75df6a5
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.15/gsm8k/results_2024-03-28T14-10-04.030214.json with huggingface_hub
ac9346c
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.14/gsm8k/results_2024-03-28T14-06-36.471547.json with huggingface_hub
143c84b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/gsm8k/results_2024-03-28T14-05-29.366088.json with huggingface_hub
292409b
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.14/mmlu/results_2024-03-28T14-01-38.404907.json with huggingface_hub
8b47767
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.15/hellaswag/results_2024-03-28T13-55-13.460342.json with huggingface_hub
6dbf3a8
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.10/gsm8k/results_2024-03-28T13-55-45.736641.json with huggingface_hub
03d2b25
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.12/mmlu/results_2024-03-28T13-54-30.081224.json with huggingface_hub
4204a6f
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/ifeval/results_2024-03-28T13-55-10.959962.json with huggingface_hub
4a5e806
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.14/hellaswag/results_2024-03-28T13-53-26.974752.json with huggingface_hub
6242038
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.9/gsm8k/results_2024-03-28T13-53-27.577239.json with huggingface_hub
a26a535
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/hellaswag/results_2024-03-28T13-51-12.990475.json with huggingface_hub
e00a8f0
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.10/mmlu/results_2024-03-28T13-49-47.123685.json with huggingface_hub
b0e7cd1
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/bbh/results_2024-03-28T13-49-45.748302.json with huggingface_hub
0b07157
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.9/mmlu/results_2024-03-28T13-48-32.137790.json with huggingface_hub
77b3925
verified

edbeeching HF staff committed on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.15/winogrande/results_2024-03-28T13-49-17.864327.json with huggingface_hub
7d78de8
verified

edbeeching HF staff committed on