Commit History

Upload eval_results/meta-llama/Llama-2-13b-chat-hf/main/agieval/results_2024-03-28T16-53-48.762881.json with huggingface_hub
5f452f9
verified

lewtun HF staff commited on

Upload eval_results/alvarobartt/mistral-7b-orpo-airoboros-pref-10k/main/agieval/results_2024-03-28T16-53-48.794914.json with huggingface_hub
80d2acf
verified

lewtun HF staff commited on

Upload eval_results/meta-llama/Llama-2-7b-chat-hf/main/agieval/results_2024-03-28T16-52-55.106765.json with huggingface_hub
4e6391d
verified

lewtun HF staff commited on

Upload eval_results/alvarobartt/mistral-7b-orpo-airoboros-pref-10k/main/bbh/results_2024-03-28T16-50-37.614589.json with huggingface_hub
4405a99
verified

lewtun HF staff commited on

Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/agieval/results_2024-03-28T16-50-16.504147.json with huggingface_hub
ea2024b
verified

lewtun HF staff commited on

Upload eval_results/meta-llama/Llama-2-7b-chat-hf/main/bbh/results_2024-03-28T16-50-14.791397.json with huggingface_hub
393e212
verified

lewtun HF staff commited on

Upload eval_results/meta-llama/Llama-2-13b-chat-hf/main/bbh/results_2024-03-28T16-49-39.720727.json with huggingface_hub
3c1de15
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/starcoder2-15b-dpo/v4.1/agieval/results_2024-03-28T16-48-35.789845.json with huggingface_hub
90690d1
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/starcoder2-15b-dpo/v4.1/bbh/results_2024-03-28T16-46-39.712895.json with huggingface_hub
5b24793
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.2/main/agieval/results_2024-03-28T16-44-41.848289.json with huggingface_hub
b9e292c
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.1/main/agieval/results_2024-03-28T16-44-00.331653.json with huggingface_hub
ee217c8
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.2/main/bbh/results_2024-03-28T16-43-06.532358.json with huggingface_hub
93d8112
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.1/main/bbh/results_2024-03-28T16-42-37.358150.json with huggingface_hub
3c721e3
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-alpha/main/agieval/results_2024-03-28T16-41-57.836994.json with huggingface_hub
7dca1cb
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/agieval/results_2024-03-28T16-41-08.142040.json with huggingface_hub
2961e00
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-gemma-v0.1/main/agieval/results_2024-03-28T16-40-43.592094.json with huggingface_hub
d4240c5
verified

lewtun HF staff commited on

Fix round
a96d97e

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-gemma-v0.1/main/bbh/results_2024-03-28T16-39-37.888825.json with huggingface_hub
5834ead
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/bbh/results_2024-03-28T16-39-32.239458.json with huggingface_hub
89c611e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/agieval/results_2024-03-28T16-38-49.297471.json with huggingface_hub
e63c614
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/bbh/results_2024-03-28T16-37-54.545961.json with huggingface_hub
3945ac9
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/agieval/results_2024-03-28T16-37-40.171062.json with huggingface_hub
e91ca1a
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/agieval/results_2024-03-28T16-36-45.773188.json with huggingface_hub
7fc097e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/bbh/results_2024-03-28T16-36-49.368383.json with huggingface_hub
f1358c6
verified

lewtun HF staff commited on

Fix search
6e537e5

lewtun HF staff commited on

Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-28T16-35-56.180836.json with huggingface_hub
eae468e
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/agieval/results_2024-03-28T16-35-39.599864.json with huggingface_hub
101c175
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/bbh/results_2024-03-28T16-35-21.201380.json with huggingface_hub
8a5a47d
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-28T16-35-20.182652.json with huggingface_hub
0413bdb
verified

lewtun HF staff commited on

Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-28T16-34-40.956556.json with huggingface_hub
3d2d1e5
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/agieval/results_2024-03-28T16-34-28.540955.json with huggingface_hub
156b699
verified

lewtun HF staff commited on

Upload eval_results/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/main/bbh/results_2024-03-28T16-33-31.664648.json with huggingface_hub
8b1bdca
verified

lewtun HF staff commited on

Bump gradio
86f22b4

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-7B-Chat/main/bbh/results_2024-03-28T16-32-02.577573.json with huggingface_hub
010fbea
verified

lewtun HF staff commited on

Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/bbh/results_2024-03-28T16-31-39.052861.json with huggingface_hub
66fe7cb
verified

lewtun HF staff commited on

Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/bbh/results_2024-03-28T16-30-43.156201.json with huggingface_hub
089496a
verified

lewtun HF staff commited on

Upload eval_results/openchat/openchat-3.5-0106/main/agieval/results_2024-03-28T16-28-08.688920.json with huggingface_hub
a9251bd
verified

lewtun HF staff commited on

Upload eval_results/openchat/openchat-3.5-0106/main/bbh/results_2024-03-28T16-27-09.965319.json with huggingface_hub
cb4e203
verified

lewtun HF staff commited on

Reorder columns
c259781

lewtun HF staff commited on

Update evals
88fd41c

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/agieval/results_2024-03-28T15-53-33.021821.json with huggingface_hub
486aa44
verified

lewtun HF staff commited on

Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/bbh/results_2024-03-28T15-51-57.294715.json with huggingface_hub
11555e5
verified

lewtun HF staff commited on

Use qem for BBH
6d771b5

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/zephyr-7b-alpha/main/bbh/results_2024-03-28T15-04-26.956255.json with huggingface_hub
561f184
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.12/gsm8k/results_2024-03-28T14-28-14.339992.json with huggingface_hub
7f10305
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/mmlu/results_2024-03-28T13-58-59.590038.json with huggingface_hub
b03f72d
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.11/mmlu/results_2024-03-28T14-25-38.508285.json with huggingface_hub
79de9a3
verified

edbeeching HF staff commited on

Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-28T14-24-16.417420.json with huggingface_hub
75df6a5
verified

lewtun HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.15/gsm8k/results_2024-03-28T14-10-04.030214.json with huggingface_hub
ac9346c
verified

edbeeching HF staff commited on

Upload eval_results/HuggingFaceH4/mistral-7b-odpo/v1.14/gsm8k/results_2024-03-28T14-06-36.471547.json with huggingface_hub
143c84b
verified

edbeeching HF staff commited on