[Add] add bbh and gpqa benchmarks again with correct answer_index selection 0a42e99 Joschka Strueber commited on Feb 7