distily_attn_mlp_sweep / benchmarks.shelve.dir
lapp0
End of training
a20cb45 verified