distily_attn_mlp_sweep / benchmarks.shelve.dat
lapp0's picture
End of training
a20cb45 verified