{
  "auxk": 256,
  "dead_steps_threshold": 76,
  "dead_threshold": 0.001,
  "k": 128,
  "n_inputs": 768,
  "n_latents": 49152,
  "standardize": true
}
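
The file does not document its fields, so the following is only a minimal sketch of how such a config might be parsed and sanity-checked in Python. The class name `SaeConfig`, the helper `load_config`, and the derived properties are hypothetical and are not part of the repository. Reading `k` as a count of active latents out of `n_latents` is an assumption based on the field names alone.

```python
import json
from dataclasses import dataclass

# The config values copied verbatim from the file above.
CONFIG_TEXT = """
{
  "auxk": 256,
  "dead_steps_threshold": 76,
  "dead_threshold": 0.001,
  "k": 128,
  "n_inputs": 768,
  "n_latents": 49152,
  "standardize": true
}
"""


@dataclass(frozen=True)
class SaeConfig:
    """Hypothetical typed container; field names mirror the JSON keys."""

    auxk: int
    dead_steps_threshold: int
    dead_threshold: float
    k: int
    n_inputs: int
    n_latents: int
    standardize: bool

    @property
    def expansion_factor(self) -> float:
        # Ratio of latent width to input width.
        return self.n_latents / self.n_inputs

    @property
    def active_fraction(self) -> float:
        # Assumption: k latents out of n_latents are active per input.
        return self.k / self.n_latents


def load_config(text: str) -> SaeConfig:
    """Parse JSON text and reject obviously inconsistent sizes."""
    config = SaeConfig(**json.loads(text))
    if not 0 < config.k <= config.n_latents:
        raise ValueError("k must be in the range 1..n_latents")
    if config.n_inputs <= 0:
        raise ValueError("n_inputs must be positive")
    return config


config = load_config(CONFIG_TEXT)
print(config.expansion_factor)
```

Using a frozen dataclass means an unknown or missing key fails at load time with a `TypeError`, rather than surfacing later as a silent default.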