mlx_lm.convert --hf-path THUDM/LongWriter-llama3.1-8b --mlx-path noguchis/mlx-LongWriter-llama3.1-8b --quantize --q-bits 8 --dtype bfloat16
Downloads last month
8
Safetensors
Model size
2.26B params
Tensor type
FP16
·
U32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.