Model pruning

#179
by Clausss - opened

as far I know(fix me if I am wrong) llama-quantize now supports layer pruning via the --prune-layers flag
so is possible to prune model?

Sign up or log in to comment