neuralmagic/Llama-2-7b-ultrachat200k-pruned_50-quantized-deepsparse Text Generation • Updated May 7, 2024 • 17
neuralmagic/Llama-2-7b-ultrachat200k-pruned_70-quantized-deepsparse Text Generation • Updated May 15, 2024 • 17
neuralmagic/Llama-2-7b-evol-code-alpaca-pruned_50-quantized-deepsparse Text Generation • Updated May 15, 2024 • 15
neuralmagic/Llama-2-7b-evol-code-alpaca-pruned_70-quantized-deepsparse Text Generation • Updated May 15, 2024 • 14
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse Text Generation • Updated May 16, 2024 • 17
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse Text Generation • Updated May 16, 2024 • 11 • 1
RichardErkhov/neuralmagic_-_Llama-2-7b-evolcodealpaca-4bits Text Generation • Updated May 10, 2024 • 80
RichardErkhov/neuralmagic_-_Llama-2-7b-evolcodealpaca-8bits Text Generation • Updated May 10, 2024 • 80
RichardErkhov/neuralmagic_-_Llama-2-7b-dolphin-open_platypus-pruned_70-gguf Updated Jul 16, 2024 • 20
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • Updated Dec 19, 2024 • 41 • 1
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 Text Generation • Updated Dec 19, 2024 • 130 • 3