GGUF importance matrix (imatrix) quants for https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B

Layers: 62
Context: 16384
Template:
<s>system
{instructions}
<s>human
{prompt}
<s>bot
{response}<|end▁of▁sentence|>
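
The template above can be filled in programmatically. The following is a minimal sketch, not the upstream project's own code: the helper names `build_prompt` and `strip_eos` are hypothetical, and it assumes each role marker and each field is separated by a single newline, as the layout above suggests. Whether your runtime already prepends a BOS token on its own is not stated here, so check that before sending the string as-is.

```python
# End-of-sequence marker exactly as it appears in the template.
EOS = "<|end▁of▁sentence|>"


def build_prompt(instructions: str, prompt: str) -> str:
    """Fill the template up to the point where the model starts generating.

    Assumption: single newlines separate the role markers and fields.
    """
    return (
        f"<s>system\n{instructions}\n"
        f"<s>human\n{prompt}\n"
        f"<s>bot\n"
    )


def strip_eos(response: str) -> str:
    """Cut generated text at the end-of-sentence marker, if present."""
    return response.split(EOS, 1)[0]


if __name__ == "__main__":
    text = build_prompt(
        "You are a helpful coding assistant.",
        "Write hello world in C.",
    )
    print(text)
```

The prompt stops after the `<s>bot` line so that the model produces `{response}` itself; `strip_eos` then trims anything emitted after the end-of-sentence marker.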

Format: GGUF
Model size: 33.3B params
Architecture: llama
Available quantizations: 3-bit, 4-bit, 8-bit