RedHatAI/quantization
kernel
License: apache-2.0
Commit History
Add GPTQ-Marlin · c31b5ce · danieldk (HF Staff) · committed on Dec 10, 2024
Build · c5018b2 · danieldk (HF Staff) · committed on Dec 9, 2024
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` · 5c6fb68 · danieldk (HF Staff) · committed on Dec 9, 2024
Build · a77838d · danieldk (HF Staff) · committed on Dec 9, 2024
Fixup metadata · c7e38f0 · danieldk (HF Staff) · committed on Dec 9, 2024
Add cutlass_w8a8 · b4cad21 · danieldk (HF Staff) · committed on Dec 9, 2024
initial commit · e87d8e6 (verified) · danieldk (HF Staff) · committed on Dec 9, 2024