Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
5
Follow
Red Hat AI
1.31k
kernel
License:
apache-2.0
Model card
Files
Files and versions
Community
1
main
quantization
/
cutlass_w8a8
Ctrl+K
Ctrl+K
2 contributors
History:
4 commits
danieldk
HF Staff
Sync to vLLM 20250627
8aa00a3
29 days ago
c3x
Sync to vLLM 20250627
29 days ago
Epilogues.md
5.68 kB
Sync to vLLM 20250627
29 days ago
scaled_mm_c2x.cu
8.59 kB
Sync with vLLM
7 months ago
scaled_mm_c2x.cuh
7.83 kB
Sync to vLLM 20250627
29 days ago
scaled_mm_c2x_sm75_dispatch.cuh
5.08 kB
Add cutlass_w8a8
8 months ago
scaled_mm_c2x_sm80_dispatch.cuh
5.83 kB
Add cutlass_w8a8
8 months ago
scaled_mm_c2x_sm89_fp8_dispatch.cuh
16.4 kB
Sync to vLLM 20250627
29 days ago
scaled_mm_c2x_sm89_int8_dispatch.cuh
14.9 kB
Sync to vLLM 20250627
29 days ago
scaled_mm_c3x_sm100.cu
758 Bytes
Sync to vLLM 20250627
29 days ago
scaled_mm_c3x_sm90.cu
1.45 kB
Sync to vLLM 20250627
29 days ago
scaled_mm_entry.cu
9.41 kB
Sync to vLLM 20250627
29 days ago