Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kernels-community
/
quantization
like
2
Follow
kernels-community
107
kernel
License:
apache-2.0
Model card
Files
Files and versions
Community
1
8aa00a3
quantization
Ctrl+K
Ctrl+K
2 contributors
History:
37 commits
danieldk
HF Staff
Sync to vLLM 20250627
8aa00a3
4 days ago
attention
Sync to vLLM 20250627
4 days ago
build
Build (aarch64)
3 months ago
compressed_tensors
Sync to vLLM 20250627
4 days ago
core
Sync to vLLM 20250627
4 days ago
cutlass_extensions
Sync to vLLM 20250627
4 days ago
cutlass_w8a8
Sync to vLLM 20250627
4 days ago
fp8
Sync to vLLM 20250627
4 days ago
gptq_marlin
Sync to vLLM 20250627
4 days ago
marlin
Sync to vLLM 20250627
4 days ago
tests
Sync to vLLM 20250627
4 days ago
torch-ext
Sync to vLLM 20250627
4 days ago
.gitattributes
Safe
1.56 kB
Build
7 months ago
LICENSE
Safe
11.4 kB
Add cutlass_w8a8
7 months ago
README.md
Safe
195 Bytes
Update README.md (#1)
4 months ago
build.toml
5.83 kB
Sync to vLLM 20250627
4 days ago
cuda_utils.h
1.41 kB
Sync on vLLM 20240402
3 months ago
dispatch_utils.h
3.9 kB
Sync to vLLM 20250627
4 days ago
flake.lock
4.47 kB
Sync to vLLM 20250627
4 days ago
flake.nix
Safe
335 Bytes
Add support for ROCm
3 months ago
utils.cuh
1.84 kB
Sync on vLLM 20240402
3 months ago
vectorization.cuh
878 Bytes
Sync to vLLM 20250627
4 days ago
vectorization_utils.cuh
2.61 kB
Sync to vLLM 20250627
4 days ago