Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kernels-community
/
quantization
like
3
Follow
kernels-community
110
kernel
License:
apache-2.0
Model card
Files
Files and versions
Community
1
8aa00a3
quantization
/
gptq_marlin
Ctrl+K
Ctrl+K
2 contributors
History:
4 commits
danieldk
HF Staff
Sync to vLLM 20250627
8aa00a3
8 days ago
awq_marlin_repack.cu
Safe
8.8 kB
Sync to vLLM 20250627
8 days ago
dequant.h
Safe
18.6 kB
Sync to vLLM 20250627
8 days ago
generate_kernels.py
Safe
4.39 kB
Sync to vLLM 20250627
8 days ago
gptq_marlin.cu
Safe
35.7 kB
Sync to vLLM 20250627
8 days ago
gptq_marlin_repack.cu
Safe
11.1 kB
Sync to vLLM 20250627
8 days ago
kernel.h
Safe
1.93 kB
Sync to vLLM 20250627
8 days ago
kernel_bf16_kfe2m1f.cu
Safe
2.17 kB
Sync to vLLM 20250627
8 days ago
kernel_bf16_kfe4m3fn.cu
Safe
4.24 kB
Sync to vLLM 20250627
8 days ago
kernel_bf16_ku4.cu
Safe
8.02 kB
Sync to vLLM 20250627
8 days ago
kernel_bf16_ku4b8.cu
Safe
10.1 kB
Sync to vLLM 20250627
8 days ago
kernel_bf16_ku8b128.cu
Safe
10.3 kB
Sync to vLLM 20250627
8 days ago
kernel_fp16_kfe2m1f.cu
Safe
2.06 kB
Sync to vLLM 20250627
8 days ago
kernel_fp16_kfe4m3fn.cu
Safe
4.03 kB
Sync to vLLM 20250627
8 days ago
kernel_fp16_ku4.cu
Safe
9.44 kB
Sync to vLLM 20250627
8 days ago
kernel_fp16_ku4b8.cu
Safe
9.61 kB
Sync to vLLM 20250627
8 days ago
kernel_fp16_ku8b128.cu
Safe
9.76 kB
Sync to vLLM 20250627
8 days ago
marlin.cuh
Safe
2.42 kB
Sync to vLLM 20250627
8 days ago
marlin_dtypes.cuh
Safe
2.1 kB
Sync to vLLM 20250627
8 days ago
marlin_template.h
Safe
67.5 kB
Sync to vLLM 20250627
8 days ago