Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

kernels-community
/
quantization

kernel
Model card Files Files and versions Community
1
quantization
Ctrl+K
Ctrl+K
  • 2 contributors
History: 37 commits
danieldk's picture
danieldk HF Staff
Sync to vLLM 20250627
8aa00a3 4 days ago
  • attention
    Sync to vLLM 20250627 4 days ago
  • build
    Build (aarch64) 3 months ago
  • compressed_tensors
    Sync to vLLM 20250627 4 days ago
  • core
    Sync to vLLM 20250627 4 days ago
  • cutlass_extensions
    Sync to vLLM 20250627 4 days ago
  • cutlass_w8a8
    Sync to vLLM 20250627 4 days ago
  • fp8
    Sync to vLLM 20250627 4 days ago
  • gptq_marlin
    Sync to vLLM 20250627 4 days ago
  • marlin
    Sync to vLLM 20250627 4 days ago
  • tests
    Sync to vLLM 20250627 4 days ago
  • torch-ext
    Sync to vLLM 20250627 4 days ago
  • .gitattributes
    1.56 kB
    Build 7 months ago
  • LICENSE
    11.4 kB
    Add cutlass_w8a8 7 months ago
  • README.md
    195 Bytes
    Update README.md (#1) 4 months ago
  • build.toml
    5.83 kB
    Sync to vLLM 20250627 4 days ago
  • cuda_utils.h
    1.41 kB
    Sync on vLLM 20240402 3 months ago
  • dispatch_utils.h
    3.9 kB
    Sync to vLLM 20250627 4 days ago
  • flake.lock
    4.47 kB
    Sync to vLLM 20250627 4 days ago
  • flake.nix
    335 Bytes
    Add support for ROCm 3 months ago
  • utils.cuh
    1.84 kB
    Sync on vLLM 20240402 3 months ago
  • vectorization.cuh
    878 Bytes
    Sync to vLLM 20250627 4 days ago
  • vectorization_utils.cuh
    2.61 kB
    Sync to vLLM 20250627 4 days ago