license: apache-2.0 | |
tags: | |
- kernel | |
## triton-scaled-mm | |
Triton scaled matrix multiplication kernel [from vLLM](https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/quantization/compressed_tensors/triton_scaled_mm.py). | |
license: apache-2.0 | |
tags: | |
- kernel | |
## triton-scaled-mm | |
Triton scaled matrix multiplication kernel [from vLLM](https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/quantization/compressed_tensors/triton_scaled_mm.py). | |