quantization / ext-torch /__init__.py

Commit History

Expose ops (handy for tests etc.)
e3a7455

danieldk HF staff commited on

Add full Marlin support and tests for Marlin/CUTLASS
165b25c

danieldk HF staff commited on

Import CUTLASS tests and add missing scaled mm with zp signature
2dd62c9

danieldk HF staff commited on

Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
5c6fb68

danieldk HF staff commited on

Add cutlass_w8a8
b4cad21

danieldk HF staff commited on