Skip to content

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

License

Notifications You must be signed in to change notification settings

maqtech/cutlass_fpA_intB_gemm

About

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 86.6%
  • Cuda 12.3%
  • CMake 1.1%