Writing TensorRT plugins using Triton and Python

Triton lets you write GPU kernels in Python. To integrate such a kernel into TensorRT, you need to create a plugin.

Since TensorRT 9.1, it is possible to create plugins using the Python API.

In this tutorial we will learn how to write a TensorRT plugin in Python that runs a Triton kernel.

You will need:

pip install -U torch tensorrt cuda-python onnx onnx_graphsurgeon

Then run the example:

python tensorrt_python_plugin.py
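To give a rough idea of the kind of kernel Triton lets you write, here is a minimal element-wise addition sketch. It is not taken from tensorrt_python_plugin.py: the names `add_kernel` and `add` and the block size of 1024 are illustrative assumptions. It needs a CUDA GPU and the `triton` package, which recent Linux builds of torch typically pull in.

```python
import torch
import triton
import triton.language as tl


@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one contiguous block of elements.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    # Mask out offsets past the end of the tensors (last block may be partial).
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)


def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    # Hypothetical host-side wrapper: allocates the output and launches the kernel.
    out = torch.empty_like(x)
    n_elements = out.numel()
    # One program per BLOCK_SIZE elements, rounded up.
    grid = lambda meta: (triton.cdiv(n_elements, meta["BLOCK_SIZE"]),)
    add_kernel[grid](x, y, out, n_elements, BLOCK_SIZE=1024)
    return out
```

In a TensorRT plugin, a launch like the one inside `add` would sit in the plugin's execution method, operating on the input and output device buffers that TensorRT hands to the plugin.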

For more details, please read the post on Medium:
