TensorRT OSS Release Changelog

21.04 - 2021-04-12

Added

SM86 kernels for BERT MHA plugin
Added opset13 support for SoftMax, LogSoftmax, Squeeze, and Unsqueeze.
Added support for the EyeLike and GatherElements operators.

Changed

Updated TensorRT version to v7.2.3.4.
Update to ONNX-TensorRT 21.03
ONNX-GraphSurgeon (v0.3.4) - updates fold_constants to correctly exit early.
Set default CUDA_INSTALL_DIR #798
Plugin bugfixes, qkv kernels for sm86
Fixed GroupNorm CMakeFile for cu sources #1083
Permit groupadd with non-unique GID in build containers #1091
Avoid reinterpret_cast #146
Clang-format plugins and samples
Avoid arithmetic on void pointer in multilevelProposeROIPlugin.cpp #1028
Update BERT plugin documentation.

Removed

Removes extra terminate call in InstanceNorm

21.03 - 2021-03-09

Added

Optimized FP16 NMS/batchedNMS plugins with n-bit radix sort and based on IPluginV2DynamicExt
ProposalDynamic and CropAndResizeDynamic plugins based on IPluginV2DynamicExt

Changed

Removed

N/A

21.02 - 2021-02-01

Added

TensorRT Python API bindings
TensorRT Python samples
FP16 support to batchedNMSPlugin #1002
Configurable input size for TLT MaskRCNN Plugin #986

Changed

TensorRT version updated to 7.2.2.3
ONNX-TensorRT v21.02 update
Polygraphy v0.21.1 update
PyTorch-Quantization Toolkit v2.1.0 update
- Documentation update, ONNX opset 13 support, ResNet example
ONNX-GraphSurgeon v0.28 update
demoBERT builder updated to work with Tensorflow2 (in compatibility mode)
Refactor Dockerfiles for OSS container

Removed

N/A

20.12 - 2020-12-18

Added

Add configurable input size for TLT MaskRCNN Plugin

Changed

Update symbol export map for plugins
Correctly use channel dimension when creating Prelu node
Fix Jetson cross compilation CMakefile

Removed

N/A

20.11 - 2020-11-20

Added

API documentation for ONNX-GraphSurgeon

Changed

Support for SM86 in demoBERT
Updated NGC checkpoint URLs for demoBERT and Tacotron2.

Removed

N/A

20.10 - 2020-10-22

Added

Polygraphy v0.20.13 - Deep Learning Inference Prototyping and Debugging Toolkit
PyTorch-Quantization Toolkit v2.0.0
Updated BERT plugins for variable sequence length inputs
Optimized kernels for sequence lengths of 64 and 96 added
Added Tacotron2 + Waveglow TTS demo #677
Re-enable GridAnchorRect_TRT plugin with rectangular feature maps #679
Update batchedNMS plugin to IPluginV2DynamicExt interface #738
Support 3D inputs in InstanceNormalization plugin #745
Added this CHANGELOG.md

Changed

ONNX GraphSurgeon - v0.2.7 with bugfixes, new examples.
demo/BERT bugfixes for Jetson Xavier
Updated build Dockerfile to cuda-11.1
Updated ClangFormat style specification according to TensorRT coding guidelines

Removed

N/A

7.2.1 - 2020-10-20

Added

Polygraphy v0.20.13 - Deep Learning Inference Prototyping and Debugging Toolkit
PyTorch-Quantization Toolkit v2.0.0
Updated BERT plugins for variable sequence length inputs
- Optimized kernels for sequence lengths of 64 and 96 added
Added Tacotron2 + Waveglow TTS demo #677
Re-enable GridAnchorRect_TRT plugin with rectangular feature maps #679
Update batchedNMS plugin to IPluginV2DynamicExt interface #738
Support 3D inputs in InstanceNormalization plugin #745
Added this CHANGELOG.md

Changed

ONNX GraphSurgeon - v0.2.7 with bugfixes, new examples.
demo/BERT bugfixes for Jetson Xavier
Updated build Dockerfile to cuda-11.1
Updated ClangFormat style specification according to TensorRT coding guidelines

Removed

N/A