Lists (5)
Sort Name ascending (A-Z)
Starred repositories
A library for efficient similarity search and clustering of dense vectors.
Development repository for the Triton language and compiler
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Optimized primitives for collective multi-GPU communication
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
Quantitative finance example applications on GPUs using portable programming models.
C++ HPC Tutorial materials