Stars
Flops counter for convolutional networks in pytorch framework
Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda
Common source, scripts and utilities for creating Triton backends.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
All image quality metrics you need in one package.
📊 A simple command-line utility for querying and monitoring GPU status
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Video classification tools using 3D ResNet
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
A collection of pre-trained, state-of-the-art models in the ONNX format
100-Days-Of-ML-Code中文版
WebUI extension for ControlNet
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
TensorRT Extension for Stable Diffusion Web UI
Lightning fast C++/CUDA neural network framework
Brevitas: neural network quantization in PyTorch
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
Cross-platform, customizable ML solutions for live and streaming media.
A simple tool that can generate TensorRT plugin code quickly.
A simple C++11 Thread Pool implementation
C++/CUDA/Python multimedia utilities for NVIDIA Jetson