-
快手
- Beijing, China.
Highlights
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedDec 30, 2024 -
distiller Public
Forked from IntelLabs/distillerNeural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Jupyter Notebook Apache License 2.0 UpdatedApr 24, 2023 -
jetson-inference Public
Forked from dusty-nv/jetson-inferenceHello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
C++ MIT License UpdatedSep 3, 2021 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedMay 6, 2021 -
custom-op Public template
Forked from tensorflow/custom-opGuide for building custom op for TensorFlow
Smarty Apache License 2.0 UpdatedMar 18, 2021 -
CLIP Public
Forked from openai/CLIPContrastive Language-Image Pretraining
Jupyter Notebook MIT License UpdatedJan 27, 2021 -
transformer Public
Forked from Kyubyong/transformerA TensorFlow Implementation of the Transformer: Attention Is All You Need
Python Apache License 2.0 UpdatedFeb 14, 2020