-
tensorrtllm_backend Public
Forked from triton-inference-server/tensorrtllm_backendThe Triton TensorRT-LLM Backend
Python Apache License 2.0 UpdatedFeb 21, 2024 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedJan 16, 2024 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
-
Megatron-DeepSpeed Public
Forked from microsoft/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedJan 16, 2023 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMay 12, 2022 -
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedSep 25, 2021 -
TensorRT Public
Forked from NVIDIA/TensorRTTensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
C++ Apache License 2.0 UpdatedJun 10, 2021 -
HugeCTR Public
Forked from NVIDIA-Merlin/HugeCTRHugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
C++ Apache License 2.0 UpdatedNov 24, 2020 -
-
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesDeep Learning Examples
-
-
bert Public
Forked from google-research/bertTensorFlow code and pre-trained models for BERT
-
-
caffe2 Public
Forked from facebookarchive/caffe2Caffe2 is a lightweight, modular, and scalable deep learning framework.
Jupyter Notebook Other UpdatedAug 2, 2017 -
-
-
-
-
-
-
-
-
-
-
-
-
-