Skip to content
View lauthu's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 快手
  • Beijing, China.

Block or report lauthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

    C++ Apache License 2.0 Updated Dec 30, 2024
  • distiller Public

    Forked from IntelLabs/distiller

    Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

    Jupyter Notebook Apache License 2.0 Updated Apr 24, 2023
  • Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

    C++ MIT License Updated Sep 3, 2021
  • Transformer related optimization, including BERT, GPT

    C++ Apache License 2.0 Updated May 6, 2021
  • custom-op Public template

    Forked from tensorflow/custom-op

    Guide for building custom op for TensorFlow

    Smarty Apache License 2.0 Updated Mar 18, 2021
  • CLIP Public

    Forked from openai/CLIP

    Contrastive Language-Image Pretraining

    Jupyter Notebook MIT License Updated Jan 27, 2021
  • A TensorFlow Implementation of the Transformer: Attention Is All You Need

    Python Apache License 2.0 Updated Feb 14, 2020