caiyueliang

  • DeepSpeed Public

    Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python Apache License 2.0 Updated Nov 7, 2024
  • sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python Apache License 2.0 Updated Aug 26, 2024
  • PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

    Python Apache License 2.0 Updated Jul 10, 2024
  • 🦜🔗 Build context-aware reasoning applications

    Python MIT License Updated Jun 14, 2024
  • LLaVA Public

    Forked from haotian-liu/LLaVA

    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

    Python Apache License 2.0 Updated Mar 26, 2024
  • TensorRT_1 Public

    Forked from rajeevsrao/TensorRT

    TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

    C++ Apache License 2.0 Updated Mar 14, 2024
  • Qwen Public

    Forked from QwenLM/Qwen

    The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

    Python Apache License 2.0 Updated Feb 28, 2024
  • diffusers Public

    Forked from huggingface/diffusers

    🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

    Python Apache License 2.0 Updated Feb 26, 2024
  • An easy-to-use PyTorch to TensorRT converter

    Python MIT License Updated Feb 21, 2024
  • This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

    Cuda Apache License 2.0 Updated Feb 6, 2024
  • Transformer related optimization, including BERT, GPT

    C++ Apache License 2.0 Updated Jan 15, 2024
  • The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

    Python MIT License Updated Nov 6, 2023
  • vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python Apache License 2.0 Updated Oct 8, 2023
  • triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++ MIT License Updated Sep 12, 2023
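  • Cloud-native one-stop machine learning platform: multi-tenancy, data assets, online notebook development, drag-and-drop task-flow orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving, multi-cluster scheduling, multiple project groups and resource groups, edge computing, real-time large-model training, AI app store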

    Jupyter Notebook Other Updated Aug 15, 2023
  • Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

    C++ BSD 3-Clause "New" or "Revised" License Updated Apr 28, 2023
  • A collection of diffusion models based on MindSpore

    Python Apache License 2.0 Updated Apr 13, 2023
  • wenet Public

    Forked from wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    C++ Apache License 2.0 Updated Apr 10, 2023
  • The Triton Inference Server provides an optimized cloud and edge inferencing solution.

    Python BSD 3-Clause "New" or "Revised" License Updated Mar 29, 2023
  • A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

    Python BSD 3-Clause "New" or "Revised" License Updated Feb 14, 2023
  • http://www.facegood.cc

    Python MIT License Updated Feb 8, 2023
  • stock Public

    Forked from pythonstock/stock

    stock, a stock system developed in Python.

    Python Apache License 2.0 Updated Aug 18, 2022
  • A Deep Learning Recommender System

    Python Apache License 2.0 Updated Jul 25, 2022
  • nni Public

    Forked from microsoft/nni

    An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

    Python MIT License Updated Jan 6, 2022
  • Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer; based on PyTorch, ready to use out of the box.

    Python MIT License Updated Oct 20, 2021
  • grpc Public

    Forked from grpc/grpc

    The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

    C++ Apache License 2.0 Updated Jun 24, 2021
  • bloaty Public

    Forked from google/bloaty

    Bloaty McBloatface: a size profiler for binaries

    C++ Apache License 2.0 Updated Jun 24, 2021
  • Python server for communicating with Kaldi from the browser using WebRTC

    Python Apache License 2.0 Updated May 17, 2021
  • web_frame Public

    Vue MIT License Updated Oct 9, 2020
  • kaldi_demo Public

    Shell Updated Mar 11, 2020