LeshengJin
🏠
Working from home

Highlights

  • Pro

  • sglang Public

    Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    Python Apache License 2.0 Updated Jan 27, 2025
  • FlashInfer: Kernel Library for LLM Serving

    Cuda Apache License 2.0 Updated Oct 2, 2024
  • mlc-llm Public

    Forked from mlc-ai/mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

    Python Apache License 2.0 Updated Jun 14, 2024
  • tvm Public

    Forked from apache/tvm

    Open deep learning compiler stack for cpu, gpu and specialized accelerators

    Python Apache License 2.0 Updated May 8, 2024
  • rocm_test Public

    Python Updated Apr 22, 2024
  • FastChat Public

    Forked from lm-sys/FastChat

    The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"

    Python Apache License 2.0 Updated Apr 18, 2024
  • Standalone Flash Attention v2 kernel without libtorch dependency

    C++ BSD 3-Clause "New" or "Revised" License Updated Apr 5, 2024
  • whisperX Public

    Forked from m-bain/whisperX

    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

    Python BSD 4-Clause "Original" or "Old" License Updated Mar 1, 2024
  • CTranslate2 Public

    Forked from OpenNMT/CTranslate2

    Fast inference engine for Transformer models

    C++ MIT License Updated Mar 1, 2024
  • Faster Whisper transcription with CTranslate2

    Python MIT License Updated Mar 1, 2024
  • JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

    Jupyter Notebook Apache License 2.0 Updated Mar 1, 2024
  • mlc-relax Public

    Forked from mlc-ai/relax

    Python Apache License 2.0 Updated Jan 14, 2024
  • Shell Updated Oct 31, 2023
  • web-llm Public

    Forked from mlc-ai/web-llm

    Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.

    Python Apache License 2.0 Updated May 12, 2023
  • Python Updated Apr 9, 2023
  • relax Public

    Forked from tlc-pack/relax

    Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.

    Python Apache License 2.0 Updated Feb 2, 2023
  • TypeScript Other Updated Jun 2, 2022
  • models Public

    Forked from tensorflow/models

    Models and examples built with TensorFlow

    Python Apache License 2.0 Updated Dec 19, 2021