OctoML
Riverside
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python · Apache License 2.0 · Updated Jan 27, 2025
flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda · Apache License 2.0 · Updated Oct 2, 2024
mlc-llm Public
Forked from mlc-ai/mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python · Apache License 2.0 · Updated Jun 14, 2024
tvm Public
Forked from apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Python · Apache License 2.0 · Updated May 8, 2024
FastChat Public
Forked from lm-sys/FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
Python · Apache License 2.0 · Updated Apr 18, 2024
libflash_attn Public
Forked from tlc-pack/libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
C++ · BSD 3-Clause "New" or "Revised" License · Updated Apr 5, 2024
whisperX Public
Forked from m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Python · BSD 4-Clause "Original" or "Old" License · Updated Mar 1, 2024
CTranslate2 Public
Forked from OpenNMT/CTranslate2
Fast inference engine for Transformer models
C++ · MIT License · Updated Mar 1, 2024
faster-whisper Public
Forked from SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Python · MIT License · Updated Mar 1, 2024
whisper-jax Public
Forked from sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Jupyter Notebook · Apache License 2.0 · Updated Mar 1, 2024
web-llm Public
Forked from mlc-ai/web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Python · Apache License 2.0 · Updated May 12, 2023
relax Public
Forked from tlc-pack/relax
Temp repo for prototyping relax (relay next); the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
Python · Apache License 2.0 · Updated Feb 2, 2023
chocopy-wasm-compiler-B Public archive
Forked from ucsd-cse231-s22/chocopy-wasm-compiler-B
TypeScript · Other · Updated Jun 2, 2022
models Public
Forked from tensorflow/models
Models and examples built with TensorFlow
Python · Apache License 2.0 · Updated Dec 19, 2021