TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,699 1,149 Updated Mar 13, 2025

Tlntin / Qwen-TensorRT-LLM

Python 602 55 Updated Jul 31, 2024

tomaarsen / attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 690 41 Updated Apr 10, 2024

krahets / hello-algo

《Hello 算法》：动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新，English version ongoing

Java 110,059 13,697 Updated Mar 11, 2025

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,384 1,123 Updated Nov 14, 2024

lllyasviel / Fooocus

Focus on prompting and generating

Python 43,763 6,613 Updated Jan 24, 2025

genggui001 / Megatron-DeepSpeed-Llama

Python 84 13 Updated Sep 9, 2023

THUDM / ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,754 1,853 Updated Jun 27, 2024

soulteary / docker-prompt-generator

Using a Model to generate prompts for Model applications. / 使用模型来生成作图咒语的偷懒工具，支持 MidJourney、Stable Diffusion 等。

Python 1,173 113 Updated Apr 5, 2023

CVI-SZU / Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集

Python 3,048 234 Updated Apr 14, 2024

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,765 263 Updated Mar 13, 2025

Stability-AI / StableLM

StableLM: Stability AI Language Models

Jupyter Notebook 15,838 1,033 Updated Apr 8, 2024

xionghonglin / DoctorGLM

基于ChatGLM-6B的中文问诊模型

Python 806 84 Updated Oct 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Biały Wilk white-wolf-tech

Achievements

Achievements

Block or report white-wolf-tech

Stars

deepseek-ai / DualPipe

deepseek-ai / FlashMLA

yule-BUAA / MergeLM

DefTruth / Awesome-LLM-Inference

karpathy / LLM101n

flashinfer-ai / flashinfer

karpathy / llm.c

PKU-YuanGroup / Open-Sora-Plan

hiyouga / LLaMA-Factory

eric-mitchell / direct-preference-optimization

lyogavin / airllm

lizongying / my-tv

sgl-project / sglang

e2b-dev / awesome-ai-agents

state-spaces / mamba

hao-ai-lab / LookaheadDecoding

QwenLM / Qwen-Agent

NVIDIA / TensorRT-LLM