Skip to content
View white-wolf-tech's full-sized avatar
🏠
Working from wild world
🏠
Working from wild world
  • NULL
  • NULL

Block or report white-wolf-tech

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,591 259 Updated Mar 10, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,281 792 Updated Mar 1, 2025

Codebase for Merging Language Models (ICML 2024)

Python 800 48 Updated May 5, 2024

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,636 254 Updated Mar 4, 2025

LLM101n: Let's build a Storyteller

32,357 1,750 Updated Aug 1, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 2,373 247 Updated Mar 13, 2025

LLM training in simple, raw C/CUDA

Cuda 26,013 2,981 Updated Oct 2, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,923 1,057 Updated Mar 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 44,033 5,384 Updated Mar 13, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,440 202 Updated Aug 11, 2024

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 5,737 457 Updated Nov 24, 2024

我的电视 电视直播软件,安装即可使用

C 31,852 3,581 Updated Jun 20, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 11,863 1,228 Updated Mar 13, 2025

A list of AI autonomous agents

15,883 1,191 Updated Feb 26, 2025

Mamba SSM architecture

Python 14,219 1,238 Updated Jan 18, 2025

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,215 74 Updated Mar 6, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 6,161 554 Updated Mar 7, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,699 1,149 Updated Mar 13, 2025
Python 602 55 Updated Jul 31, 2024

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 690 41 Updated Apr 10, 2024

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 110,059 13,697 Updated Mar 11, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,384 1,123 Updated Nov 14, 2024

Focus on prompting and generating

Python 43,763 6,613 Updated Jan 24, 2025

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,754 1,853 Updated Jun 27, 2024

Using a Model to generate prompts for Model applications. / 使用模型来生成作图咒语的偷懒工具,支持 MidJourney、Stable Diffusion 等。

Python 1,173 113 Updated Apr 5, 2023

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Python 3,048 234 Updated Apr 14, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,765 263 Updated Mar 13, 2025

StableLM: Stability AI Language Models

Jupyter Notebook 15,838 1,033 Updated Apr 8, 2024

基于ChatGLM-6B的中文问诊模型

Python 806 84 Updated Oct 19, 2023
Next