yizhang2077

yizhang2077

10 followers · 32 following

Achievements

Starred repositories

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,235 552 Updated Oct 28, 2024

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 178 6 Updated Dec 12, 2024

BBuf / how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Cuda 1,718 141 Updated Dec 12, 2024

vietnh1009 / ASCII-generator

ASCII generator (image to text, image to image, video to video)

Python 7,422 570 Updated Nov 22, 2024

BerriAI / litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 15,046 1,763 Updated Dec 12, 2024

Zefan-Cai / Awesome-LLM-KV-Cache

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

135 7 Updated Dec 7, 2024

karpathy / llama2.c

Inference Llama 2 in one file of pure C

C 17,561 2,104 Updated Aug 6, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 24,696 2,799 Updated Oct 2, 2024

DefTruth / CUDA-Learn-Notes

📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attention-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS 🎉🎉).

Cuda 1,623 173 Updated Dec 12, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

2,983 202 Updated Dec 9, 2024

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,903 4,351 Updated Dec 10, 2024

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 9,683 916 Updated Dec 8, 2024

ifromeast / cuda_learning

learning how CUDA works

Cuda 171 23 Updated Aug 16, 2024

ggerganov / llama.cpp

LLM inference in C/C++

C++ 69,154 9,936 Updated Dec 12, 2024

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 6,496 575 Updated Dec 12, 2024

BigBrotherTrade / trader

交易模块

Python 4,191 916 Updated May 13, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,835 6,049 Updated Dec 9, 2024

UFund-Me / Qbot

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 8,253 1,154 Updated Nov 9, 2024