Skip to content
View callmejacksong's full-sized avatar

Block or report callmejacksong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,451 168 Updated Jun 25, 2024

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Python 714 55 Updated Nov 25, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,625 1,130 Updated Mar 7, 2025

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 14,874 1,585 Updated Mar 6, 2025

Awesome-LLM: a curated list of Large Language Model

21,894 1,790 Updated Mar 4, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,516 6,097 Updated Mar 7, 2025

mallchat的前端项目,是一个既能购物又能聊天的电商系统。以互联网企业级开发规范的要求来实现它,电商该有的购物车,订单,支付,推荐,搜索,拉新,促活,推送,物流,客服,它都必须有。持续更新ing

TypeScript 1,140 500 Updated Jan 14, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,434 1,045 Updated Mar 6, 2025

Examples and guides for using the OpenAI API

MDX 62,122 10,021 Updated Mar 4, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,346 5,174 Updated Oct 10, 2024

Prompt-to-prompt extention of Stable Diffusion web UI

Python 14 Updated Jan 30, 2023
Jupyter Notebook 3,238 308 Updated May 14, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,238 4,281 Updated Mar 6, 2025

ChatGPT 中文指南🔥,ChatGPT 中文调教指南,指令指南,应用开发指南,精选资源清单,更好的使用 chatGPT 让你的生产力 up up up! 🚀

Python 11,030 914 Updated Nov 5, 2024

A list of totally open alternatives to ChatGPT

4,595 201 Updated May 3, 2023

Deep Learning Examples

Jupyter Notebook 817 105 Updated Oct 18, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,993 5,986 Updated Aug 24, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,370 838 Updated Mar 6, 2025

A simple C++11 Thread Pool implementation

C++ 8,159 2,287 Updated Jul 20, 2024

data sdk for baidu Index

Python 766 229 Updated Apr 12, 2023

Real-time monitor and web admin for Celery distributed task queue

Python 6,630 1,105 Updated Sep 1, 2024