Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,378 166 Updated Jun 25, 2024

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Python 711 54 Updated Nov 25, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,165 1,070 Updated Jan 16, 2025

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 14,021 1,510 Updated Jan 16, 2025

Awesome-LLM: a curated list of Large Language Models

20,668 1,688 Updated Jan 13, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,950 5,214 Updated Jan 18, 2025

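The front-end project for mallchat, an e-commerce system that supports both shopping and chat. It is built to enterprise-grade internet development standards and must include everything an e-commerce platform needs: shopping cart, orders, payment, recommendations, search, user acquisition, user engagement, push notifications, logistics, and customer service. Continuously updated.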

TypeScript 1,116 489 Updated Jan 14, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,184 1,013 Updated Jan 17, 2025

Examples and guides for using the OpenAI API

MDX 61,201 9,800 Updated Jan 17, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,796 5,116 Updated Oct 10, 2024

Prompt-to-prompt extension of Stable Diffusion web UI

Python 14 Updated Jan 30, 2023
Jupyter Notebook 3,207 302 Updated May 14, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,274 4,201 Updated Jan 18, 2025

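ChatGPT Chinese guide 🔥: a Chinese-language guide to ChatGPT prompt tuning, prompts, and application development, plus a curated resource list. Use ChatGPT better and boost your productivity! 🚀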

Python 10,977 907 Updated Nov 5, 2024

A list of totally open alternatives to ChatGPT

4,572 200 Updated May 3, 2023

Deep Learning Examples

Jupyter Notebook 814 107 Updated Oct 18, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,955 5,911 Updated Aug 24, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,323 830 Updated Jan 8, 2025

A simple C++11 Thread Pool implementation

C++ 8,073 2,264 Updated Jul 20, 2024

Data SDK for Baidu Index

Python 758 229 Updated Apr 12, 2023

Real-time monitor and web admin for Celery distributed task queue

Python 6,569 1,103 Updated Sep 1, 2024