Skip to content
View jerrycxj's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report jerrycxj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

57 repositories

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,893 6,538 Updated Dec 9, 2024

A playbook for systematically maximizing the performance of deep learning models.

28,094 2,314 Updated Jun 18, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,519 2,792 Updated Aug 15, 2024

⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.

Python 4,214 442 Updated Jan 5, 2023

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,880 318 Updated Jun 12, 2024

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,322 431 Updated Apr 24, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,253 4,282 Updated Mar 6, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 67,801 8,317 Updated Mar 4, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,242 397 Updated Mar 7, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,026 5,236 Updated Jun 27, 2024

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,977 5,772 Updated Nov 29, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,680 605 Updated Jul 25, 2023

LLM inference in C/C++

C++ 76,026 10,994 Updated Mar 7, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 39,686 5,652 Updated Mar 7, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,444 718 Updated Dec 17, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,787 28,210 Updated Mar 7, 2025

Inference code for Llama models

Python 57,812 9,716 Updated Jan 26, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 77,666 9,298 Updated Jan 4, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,266 560 Updated Oct 28, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 2,096 278 Updated Mar 6, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,045 4,648 Updated Mar 1, 2025

Supercharge Your Model Training

Python 5,303 435 Updated Mar 7, 2025

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 7 2 Updated Jul 29, 2022

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,152 419 Updated Nov 14, 2024

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,923 546 Updated Jun 11, 2024

Fast and memory-efficient exact attention

Python 16,139 1,528 Updated Mar 7, 2025

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,293 140 Updated Mar 7, 2025

Tensor library for machine learning

C++ 12,037 1,158 Updated Mar 7, 2025

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 53,277 6,961 Updated Nov 17, 2024

Efficient Inference for Big Models

Python 579 66 Updated Jan 24, 2023