Starred repositories

CUDA/Metal accelerated language model inference

C 466 18 Updated Dec 18, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,099 323 Updated Dec 16, 2024

Train transformer language models with reinforcement learning.

Python 10,404 1,343 Updated Dec 23, 2024

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 393 30 Updated Dec 16, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,641 596 Updated Dec 24, 2024

A collection of awesome text-to-image generation studies.

TeX 470 26 Updated Dec 20, 2024

Native Operating System and Hardware Information

Java 4,802 880 Updated Dec 23, 2024

Cross-platform C++ library for hardware information (CPU, RAM, GPU, ...)

C++ 494 91 Updated Oct 15, 2024

[CVPR 2023] DepGraph: Towards Any Structural Pruning

Python 2,794 338 Updated Dec 21, 2024

A curated list of awesome edge computing, including Frameworks, Simulators, Tools, etc.

406 70 Updated Sep 24, 2024

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,578 491 Updated Dec 15, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,789 5,510 Updated Dec 24, 2024

Repo for external large-scale work

Python 6,520 729 Updated Apr 27, 2024

Evaluating LLMs with fewer examples

Jupyter Notebook 140 15 Updated Apr 12, 2024

Python 102 1 Updated Sep 24, 2024

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

13,938 1,406 Updated Feb 13, 2023

Tips for Writing a Research Paper using LaTeX

TeX 3,268 377 Updated May 4, 2023

A framework for few-shot evaluation of language models.

Python 7,285 1,966 Updated Dec 23, 2024

nnScaler: Compiling DNN models for Parallel Training

Python 81 12 Updated Dec 10, 2024

Open source code in the field of semantic communication.

48 6 Updated Nov 28, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,089 997 Updated Dec 13, 2024

TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices

Python 167 12 Updated Nov 15, 2024

Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.

262 11 Updated Dec 23, 2024

👫 A curated list of Model Merging methods.

86 4 Updated Sep 16, 2024

Download the latest stable Synergy binaries.

Python 1,243 117 Updated Nov 1, 2024

Awesome LLMs on Device: A Comprehensive Survey

1,018 127 Updated Oct 8, 2024

A curated list of research in Systems for Edge Intelligence and Computing (Edge MLSys), including Frameworks, Tools, Repositories, etc. Paper notes are also provided.

25 2 Updated Jan 4, 2022

Awesome list for LLM pruning.

185 8 Updated Dec 15, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,444 100 Updated Aug 7, 2024

Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!

Python 38,498 7,329 Updated Nov 27, 2022