Skip to content
View haolin-nju's full-sized avatar

Block or report haolin-nju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient and easy multi-instance LLM serving

Python 260 17 Updated Jan 3, 2025

Toy Turing Machine for Introduction to Computing Theory of NJU CS. 南京大学计算机系研究生课程《计算理论导引》玩具图灵机

Python 4 Updated Jun 10, 2022

南京大学学位论文模板

TeX 484 67 Updated Nov 8, 2024

A flexible and efficient training framework for large-scale alignment tasks

Python 263 20 Updated Jan 3, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,151 1,223 Updated Dec 12, 2024

A Massively Parallel Large Scale Self-Play Framework

Python 320 32 Updated Jan 9, 2023

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Python 265 49 Updated Mar 31, 2023

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,743 233 Updated Jan 2, 2025

Training and serving large-scale neural networks with auto parallelization.

Python 3,093 360 Updated Dec 9, 2023

A tool for extracting plain text from Wikipedia dumps

Python 3,781 966 Updated May 23, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,098 4,180 Updated Jan 3, 2025

Optimized primitives for collective multi-GPU communication

C++ 3,346 843 Updated Sep 17, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,240 304 Updated Dec 30, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,738 3,411 Updated Dec 21, 2024

😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)

Python 24,133 2,353 Updated Jan 3, 2025

Lingvo

Python 2,822 447 Updated Dec 23, 2024

A GPipe implementation in PyTorch

Python 820 100 Updated Jul 25, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,325 2,242 Updated Dec 12, 2024

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 22,388 5,640 Updated Jan 3, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,492 23,017 Updated Jan 3, 2025
Python 385 117 Updated Nov 4, 2022

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

178,360 51,171 Updated Aug 21, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 34,720 5,898 Updated Jan 3, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,453 306 Updated Oct 19, 2024

A high performance and generic framework for distributed DNN training

Python 3,646 491 Updated Oct 3, 2023

PyTorch tutorials.

Jupyter Notebook 8,313 4,085 Updated Jan 3, 2025

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

2,729 314 Updated Aug 14, 2024

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 64,801 11,155 Updated Jul 30, 2024

The official GitHub mirror of the Chromium source

C++ 19,570 7,177 Updated Jan 3, 2025
Next