Stars
A modern model graph visualizer and debugger
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
depyf is a tool to help you understand and adapt to the PyTorch compiler, torch.compile.
An open-source tool-augmented conversational language model from Fudan University
Fast and memory-efficient exact attention
Development repository for the Triton language and compiler
LLaMA/RWKV ONNX models, quantization, and test cases
Universal LLM Deployment Engine with ML Compilation
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models
Running large language models on a single GPU for throughput-oriented scenarios.
Auction House addon for Classic (1.13). IMPORTANT: The folder name must be "aux-addon". IMPORTANT: The Vanilla (1.12) version moved to https://github.com/shirsig/aux-addon-vanilla
Ongoing research training transformer models at scale
Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
"Multi-Level Intermediate Representation" Compiler Infrastructure
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A few Windows-specific scripts for PyTorch
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
Tensors and dynamic neural networks in Python with strong GPU acceleration
Open standard for machine learning interoperability
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit