Stars
HunyuanVideo: A Systematic Framework For Large Video Generation Model
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
A list of awesome compiler projects and papers for tensor computation and deep learning.
jlxue / antares
Forked from microsoft/antares
Antares: an automatic engine for multi-platform kernel generation and optimization. Supporting CPU, CUDA, ROCm, DirectX12 and GraphCore platforms.
jlxue / AI-System
Forked from microsoft/AI-System
System for AI Education Resource.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.
Microsoft SEAL is an easy-to-use and powerful homomorphic encryption library.
A Platform for Secure Analytics and Machine Learning
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
jlxue / nnfusion
Forked from microsoft/nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
The new Windows Terminal and the original Windows console host, all in the same place!
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
Efficiently computes derivatives of NumPy code.
Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit
A universal deep learning model converter
Benchmarking Deep Learning operations on different hardware