C++ 243 39 Updated Sep 15, 2023

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 696 92 Updated Jan 26, 2023

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,760 236 Updated Feb 22, 2025

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,730 1,025 Updated Feb 23, 2025

Puck is a high-performance ANN search engine

Jupyter Notebook 346 39 Updated Nov 21, 2024

A Python package that extends the official PyTorch to easily obtain better performance on Intel platforms

Python 1,747 262 Updated Feb 21, 2025

A community-maintained Python framework for creating mathematical animations.

Python 30,071 2,105 Updated Feb 11, 2025

A unified, comprehensive and efficient recommendation library

Python 3,599 631 Updated Feb 23, 2025

Set of datasets for the deep learning recommendation model (DLRM).

41 15 Updated Dec 21, 2022

Inference Llama 2 in one file of pure C

C 18,065 2,200 Updated Aug 6, 2024

Large Language Model for Generative Recommendation

Python 63 7 Updated Mar 12, 2024

A curated list of Generative Recommender Systems (Paper & Code)

388 40 Updated Apr 1, 2024

Large Language Model-enhanced Recommender System Papers

638 53 Updated Feb 14, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 38,984 5,834 Updated Feb 23, 2025

Large Context Attention

Python 684 53 Updated Jan 24, 2025

The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.

C++ 58 7 Updated Feb 22, 2025

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,341 732 Updated Aug 5, 2024

RAPIDS Memory Manager

C++ 535 208 Updated Feb 23, 2025

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

C++ 86 17 Updated Nov 23, 2022

PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity

Cuda 104 27 Updated Feb 17, 2025

[MLSys'22] Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective

Python 18 5 Updated Sep 11, 2023

CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)

Python 1,754 314 Updated Feb 1, 2024

A schedule language for large model training

Python 144 16 Updated Jun 18, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,106 1,046 Updated Feb 18, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,992 349 Updated Feb 7, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,025 5,235 Updated Jun 27, 2024

FLOPs counter for convolutional networks in the PyTorch framework

Python 2,861 308 Updated Jan 20, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,901 4,634 Updated Feb 21, 2025