averbukh

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,527 5,281 Updated Jan 23, 2025

Dynamic Instrumentation Tool Platform

C 2,721 570 Updated Jan 23, 2025

Portable C and C++ Development Kit for x64 (and x86) Windows

C 3,335 233 Updated Jan 20, 2025

Design patterns implemented in Java

Java 90,552 26,765 Updated Jan 16, 2025
Python 129 33 Updated Feb 22, 2024

Model interpretability and understanding for PyTorch

Python 5,046 507 Updated Jan 22, 2025

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Python 1,415 234 Updated Jan 3, 2025

🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

C++ 46,704 2,027 Updated Jan 22, 2025

code and discussion of a counter_based_engine for the C++ standard

C++ 3 Updated Feb 9, 2024

3D Maps for OpenSceneGraph / C++14

C++ 1,545 790 Updated Jan 22, 2025
Makefile 1 Updated Jun 25, 2024

Example models using DeepSpeed

Python 6,229 1,061 Updated Jan 21, 2025

C++ standards drafts

TeX 5,751 761 Updated Jan 22, 2025
C 12 1 Updated Apr 15, 2024
Python 17 4 Updated Jul 7, 2024

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,313 1,031 Updated Sep 26, 2024

NCCL Tests

Cuda 973 255 Updated Dec 30, 2024

Inference Llama 2 in one file of pure C

C 17,891 2,171 Updated Aug 6, 2024

LLM training in simple, raw C/CUDA

Cuda 25,110 2,868 Updated Oct 2, 2024

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

C++ 307 123 Updated Nov 24, 2024

Automating Web Performance testing with Puppeteer 🎪

JavaScript 1,821 90 Updated Jan 18, 2023

Convert C++ header files to PlantUML

Python 228 36 Updated Dec 2, 2024

MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.

C++ 169 93 Updated Jan 22, 2025

Implementation of C++ standard libraries in C

C 1,170 70 Updated Dec 15, 2024

🔥 A Complete List of GitHub Profile Badges and Achievements 🔥

Markdown 1,933 193 Updated Aug 9, 2024

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)

Python 80 12 Updated Jul 14, 2023

Machine Learning Resources, Practice and Research

Python 3,706 1,388 Updated Jun 26, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,331 4,205 Updated Jan 22, 2025

Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch

Python 292 18 Updated Jan 23, 2025

Inference code for Llama models

Python 57,292 9,665 Updated Aug 18, 2024