Skip to content
View echobinarybytes's full-sized avatar

Block or report echobinarybytes

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 331 30 Updated Oct 18, 2024

Web3脚本交互(撸毛)极简入门指南

Python 137 30 Updated Nov 1, 2022

ROCm Communication Collectives Library (RCCL)

C++ 279 125 Updated Dec 17, 2024

Boost.org asio module

C++ 1,261 418 Updated Dec 11, 2024

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

59,496 6,088 Updated Dec 14, 2024

:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! 🎓

71,979 8,902 Updated Jul 29, 2024

Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.

Cuda 56 11 Updated Dec 8, 2023

Windows Calculator: A simple yet powerful calculator that ships with Windows

C++ 29,903 5,446 Updated Dec 3, 2024

NCCL Tests

Cuda 934 248 Updated Nov 1, 2024

A modern formatting library

C++ 20,989 2,531 Updated Dec 11, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,956 335 Updated Jul 31, 2024

Inference Llama 2 in one file of pure C

C 17,594 2,104 Updated Aug 6, 2024

a light-weighted, integrated trading/backtesting system/platform(综合量化交易回测系统/平台)

C++ 544 165 Updated Oct 25, 2022

Tensor library for machine learning

C++ 11,376 1,062 Updated Dec 17, 2024

LLM inference in C/C++

C++ 69,363 9,992 Updated Dec 17, 2024

Open MPI main development repository

C 2,203 867 Updated Dec 17, 2024

Optimized primitives for collective multi-GPU communication

C++ 3,307 836 Updated Sep 17, 2024

A baseline repository of Auto-Parallelism in Training Neural Networks

Python 142 20 Updated Jun 25, 2022

A curated list of awesome projects and papers for distributed training or inference

202 25 Updated Oct 8, 2024
C 268 111 Updated Sep 11, 2017

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 279,501 46,777 Updated Dec 2, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,899 4,168 Updated Dec 17, 2024

Intel® Data Mover Library (Intel® DML)

C++ 89 17 Updated Sep 26, 2024

A massively spiffy yet delicately unobtrusive compression library.

C 5,813 2,478 Updated Nov 9, 2024

A lightweight DNS-over-HTTPS proxy.

C 801 119 Updated Nov 18, 2024

📡 PoC auto collect from GitHub. ⚠️ Be careful Malware.

6,595 1,204 Updated Dec 17, 2024

UNIX-like reverse engineering framework and command-line toolset

C 20,882 3,019 Updated Dec 17, 2024

SAFE: Self-Attentive Function Embeddings for binary similarity

Python 174 40 Updated Jul 17, 2023

Quake III Arena GPL Source Release

C 7,146 1,912 Updated Aug 2, 2024
Next