Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '23)

Python 15 3 Updated Sep 21, 2023

Ongoing research training transformer models at scale

Python 11,009 2,459 Updated Jan 5, 2025

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,183 27,459 Updated Jan 5, 2025

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 439 59 Updated Jan 2, 2025

[ICML 2022] Channel Importance Matters in Few-shot Image Classification

Python 55 9 Updated Apr 19, 2023

Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.

Jupyter Notebook 29 5 Updated Jan 14, 2021

Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599

Jupyter Notebook 56 11 Updated Oct 25, 2018

Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with th…

Python 23 8 Updated Dec 10, 2022

A repository in preparation for open-sourcing lottery ticket hypothesis code.

Python 625 113 Updated Sep 6, 2022

Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019

Python 11 2 Updated Dec 8, 2022

Code for reproducing experiments performed for Accordion

Python 13 3 Updated Jun 11, 2021

Understanding Top-k Sparsification in Distributed Deep Learning

Python 22 9 Updated Nov 15, 2019

PyTorch implementation of CNN networks

Python 1,947 486 Updated Nov 13, 2023

Practice on CIFAR-100 (ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext, ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…

Python 4,358 1,183 Updated Jul 15, 2024

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 144 33 Updated Oct 29, 2024

Rethinking gradient sparsification as total error minimization

Jupyter Notebook 3 2 Updated Dec 8, 2021

SIDCo is an efficient statistical-based gradient compression technique for distributed training systems

Python 9 3 Updated Jun 6, 2021

A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.

Python 130 35 Updated Feb 21, 2022

Pressio is Latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy compressors and their configurations. It solves the problem…

C++ 16 4 Updated Dec 30, 2024

A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

Python 4,488 2,315 Updated Jan 5, 2025

Error-bounded Lossy Data Compressor (for floating-point/integer datasets)

C 157 55 Updated Apr 6, 2024

A distributed SGD algorithm for Matrix Factorization using PySpark

Python 8 1 Updated Apr 19, 2015

Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.

Python 57 15 Updated Jan 7, 2021

Network-Accelerated Distributed Deep Learning

10 3 Updated Sep 13, 2021

A GPU-accelerated error-bounded lossy compressor for scientific data.

Cuda 3 1 Updated Feb 23, 2022

A high performance and generic framework for distributed DNN training

Python 3,648 491 Updated Oct 3, 2023

Multi Model Server is a tool for serving neural net models for inference

Java 1,001 231 Updated May 20, 2024

Switch ML Application

C++ 175 48 Updated Jul 15, 2022

SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847

Jupyter Notebook 30 10 Updated Jul 25, 2024

Code for the signSGD paper

Jupyter Notebook 81 15 Updated Jan 12, 2021