Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with th…

Python 23 8 Updated Dec 10, 2022

facebookresearch / open_lth

A repository in preparation for open-sourcing lottery ticket hypothesis code.

Python 625 113 Updated Sep 6, 2022

kudkudak / dnn_sharpest_directions

Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019

Python 11 2 Updated Dec 8, 2022

uw-mad-dash / Accordion

Code for reproducing experiments performed for Accoridon

Python 13 3 Updated Jun 11, 2021

hclhkbu / GaussianK-SGD

Understanding Top-k Sparsification in Distributed Deep Learning

Python 22 9 Updated Nov 15, 2019

shanglianlm0525 / PyTorch-Networks

Pytorch implementation of cnn network

Python 1,947 486 Updated Nov 13, 2023

weiaicunzai / pytorch-cifar100

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…

Python 4,358 1,183 Updated Jul 15, 2024

epfml / powersgd

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

Python 144 33 Updated Oct 29, 2024

sands-lab / rethinking-sparsification

Rethinking gradient sparsification as total error minimization

Jupyter Notebook 3 2 Updated Dec 8, 2021

sands-lab / SIDCo

SIDCo is An Efficient Statistical-based Gradient Compression Technique for Distributed Training Systems

Python 9 3 Updated Jun 6, 2021

snuspl / parallax

A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.

Python 130 35 Updated Feb 21, 2022

CODARcode / libpressio

Pressio is latin for compression. Libpressio is a C++ library with C compatible bindings to abstract between different lossless and lossy compressors and their configurations. It solves the problem…

C++ 16 4 Updated Dec 30, 2024

spack / spack

A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

Python 4,488 2,315 Updated Jan 5, 2025

szcompressor / SZ

Error-bounded Lossy Data Compressor (for floating-point/integer datasets)

C 157 55 Updated Apr 6, 2024

junjiah / DSGD-MF

A distributed SGD algorithm for Matrix Factorization using PySpark

Python 8 1 Updated Apr 19, 2015

xinyandai / product-quantization

Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.

Python 57 15 Updated Jan 7, 2021

sands-lab / net-accel-dl-tutorial-sigcomm21

Network-Accelerated Distributed Deep Learning

10 3 Updated Sep 13, 2021

hpdps-group / cuSZ

Forked from szcompressor/cuSZ

A GPU accelerated error-bounded lossy compression for scientific data.

Cuda 3 1 Updated Feb 23, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mzq mzq308734881

Achievements

Achievements

Block or report mzq308734881

Stars

zhuangwang93 / Espresso

NVIDIA / Megatron-LM

huggingface / transformers

NVIDIA / DCGM

Frankluox / Channel_Importance_FSL

Tabrizian / learning-to-quantize

epfml / sparsifiedSGD

Shigangli / Ok-Topk