Skip to content
View anxuthu's full-sized avatar

Block or report anxuthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
134 results for source starred repositories
Clear filter

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 542 34 Updated Oct 28, 2023

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 457 28 Updated Feb 5, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,473 722 Updated Dec 17, 2024

A multi-programming language benchmark for LLMs

Python 235 40 Updated Jan 23, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,873 4,056 Updated Jul 17, 2024
Python 1,459 110 Updated May 12, 2023

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,615 374 Updated Jan 17, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 53,339 6,978 Updated Nov 17, 2024

A playbook for systematically maximizing the performance of deep learning models.

28,109 2,317 Updated Jun 18, 2024
Python 586 74 Updated Oct 26, 2024

An implementation of a deep learning recommendation model (DLRM)

Python 3,839 852 Updated Oct 11, 2024

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook 1,061 110 Updated Aug 9, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,340 4,291 Updated Mar 11, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,915 799 Updated Dec 19, 2024
Python 6,400 1,886 Updated Feb 25, 2025
Python 45 8 Updated Oct 27, 2019

Transformer implementation in PyTorch.

Python 478 108 Updated Mar 7, 2019

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,768 237 Updated Mar 12, 2025

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python 3 1 Updated Jul 31, 2020

Fair Resource Allocation in Federated Learning (ICLR '20)

Python 247 60 Updated Dec 2, 2023

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python 66 17 Updated Sep 10, 2020

Synchronized Batch Normalization implementation in PyTorch.

Python 1,499 188 Updated Apr 8, 2021

Simple Hierarchical Count Sketch in Python

Python 20 9 Updated Jun 3, 2021

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python 75 20 Updated Aug 30, 2021

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python 10 1 Updated Aug 28, 2019

Communication-efficient decentralized SGD (Pytorch)

Python 23 10 Updated Mar 17, 2020

PyTorch Implementation of Momentum-Based Policy Gradient Methods

Python 8 2 Updated Aug 12, 2020

[ICLR 2020; IPDPS 2019] Fast and accurate minibatch training for deep GNNs and large graphs (GraphSAINT: Graph Sampling Based Inductive Learning Method).

Python 480 87 Updated Aug 12, 2022
Next