Skip to content
View anxuthu's full-sized avatar

Block or report anxuthu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 542 34 Updated Oct 28, 2023

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 457 28 Updated Feb 5, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,470 721 Updated Dec 17, 2024

A multi-programming language benchmark for LLMs

Python 235 40 Updated Jan 23, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,876 4,056 Updated Jul 17, 2024
Python 1,458 109 Updated May 12, 2023

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,614 374 Updated Jan 17, 2025

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 53,333 6,972 Updated Nov 17, 2024

A playbook for systematically maximizing the performance of deep learning models.

28,107 2,316 Updated Jun 18, 2024
Python 587 74 Updated Oct 26, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,925 3,557 Updated Jun 2, 2023

An implementation of a deep learning recommendation model (DLRM)

Python 3,839 852 Updated Oct 11, 2024

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook 1,060 110 Updated Aug 9, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,330 4,290 Updated Mar 11, 2025

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,914 799 Updated Dec 19, 2024
Python 6,396 1,885 Updated Feb 25, 2025

Code repo for "Language Models with Transformers" paper

Python 21 13 Updated Sep 18, 2020
Python 45 8 Updated Oct 27, 2019

Transformer implementation in PyTorch.

Python 478 108 Updated Mar 7, 2019

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,767 237 Updated Mar 11, 2025

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python 3 1 Updated Jul 31, 2020

Fair Resource Allocation in Federated Learning (ICLR '20)

Python 247 60 Updated Dec 2, 2023

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python 66 17 Updated Sep 10, 2020

Synchronized Batch Normalization implementation in PyTorch.

Python 1,499 188 Updated Apr 8, 2021

Simple Hierarchical Count Sketch in Python

Python 20 9 Updated Jun 3, 2021

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python 75 20 Updated Aug 30, 2021

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python 10 1 Updated Aug 28, 2019

Code for Federated Learning with Matched Averaging, ICLR 2020.

Python 335 83 Updated Dec 5, 2021
Next