anxuthu

5 followers · 14 following

Stars

134 starred source repositories

OpenLemur / Lemur

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python · 542 stars · 34 forks · Updated Oct 28, 2023

bigcode-project / octopack

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook · 457 stars · 28 forks · Updated Feb 5, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python · 11,473 stars · 722 forks · Updated Dec 17, 2024

nuprl / MultiPL-E

A multi-programming language benchmark for LLMs

Python · 235 stars · 40 forks · Updated Jan 23, 2025

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python · 29,873 stars · 4,056 forks · Updated Jul 17, 2024

sahil280114 / codealpaca

Python · 1,459 stars · 110 forks · Updated May 12, 2023

openai / human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Python · 2,615 stars · 374 forks · Updated Jan 17, 2025

AntonOsika / gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python · 53,339 stars · 6,978 forks · Updated Nov 17, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

28,109 stars · 2,317 forks · Updated Jun 18, 2024

google-research / sam

Python · 586 stars · 74 forks · Updated Oct 26, 2024

Qualcomm-AI-research / oscillations-qat

Python · 75 stars · 10 forks · Updated Jul 21, 2022

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python · 3,839 stars · 852 forks · Updated Oct 11, 2024

juntang-zhuang / Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook · 1,061 stars · 110 forks · Updated Aug 9, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python · 37,340 stars · 4,291 forks · Updated Mar 11, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python · 4,915 stars · 799 forks · Updated Dec 19, 2024

MIC-DKFZ / nnUNet

Python · 6,400 stars · 1,886 forks · Updated Feb 25, 2025

briancheung / superposition

Python · 45 stars · 8 forks · Updated Oct 27, 2019

tunz / transformer-pytorch

Transformer implementation in PyTorch.

Python · 478 stars · 108 forks · Updated Mar 7, 2019

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ · 1,768 stars · 237 forks · Updated Mar 12, 2025

uuujf / IterAvg

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python · 3 stars · 1 fork · Updated Jul 31, 2020

litian96 / fair_flearn

Fair Resource Allocation in Federated Learning (ICLR '20)

Python · 247 stars · 60 forks · Updated Dec 2, 2023

epfml / ChocoSGD

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python · 66 stars · 17 forks · Updated Sep 10, 2020

cybertronai / autograd-hacks

Python · 156 stars · 32 forks · Updated Jun 8, 2022

vacancy / Synchronized-BatchNorm-PyTorch

Synchronized Batch Normalization implementation in PyTorch.

Python · 1,499 stars · 188 forks · Updated Apr 8, 2021

nikitaivkin / csh

Simple Hierarchical Count Sketch in Python

Python · 20 stars · 9 forks · Updated Jun 3, 2021

kiddyboots216 / CommEfficient

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python · 75 stars · 20 forks · Updated Aug 30, 2021

slowbull / FeaturesReplay

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python · 10 stars · 1 fork · Updated Aug 28, 2019

JYWa / MATCHA

Communication-efficient decentralized SGD (Pytorch)

Python · 23 stars · 10 forks · Updated Mar 17, 2020

gaosh / MBPG

PyTorch Implementation of Momentum-Based Policy Gradient Methods

Python · 8 stars · 2 forks · Updated Aug 12, 2020

GraphSAINT / GraphSAINT

[ICLR 2020; IPDPS 2019] Fast and accurate minibatch training for deep GNNs and large graphs (GraphSAINT: Graph Sampling Based Inductive Learning Method).

Python · 480 stars · 87 forks · Updated Aug 12, 2022