anxuthu

anxuthu anxuthu

5 followers · 14 following

Stars

OpenLemur / Lemur

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 542 34 Updated Oct 28, 2023

bigcode-project / octopack

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 457 28 Updated Feb 5, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,470 721 Updated Dec 17, 2024

nuprl / MultiPL-E

A multi-programming language benchmark for LLMs

Python 235 40 Updated Jan 23, 2025

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,876 4,056 Updated Jul 17, 2024

sahil280114 / codealpaca

Python 1,458 109 Updated May 12, 2023

openai / human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,614 374 Updated Jan 17, 2025

AntonOsika / gpt-engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

Python 53,333 6,972 Updated Nov 17, 2024

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

28,107 2,316 Updated Jun 18, 2024

google-research / sam

Python 587 74 Updated Oct 26, 2024

Qualcomm-AI-research / oscillations-qat

Python 75 10 Updated Jul 21, 2022

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,925 3,557 Updated Jun 2, 2023

facebookresearch / dlrm

An implementation of a deep learning recommendation model (DLRM)

Python 3,839 852 Updated Oct 11, 2024

juntang-zhuang / Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Jupyter Notebook 1,060 110 Updated Aug 9, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,330 4,290 Updated Mar 11, 2025

facebookresearch / moco

PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722

Python 4,914 799 Updated Dec 19, 2024

MIC-DKFZ / nnUNet

Python 6,396 1,885 Updated Feb 25, 2025

cgraywang / gluon-nlp-1

Forked from dmlc/gluon-nlp

Code repo for "Language Models with Transformers" paper

Python 21 13 Updated Sep 18, 2020

briancheung / superposition

Python 45 8 Updated Oct 27, 2019

tunz / transformer-pytorch

Transformer implementation in PyTorch.

Python 478 108 Updated Mar 7, 2019

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,767 237 Updated Mar 11, 2025

uuujf / IterAvg

[ICML 2020] Obtaining Adjustable Regularization for Free via Iterate Averaging

Python 3 1 Updated Jul 31, 2020

litian96 / fair_flearn

Fair Resource Allocation in Federated Learning (ICLR '20)

Python 247 60 Updated Dec 2, 2023

epfml / ChocoSGD

Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356

Python 66 17 Updated Sep 10, 2020

cybertronai / autograd-hacks

Python 156 32 Updated Jun 8, 2022

vacancy / Synchronized-BatchNorm-PyTorch

Synchronized Batch Normalization implementation in PyTorch.

Python 1,499 188 Updated Apr 8, 2021

nikitaivkin / csh

Simple Hierarchical Count Sketch in Python

Python 20 9 Updated Jun 3, 2021

kiddyboots216 / CommEfficient

PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms

Python 75 20 Updated Aug 30, 2021

slowbull / FeaturesReplay

A PyTorch implementation of the paper "Training Neural Networks Using Features Replay"

Python 10 1 Updated Aug 28, 2019

IBM / FedMA

Code for Federated Learning with Matched Averaging, ICLR 2020.

Python 335 83 Updated Dec 5, 2021
