shumingma

Follow

Shuming Ma shumingma

Follow

159 followers · 9 following

Microsoft Research

Achievements

Achievements

Stars

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,340 224 Updated Dec 12, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,435 873 Updated Dec 20, 2024

NeoVertex1 / SuperPrompt

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,630 528 Updated Dec 1, 2024

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 1,798 82 Updated Dec 23, 2024

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 742 88 Updated Dec 24, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,933 1,208 Updated Dec 12, 2024

NVIDIA / cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 5,871 1,016 Updated Dec 11, 2024

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,234 2,629 Updated Dec 18, 2024

openai / openai-python

The official Python library for the OpenAI API

Python 23,618 3,350 Updated Dec 21, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,773 878 Updated Oct 3, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ 13,783 1,687 Updated Dec 24, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 14,758 1,394 Updated Dec 22, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,463 2,572 Updated Dec 15, 2024

OFA-Sys / OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,435 249 Updated Apr 24, 2024

microsoft / torchscale

Foundation Architecture for (M)LLMs

Python 3,036 211 Updated Apr 11, 2024

ELS-RD / kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,544 95 Updated Feb 16, 2024

facebookresearch / stopes

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 257 39 Updated Dec 10, 2024

EleanorJiang / BlonDe

Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric Evaluation of Machine Translation with a Densely Annotated P…

Python 75 9 Updated Sep 21, 2023

facebookresearch / metaseq

Repo for external large-scale work

Python 6,520 729 Updated Apr 27, 2024

hannlp / SimpleNMT

A simple and readable neural machine translation system

Python 24 1 Updated Mar 6, 2022

google-research / deduplicate-text-datasets

Rust 1,146 112 Updated Jul 30, 2024

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,816 632 Updated Dec 23, 2024

google-research / multilingual-t5

Python 1,253 129 Updated Dec 15, 2022

robertostling / eflomal

Efficient Low-Memory Aligner

C 139 31 Updated Sep 3, 2024

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,228 329 Updated May 16, 2023

kakaobrain / torchgpipe

A GPipe implementation in PyTorch

Python 820 100 Updated Jul 25, 2024

facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.

Python 3,214 283 Updated Nov 26, 2024

microsoft / infinibatch

Efficient, check-pointed data loading for deep learning with massive data sets.

Python 205 17 Updated Jun 12, 2023

thompsonb / vecalign

Improved Sentence Alignment in Linear Time and Space

Python 163 30 Updated Mar 6, 2023

odashi / mteval

Collection of Evaluation Metrics and Algorithms for Machine Translation

C++ 76 15 Updated Mar 5, 2018