Skip to content
View shumingma's full-sized avatar

Block or report shumingma

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,340 224 Updated Dec 12, 2024

Official inference framework for 1-bit LLMs

C++ 12,435 873 Updated Dec 20, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,630 528 Updated Dec 1, 2024

Tile primitives for speedy kernels

Cuda 1,798 82 Updated Dec 23, 2024

FAIR Sequence Modeling Toolkit 2

Python 742 88 Updated Dec 24, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,933 1,208 Updated Dec 12, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,871 1,016 Updated Dec 11, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,234 2,629 Updated Dec 18, 2024

The official Python library for the OpenAI API

Python 23,618 3,350 Updated Dec 21, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,773 878 Updated Oct 3, 2024

Development repository for the Triton language and compiler

C++ 13,783 1,687 Updated Dec 24, 2024

Fast and memory-efficient exact attention

Python 14,758 1,394 Updated Dec 22, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,463 2,572 Updated Dec 15, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,435 249 Updated Apr 24, 2024

Foundation Architecture for (M)LLMs

Python 3,036 211 Updated Apr 11, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,544 95 Updated Feb 16, 2024

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 257 39 Updated Dec 10, 2024

Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric Evaluation of Machine Translation with a Densely Annotated P…

Python 75 9 Updated Sep 21, 2023

Repo for external large-scale work

Python 6,520 729 Updated Apr 27, 2024

A simple and readable neural machine translation system

Python 24 1 Updated Mar 6, 2022

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,816 632 Updated Dec 23, 2024

Efficient Low-Memory Aligner

C 139 31 Updated Sep 3, 2024

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,228 329 Updated May 16, 2023

A GPipe implementation in PyTorch

Python 820 100 Updated Jul 25, 2024

PyTorch extensions for high performance and large scale training.

Python 3,214 283 Updated Nov 26, 2024

Efficient, check-pointed data loading for deep learning with massive data sets.

Python 205 17 Updated Jun 12, 2023

Improved Sentence Alignment in Linear Time and Space

Python 163 30 Updated Mar 6, 2023

Collection of Evaluation Metrics and Algorithms for Machine Translation

C++ 76 15 Updated Mar 5, 2018
Next