Starred repositories
LiBai (李白): A Toolbox for Large-Scale Distributed Parallel Training
Official release of InternLM2.5 base and chat models, with 1M-token context support
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
Reference implementation of the Megalodon 7B model
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Unofficial PyTorch/🤗 Transformers (Gemma/Llama3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"
Ring Attention implementation with FlashAttention
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
A context window 32 times longer than vanilla Transformers and up to 4 times longer than memory-efficient Transformers.
Official inference library for Mistral models
Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
A library for efficient similarity search and clustering of dense vectors.
Making large AI models cheaper, faster and more accessible
Large World Model -- Modeling Text and Video with Million-Token Context
XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.
VideoSys: An easy and efficient system for video generation
Open-Sora: Democratizing Efficient Video Production for All
A high-throughput and memory-efficient inference and serving engine for LLMs
Retrieval and Retrieval-augmented LLMs
Large Language Model Text Generation Inference
DLRover: An Automatic Distributed Deep Learning System
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Efficient Training (including pre-training and fine-tuning) for Big Models
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model's information retrieval capabilities.
YaRN: Efficient Context Window Extension of Large Language Models
Code for Scaling Laws of RoPE-based Extrapolation