PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 286 23 Updated May 4, 2024

Beomi / InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 349 31 Updated Apr 23, 2024

exists-forall / striped_attention

Python 38 2 Updated Nov 10, 2023

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 645 56 Updated Dec 19, 2024

lucidrains / ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 492 29 Updated Oct 25, 2024

kyegomez / Blockwise-Parallel-Transformer

32 times longer context window than vanilla Transformers and up to 4 times longer than memory-efficient Transformers.

Python 44 1 Updated Jun 16, 2023

haoliuhl / ringattention

Large Context Attention

Python 671 53 Updated Aug 12, 2024

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 9,856 873 Updated Nov 12, 2024

RulinShao / LightSeq

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 204 9 Updated Aug 19, 2024

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 32,403 3,704 Updated Jan 15, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 39,014 4,352 Updated Jan 8, 2025

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,201 554 Updated Oct 19, 2024

xverse-ai / XVERSE-13B

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Python 648 59 Updated Apr 9, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,879 128 Updated Jan 1, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,105 2,279 Updated Dec 27, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,840 5,187 Updated Jan 16, 2025

FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Python 8,246 601 Updated Jan 16, 2025

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 9,591 1,120 Updated Jan 16, 2025

intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

Python 1,311 169 Updated Jan 16, 2025

DeepLink-org / AIChipBenchmark

Python 22 8 Updated Dec 19, 2024

NVIDIA / apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,502 1,417 Updated Dec 14, 2024

OpenBMB / BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 574 78 Updated Jul 22, 2024

abacusai / Long-Context

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 583 37 Updated Nov 17, 2023

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,398 118 Updated Apr 17, 2024

OpenLMLab / scaling-rope

Code for Scaling Laws of RoPE-based Extrapolation

71 2 Updated Oct 16, 2023
