Skip to content
View liuchangdm's full-sized avatar

Block or report liuchangdm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Python 391 55 Updated Nov 15, 2024
Python 272 27 Updated Dec 24, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,601 463 Updated Nov 21, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 389 28 Updated Dec 26, 2024

Reference implementation of Megalodon 7B model

Cuda 510 54 Updated Apr 18, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 286 23 Updated May 4, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 349 31 Updated Apr 23, 2024

Ring attention implementation with flash attention

Python 621 52 Updated Dec 19, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 485 29 Updated Oct 25, 2024

32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.

Python 43 1 Updated Jun 16, 2023

Large Context Attention

Python 660 53 Updated Aug 12, 2024

Official inference library for Mistral models

Jupyter Notebook 9,832 870 Updated Nov 12, 2024

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 200 9 Updated Aug 19, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 32,130 3,687 Updated Dec 28, 2024

Making large AI models cheaper, faster and more accessible

Python 38,969 4,347 Updated Dec 25, 2024

Large World Model -- Modeling Text and Video with Millions Context

Python 7,189 554 Updated Oct 19, 2024

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Python 648 59 Updated Apr 9, 2024

VideoSys: An easy and efficient system for video generation

Python 1,847 127 Updated Dec 26, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,902 2,253 Updated Dec 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 32,740 4,987 Updated Dec 29, 2024

Retrieval and Retrieval-augmented LLMs

Python 8,066 591 Updated Dec 27, 2024

Large Language Model Text Generation Inference

Python 9,524 1,107 Updated Dec 27, 2024

DLRover: An Automatic Distributed Deep Learning System

Python 1,312 168 Updated Dec 29, 2024
Python 20 8 Updated Dec 19, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,475 1,412 Updated Dec 14, 2024

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 571 78 Updated Jul 22, 2024

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 583 37 Updated Nov 17, 2023

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,376 118 Updated Apr 17, 2024

code for Scaling Laws of RoPE-based Extrapolation

71 2 Updated Oct 16, 2023
Next