Skip to content
View liuchangdm's full-sized avatar

Block or report liuchangdm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

191 results for source starred repositories
Clear filter

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Python 397 55 Updated Nov 15, 2024
Python 272 27 Updated Dec 24, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,615 464 Updated Nov 21, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 392 28 Updated Dec 30, 2024

Reference implementation of Megalodon 7B model

Cuda 512 54 Updated Apr 18, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 287 23 Updated May 4, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 349 31 Updated Apr 23, 2024

Ring attention implementation with flash attention

Python 625 52 Updated Dec 19, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 486 29 Updated Oct 25, 2024

32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.

Python 43 1 Updated Jun 16, 2023

Large Context Attention

Python 663 53 Updated Aug 12, 2024

Official inference library for Mistral models

Jupyter Notebook 9,843 874 Updated Nov 12, 2024

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 201 9 Updated Aug 19, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 32,200 3,689 Updated Dec 28, 2024

Making large AI models cheaper, faster and more accessible

Python 38,986 4,347 Updated Jan 3, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,189 554 Updated Oct 19, 2024

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Python 649 59 Updated Apr 9, 2024

VideoSys: An easy and efficient system for video generation

Python 1,857 127 Updated Jan 1, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 22,954 2,259 Updated Dec 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,095 5,038 Updated Jan 4, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,123 595 Updated Jan 3, 2025

Large Language Model Text Generation Inference

Python 9,554 1,110 Updated Jan 3, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,321 169 Updated Jan 3, 2025
Python 20 8 Updated Dec 19, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,482 1,414 Updated Dec 14, 2024

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 571 78 Updated Jul 22, 2024

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 583 37 Updated Nov 17, 2023

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,387 118 Updated Apr 17, 2024

code for Scaling Laws of RoPE-based Extrapolation

71 2 Updated Oct 16, 2023
Next