Skip to content
View liuchangdm's full-sized avatar

Block or report liuchangdm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

191 results for source starred repositories
Clear filter

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Python 397 55 Updated Nov 15, 2024
Python 275 27 Updated Jan 16, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,680 475 Updated Jan 16, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 401 29 Updated Dec 30, 2024

Reference implementation of Megalodon 7B model

Cuda 512 54 Updated Apr 18, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 286 23 Updated May 4, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 349 31 Updated Apr 23, 2024

Ring attention implementation with flash attention

Python 645 56 Updated Dec 19, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 492 29 Updated Oct 25, 2024

32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.

Python 44 1 Updated Jun 16, 2023

Large Context Attention

Python 671 53 Updated Aug 12, 2024

Official inference library for Mistral models

Jupyter Notebook 9,856 873 Updated Nov 12, 2024

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Python 204 9 Updated Aug 19, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 32,403 3,704 Updated Jan 15, 2025

Making large AI models cheaper, faster and more accessible

Python 39,014 4,352 Updated Jan 8, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,201 554 Updated Oct 19, 2024

XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.

Python 648 59 Updated Apr 9, 2024

VideoSys: An easy and efficient system for video generation

Python 1,879 128 Updated Jan 1, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 23,105 2,279 Updated Dec 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 33,840 5,187 Updated Jan 16, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,246 601 Updated Jan 16, 2025

Large Language Model Text Generation Inference

Python 9,591 1,120 Updated Jan 16, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,311 169 Updated Jan 16, 2025
Python 22 8 Updated Dec 19, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,502 1,417 Updated Dec 14, 2024

Efficient Training (including pre-training and fine-tuning) for Big Models

Python 574 78 Updated Jul 22, 2024

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 583 37 Updated Nov 17, 2023

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,398 118 Updated Apr 17, 2024

code for Scaling Laws of RoPE-based Extrapolation

71 2 Updated Oct 16, 2023
Next