Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.

Python, 106 stars, 10 forks, updated Aug 9, 2024

LongBench v2 and LongBench (ACL 2024)

Python, 755 stars, 65 forks, updated Jan 15, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python, 845 stars, 57 forks, updated Dec 16, 2024

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Python, 270 stars, 20 forks, updated Apr 29, 2024

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python, 416 stars, 51 forks, updated Aug 1, 2024

[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention approximately with dynamic sparsity, reducing pre-filling latency by up to 10x on an A100 whil…

Python, 877 stars, 41 forks, updated Dec 28, 2024

Get your documents ready for gen AI

Python, 18,495 stars, 974 forks, updated Jan 17, 2025

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python, 378 stars, 19 forks, updated Oct 16, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python, 6,755 stars, 376 forks, updated Jul 11, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

7,096 stars, 423 forks, updated Jul 28, 2024

Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Python, 160 stars, 13 forks, updated Feb 6, 2024

Code and data for "Lost in the Middle: How Language Models Use Long Contexts"

Python, 324 stars, 27 forks, updated Jan 4, 2024

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python, 3,798 stars, 287 forks, updated Jan 11, 2025