DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 400 21 Updated Oct 31, 2024

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 405 47 Updated Aug 1, 2024

Implementation of "Attention Is Off By One" by Evan Miller

Python 186 10 Updated Aug 28, 2023

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 683 40 Updated Apr 10, 2024
Python 37 4 Updated Dec 12, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 6,175 418 Updated Dec 6, 2024

Dataset for the paper "HVAQ: A High-Resolution Vision-Based Air Quality Dataset"

8 4 Updated Nov 20, 2021

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,277 27,294 Updated Dec 13, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,337 162 Updated Jun 25, 2024

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 254 23 Updated Oct 10, 2024

Optimize softmax in Triton for many cases

Python 16 Updated Sep 6, 2024

Triton Documentation in Simplified Chinese

TypeScript 43 4 Updated Dec 11, 2024

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 4,170 474 Updated Oct 22, 2024

DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation

Python 6 Updated Dec 10, 2024

Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization

Python 76 7 Updated Nov 12, 2024

Auto convert transformers models to QuaRot.

Python 8 1 Updated Apr 12, 2024

[NeurIPS 2024] Dual-Perspective Activation: Efficient Channel Denoising via Joint Forward-Backward Criterion for Artificial Neural Networks

Python 7 1 Updated Dec 2, 2024

[CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"

Python 92 5 Updated Apr 13, 2024

Refine high-quality datasets and visual AI models

Python 8,983 575 Updated Dec 13, 2024

Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.

TypeScript 24,001 1,741 Updated Nov 16, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,605 4,783 Updated Dec 6, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,765 99 Updated Jan 21, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,514 578 Updated Dec 14, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,742 485 Updated Nov 27, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,193 461 Updated Nov 6, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,880 4,846 Updated Dec 14, 2024

Code accompanying the paper "Massive Activations in Large Language Models"

Python 130 8 Updated Mar 4, 2024

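Chinese-localized version of Clash for Windows. Provides the Chinese-localized build, the localization patch, and the localized installer for Clash for Windows.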

JavaScript 21,962 2,815 Updated Dec 10, 2024

Some templates for visualizing data

Jupyter Notebook 4 Updated Oct 24, 2024

Command-line tool to inspect the difference between (the text in) two PDF files

Python 227 17 Updated Mar 31, 2022