JingyangXiang

29 followers · 7 following

Zhejiang University
Hangzhou, Zhejiang, China
https://jingyangxiang.github.io/

Stars

mit-han-lab / duo-attention

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 400 21 Updated Oct 31, 2024

FMInference / H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 405 47 Updated Aug 1, 2024

kyegomez / AttentionIsOFFByOne

Implementation of "Attention Is Off By One" by Evan Miller

Python 186 10 Updated Aug 28, 2023

tomaarsen / attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 683 40 Updated Apr 10, 2024

RoboUniview / RoboMM

Python 37 4 Updated Dec 12, 2024

FoundationVision / VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 6,175 418 Updated Dec 6, 2024

implicitDeclaration / HVAQ-dataset

Dataset for the paper "HVAQ: A High-Resolution Vision-Based Air Quality Dataset"

8 4 Updated Nov 20, 2021

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 136,277 27,294 Updated Dec 13, 2024

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,337 162 Updated Jun 25, 2024

jy-yuan / KIVI

[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 254 23 Updated Oct 10, 2024

iclementine / optimize_softmax

Optimize softmax in triton in many cases

Python 16 Updated Sep 6, 2024

hyperai / triton-cn

Triton Documentation in Simplified Chinese

TypeScript 43 4 Updated Dec 11, 2024

wdndev / llm_interview_note

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm and application engineers

HTML 4,170 474 Updated Oct 22, 2024

JingyangXiang / DFRot

DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation

Python 6 Updated Dec 10, 2024

ruikangliu / FlatQuant

Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization

Python 76 7 Updated Nov 12, 2024

sgsdxzy / AutoQuarot

Auto convert transformers models to QuaRot.

Python 8 1 Updated Apr 12, 2024

horrible-dong / DPA

[NeurIPS 2024] Dual-Perspective Activation: Efficient Channel Denoising via Joint Forward-Backward Criterion for Artificial Neural Networks

Python 7 1 Updated Dec 2, 2024

xiaolul2 / MGMap

[CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"

Python 92 5 Updated Apr 13, 2024

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python 8,983 575 Updated Dec 13, 2024

openai-translator / openai-translator

Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.

TypeScript 24,001 1,741 Updated Nov 16, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,605 4,783 Updated Dec 6, 2024