Saturn798

Follow

Saturn Saturn798

Follow

Stars

wln20 / CSKV

[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios

Python 9 Updated Oct 18, 2024

wln20 / Attention-Viewer

A tool for visualizing attention-score heatmap in generative LLMs

Python 21 1 Updated May 16, 2024

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,743 23,087 Updated Jan 12, 2025