xxxxyu

Follow

🎯

Focusing

Xiangyu Li xxxxyu

🎯

Focusing

Follow

Ph.D. student at AIR, THU

25 followers · 24 following

Tsinghua University
Beijing, China
07:05 (UTC +08:00)
https://xxxxyu.github.io/academic
https://xxxxyu.github.io/blog

Achievements

Achievements

Highlights

Pro

Lists (8)

Sort

Awesome Repos

11 repositories

Docs & Tutorials

24 repositories

LLM Applications

LLM Inference & Serving

32 repositories

Others

Resources

Software & Tools

17 repositories

Website Dev

Stars

microsoft / T-MAC

Low-bit LLM inference on CPU with lookup table

C++ 689 53 Updated Jan 9, 2025

tmux-plugins / tpm

Tmux Plugin Manager

Shell 12,734 445 Updated Aug 5, 2024

sanwebinfo / my-termux-setup

Here is My Termux Terminal Emulator Setup & Packages

174 15 Updated Sep 8, 2023

ssut / payload-dumper-go

an android OTA payload dumper written in Go

Go 2,566 212 Updated Nov 20, 2024

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 1,121 54 Updated Feb 27, 2025

mfkiwl / rk-open-docs

internal docs

Shell 165 102 Updated Jun 3, 2021

graelo / pumas

Power Usage Monitor for Apple Silicon

Rust 150 5 Updated Sep 20, 2024

DD-DuDa / BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 100 15 Updated May 16, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 12,767 897 Updated Feb 18, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,511 364 Updated Feb 26, 2025

Zefan-Cai / Awesome-LLM-KV-Cache

Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.

217 12 Updated Dec 7, 2024

microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 528 39 Updated Feb 14, 2025

yixuantt / MultiHop-RAG

Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)

Python 265 19 Updated Nov 19, 2024

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 33,301 3,771 Updated Feb 27, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 10,976 1,092 Updated Feb 27, 2025

ray-project / llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

Jupyter Notebook 1,772 243 Updated Aug 2, 2024

Mahdisadjadi / arxivscraper

A python module to scrape arxiv.org for a date range and category

Python 292 53 Updated Jan 22, 2024

NirmalSilwal / system-design-resources

Contains system design materials to prepare for system design interviews 🚩👨‍💻👨‍💻👨‍💻

884 305 Updated Apr 21, 2023

keyvanakbary / learning-notes

Notes on books I read, talks I watch, articles I study, and papers I love

SCSS 5,765 1,236 Updated Jan 2, 2024

thuhcsi / MagicMan

Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"

Python 288 11 Updated Sep 16, 2024

kyrolabs / awesome-langchain

😎 Awesome list of tools and projects with the awesome LangChain framework

8,000 562 Updated Feb 21, 2025

amazon-science / RAGChecker

RAGChecker: A Fine-grained Framework For Diagnosing RAG

Python 770 66 Updated Dec 13, 2024

lizhe2004 / Awesome-LLM-RAG-Application

the resources about the application based on LLM with RAG pattern

1,155 68 Updated Jan 22, 2025

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,787 5,751 Updated Nov 29, 2024

Pelochus / ezrknpu

Easy usage of Rockchip's NPUs found in RK3588 and similar chips

Shell 130 7 Updated Nov 13, 2024

kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,761 158 Updated Jan 27, 2025

Joshua-Riek / ubuntu-rockchip

Ubuntu for Rockchip RK35XX Devices

Shell 2,969 320 Updated Jan 24, 2025

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,287 11,603 Updated Feb 24, 2025

NVIDIA-AI-IOT / jetson-copilot

A reference application for a local AI assistant with LLM and RAG

Python 105 18 Updated Dec 5, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 2,199 226 Updated Feb 27, 2025