-
Tsinghua University
- Beijing, China
-
07:05
(UTC +08:00) - https://xxxxyu.github.io/academic
- https://xxxxyu.github.io/blog
Highlights
- Pro
Lists (8)
Sort Name ascending (A-Z)
Stars
Here is My Termux Terminal Emulator Setup & Packages
an android OTA payload dumper written in Go
My learning notes/codes for ML SYS.
[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)
A library for efficient similarity search and clustering of dense vectors.
SGLang is a fast serving framework for large language models and vision language models.
A comprehensive guide to building RAG-based LLM applications for production.
A python module to scrape arxiv.org for a date range and category
Contains system design materials to prepare for system design interviews 🚩👨💻👨💻👨💻
Notes on books I read, talks I watch, articles I study, and papers I love
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
😎 Awesome list of tools and projects with the awesome LangChain framework
RAGChecker: A Fine-grained Framework For Diagnosing RAG
the resources about the application based on LLM with RAG pattern
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Easy usage of Rockchip's NPUs found in RK3588 and similar chips
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Ubuntu for Rockchip RK35XX Devices
A beautiful, simple, clean, and responsive Jekyll theme for academics
A reference application for a local AI assistant with LLM and RAG
FlashInfer: Kernel Library for LLM Serving