Skip to content
View yangxianpku's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report yangxianpku

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

RDMA core userspace libraries and daemons

C 1,668 711 Updated Jan 27, 2025

Yinghan's Code Sample

Cuda 305 55 Updated Jul 25, 2022

Yet another PyTorch implementation of Stable Diffusion (probably easy to read)

Python 561 62 Updated Mar 4, 2024

Infiniband Verbs Performance Tests

C 670 306 Updated Jan 13, 2025

cuDTW++: Ultra-Fast Dynamic Time Warping on CUDA-enabled GPUs

Cuda 25 2 Updated May 11, 2020

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,490 5,332 Updated Jan 28, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,451 146 Updated Jan 24, 2025

Implement Flash Attention using Cute.

Cuda 68 3 Updated Dec 17, 2024

Web UI for AIOS: AIOS AgentHub and AIOS AgentChat

5 Updated Nov 1, 2024

FlagPerf is an open-source software platform for benchmarking AI chips.

Python 319 108 Updated Dec 31, 2024

FlagGems is an operator library for large language models implemented in Triton Language.

Python 407 63 Updated Jan 26, 2025

TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)

C++ 38 14 Updated Jan 31, 2025

本书为《C++17 the complete guide》的个人中文翻译,仅供学习和交流使用,侵删

TeX 1,643 260 Updated Sep 22, 2024

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 574 39 Updated Jan 21, 2025

Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

Python 14,073 1,390 Updated Jan 31, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,367 1,924 Updated Oct 15, 2024

Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA

C++ 724 40 Updated Jan 30, 2025

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 391 46 Updated Jan 17, 2025

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 185 17 Updated Dec 11, 2024

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…

TypeScript 52,572 11,377 Updated Feb 1, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 14,566 1,514 Updated Jan 21, 2025

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,310 262 Updated Jan 24, 2025

一个基于C++11的轻量级网络框架,基于线程池技术可以实现大并发网络IO

C++ 2,011 598 Updated Jan 29, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,198 97 Updated Jan 24, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 8,282 808 Updated Feb 1, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 275 22 Updated Jan 31, 2025

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,661 9,574 Updated Jan 29, 2025

CUDA Core Compute Libraries

C++ 1,408 185 Updated Feb 1, 2025

🤖一个基于 WeChaty 结合 OpenAi ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...

JavaScript 6,249 838 Updated Jan 14, 2025

A simple prompt-chatting AI based on wechaty and fintuned NLP model

Python 2,232 430 Updated Feb 16, 2023
Next