  • National University of Singapore
  • Singapore
  • X @feng_seu

Organizations

@Xtra-Computing

18 stars written in C++

LLM inference in C/C++

C++ 72,739 10,482 Updated Feb 2, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 32,679 3,724 Updated Jan 31, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,071 418 Updated Jan 28, 2025

ThunderSVM: A Fast SVM Library on GPUs and CPUs

C++ 1,581 218 Updated Apr 1, 2024

Scalable Network Stack for FPGAs (TCP/IP, RoCEv2)

C++ 780 275 Updated Nov 17, 2024

ThunderGBM: Fast GBDTs and Random Forests on GPUs

C++ 693 88 Updated Jan 29, 2024

Low-bit LLM inference on CPU with lookup table

C++ 659 49 Updated Jan 9, 2025

A collection of extensions for Vitis and Intel FPGA OpenCL to improve developer quality of life.

C++ 312 58 Updated Jan 20, 2025

Collection of benchmarks to measure basic GPU capabilities

C++ 287 43 Updated Feb 1, 2025

An acceleration library that supports arbitrary bit-width combinatorial quantization operations

C++ 213 21 Updated Sep 30, 2024

RapidStream TAPA compiles task-parallel HLS program into high-frequency FPGA accelerators.

C++ 163 34 Updated Feb 2, 2025

HLS-based Graph Processing Framework on FPGAs

C++ 145 33 Updated Oct 11, 2022

A tree-based federated learning system (MLSys 2023)

C++ 145 41 Updated Jan 20, 2025

Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators

C++ 85 27 Updated Oct 15, 2024

Fast Parallel Probabilistic Graphical Model Learning and Inference [IPDPS'22, PPoPP'23, USENIX ATC'24]

C++ 43 7 Updated Jan 31, 2025

Scaling Graph Processing on HBM-enabled FPGAs with Heterogeneous Pipelines

C++ 17 7 Updated Aug 8, 2022
C++ 4 Updated Jun 7, 2024