Stars
A library for efficient similarity search and clustering of dense vectors.
High-speed Large Language Model Serving for Local Deployment
ThunderSVM: A Fast SVM Library on GPUs and CPUs
Scalable Network Stack for FPGAs (TCP/IP, RoCEv2)
ThunderGBM: Fast GBDTs and Random Forests on GPUs
A collection of extensions for Vitis and Intel FPGA OpenCL to improve developer quality of life.
collection of benchmarks to measure basic GPU capabilities
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
RapidStream TAPA compiles task-parallel HLS program into high-frequency FPGA accelerators.
HLS-based Graph Processing Framework on FPGAs
A tree-based federated learning system (MLSys 2023)
Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
Fast Parallel Probabilistic Graphical Model Learning and Inference [IPDPS'22, PPoPP'23, USENIX ATC'24]
Scaling Graph Processing on HBM-enabled FPGAs with Heterogeneous Pipelines