Stars
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
MSVC's implementation of the C++ Standard Library.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
A lightweight library for portable low-level GPU computation using WebGPU.
Learn to use WebGPU for native graphic applications in C++
Amazon Kinesis Video Streams Producer SDK for C++ is for developers to install and customize for their connected camera and other devices to securely stream video, audio, and time-encoded data to K…
A GPU-driven system framework for scalable AI applications
Open-Source Licensed Educational SSD Simulator for High-Performance Storage and Full-System Evaluations
Kernel Fusion and Runtime Compilation Based on NNVM
Matrix Operation Library for FPGA https://xilinx.github.io/gemx/
Official GitHub repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
An MPEG/DASH client-server module for simulating rate adaptation mechanisms over HTTP/TCP.