-
-
-
TensorRT Public
Forked from pytorch/TensorRTPyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
Python BSD 3-Clause "New" or "Revised" License UpdatedFeb 9, 2024 -
hidet Public
Forked from hidet-org/hidetAn open-source efficient deep learning framework.
Python Apache License 2.0 UpdatedAug 21, 2023 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedAug 2, 2023 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedAug 2, 2023 -
cpp-ipc Public
Forked from mutouyun/cpp-ipcC++ IPC Library: A high-performance inter-process communication using shared memory on Linux/Windows.
C++ Other UpdatedJul 16, 2023 -
awesome-tensor-compilers Public
Forked from merrymercy/awesome-tensor-compilersA list of awesome compiler projects and papers for tensor computation and deep learning.
UpdatedJul 16, 2023 -
iceoryx Public
Forked from eclipse-iceoryx/iceoryxEclipse iceoryx™ - true zero-copy inter-process-communication
C++ Apache License 2.0 UpdatedJul 15, 2023 -
finetune-gpt2xl Public
Forked from Xirider/finetune-gpt2xlGuide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
Python MIT License UpdatedJun 14, 2023 -
cricket Public
Forked from RWTH-ACS/cricketcricket is a virtualization solution for GPUs
C MIT License UpdatedJun 2, 2023 -
-
-
rocksdb Public
Forked from facebook/rocksdbA library that provides an embeddable, persistent key-value store for fast storage.
C++ GNU General Public License v2.0 UpdatedApr 8, 2023 -
llvm-project Public
Forked from llvm/llvm-projectThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at…
Other UpdatedMar 26, 2023 -
GPU-Virtualization-Benchmarks Public
Forked from UofT-EcoSystem/GPU-Virtualization-BenchmarksHTML UpdatedMar 16, 2023 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedMar 9, 2023 -
ml-cvnets Public
Forked from apple/ml-cvnetsCVNets: A library for training computer vision networks
Python Other UpdatedFeb 26, 2023 -
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesState-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Python UpdatedJan 30, 2023 -
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedJan 24, 2023 -
yolov5 Public
Forked from ultralytics/yolov5YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Python GNU General Public License v3.0 UpdatedJan 21, 2023 -
open-gpu-kernel-modules Public
Forked from NVIDIA/open-gpu-kernel-modulesNVIDIA Linux open GPU kernel module source
C Other UpdatedJan 19, 2023 -
-
ava Public
Forked from utcs-scea/avaAutomatic virtualization of (general) accelerators.
C++ BSD 2-Clause "Simplified" License UpdatedNov 28, 2022 -
Lucid Public
Forked from S-Lab-System-Group/LucidLucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
Python Other UpdatedNov 3, 2022 -
-
clusterdata Public
Forked from alibaba/clusterdatacluster data collected from production clusters in Alibaba for cluster management research
Jupyter Notebook UpdatedSep 22, 2022 -
-
cuda-graph-with-dynamic-parameters Public
Forked from hummingtree/cuda-graph-with-dynamic-parametersC++ MIT License UpdatedAug 9, 2022 -
protobuf-messaging Public
C++ library for sending/receiving protobuf messages over various channels (pipe, socket, kafka, etc.)
C++ MIT License UpdatedJul 28, 2022