-
XJTU
- Xi'an
Highlights
- Pro
-
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedJul 11, 2024 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedApr 10, 2024 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedApr 2, 2024 -
byteir Public
Forked from bytedance/byteirA model compilation solution for various hardware
MLIR Apache License 2.0 UpdatedMar 28, 2024 -
torch-mlir Public
Forked from llvm/torch-mlirThe Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
C++ Other UpdatedMar 27, 2024 -
gpgpu-sim_distribution Public
Forked from gpgpu-sim/gpgpu-sim_distributionGPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
C++ Other UpdatedFeb 20, 2024 -
iree Public
Forked from iree-org/ireeA retargetable MLIR-based machine learning compiler and runtime toolkit.
C++ Apache License 2.0 UpdatedJan 23, 2024 -
Trilinos Public
Forked from trilinos/TrilinosPrimary repository for the Trilinos Project
C++ Other UpdatedDec 28, 2023 -
buddy-mlir Public
Forked from buddy-compiler/buddy-mlirAn MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
C++ Apache License 2.0 UpdatedNov 9, 2023 -
lapack Public
Forked from Reference-LAPACK/lapackLAPACK development repository
Fortran Other UpdatedOct 10, 2023 -
merge-spmv Public
Forked from dumerrill/merge-spmvCuda BSD 3-Clause "New" or "Revised" License UpdatedJul 4, 2023 -
hipCUB Public
Forked from ROCm/hipCUBReusable software components for rocm developers
C++ Other UpdatedJun 27, 2023 -
cub Public
Forked from NVIDIA/cubCooperative primitives for CUDA C++.
Cuda BSD 3-Clause "New" or "Revised" License UpdatedJun 24, 2023 -
BladeDISC Public
Forked from alibaba/BladeDISCBladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
C++ Apache License 2.0 UpdatedJun 21, 2023 -
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamplesCUDA Library Samples
Cuda Other UpdatedMay 17, 2023 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedMay 8, 2023 -
TensorRT Public
Forked from NVIDIA/TensorRTNVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…
C++ Apache License 2.0 UpdatedMay 5, 2023 -
SparseTIR Public
Forked from uwsampl/SparseTIRSparseTIR: Sparse Tensor Compiler for Deep Learning
Python Apache License 2.0 UpdatedApr 21, 2023 -
-
taco Public
Forked from rohany/tacoThe Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs
C++ Other UpdatedAug 5, 2022 -
cusplibrary Public
Forked from cusplibrary/cusplibraryCUSP : A C++ Templated Sparse Matrix Library
C++ Apache License 2.0 UpdatedAug 1, 2022 -
tvm-rfcs Public
Forked from yelite/tvm-rfcsA home for the final text of all TVM RFCs.
Apache License 2.0 UpdatedJul 27, 2022 -
REKCARC-TSC-UHT Public
Forked from PKUanonym/REKCARC-TSC-UHT清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University
HTML Creative Commons Attribution Share Alike 4.0 International UpdatedJan 22, 2022 -
-
microtvm-blogpost-eval-pynqz1 Public
Forked from areusch/microtvm-blogpost-evalC UpdatedJun 13, 2021 -
-
CMSIS_5 Public
Forked from ARM-software/CMSIS_5CMSIS Version 5 Development Repository
C Apache License 2.0 UpdatedJun 12, 2021 -
ucc162.3 Public
Forked from sheisc/ucc162.3A lightweight open-source C compiler for research and education.
C UpdatedMay 10, 2021 -
CUDAAdvisor Public
Forked from sderek/CUDAAdvisorCUDAAdvisor: a GPU profiling tool
Cuda MIT License UpdatedAug 24, 2018 -