qelk123
:octocat:
Focusing

Highlights

  • Pro

  • tvm Public

    Forked from apache/tvm

    Open deep learning compiler stack for cpu, gpu and specialized accelerators

    Python Apache License 2.0 Updated Jul 11, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Apr 10, 2024
  • triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++ MIT License Updated Apr 2, 2024
  • byteir Public

    Forked from bytedance/byteir

    A model compilation solution for various hardware

    MLIR Apache License 2.0 Updated Mar 28, 2024
  • torch-mlir Public

    Forked from llvm/torch-mlir

    The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

    C++ Other Updated Mar 27, 2024
  • GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

    C++ Other Updated Feb 20, 2024
  • iree Public

    Forked from iree-org/iree

    A retargetable MLIR-based machine learning compiler and runtime toolkit.

    C++ Apache License 2.0 Updated Jan 23, 2024
  • Trilinos Public

    Forked from trilinos/Trilinos

    Primary repository for the Trilinos Project

    C++ Other Updated Dec 28, 2023
  • An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).

    C++ Apache License 2.0 Updated Nov 9, 2023
  • lapack Public

    Forked from Reference-LAPACK/lapack

    LAPACK development repository

    Fortran Other Updated Oct 10, 2023
  • merge-spmv Public

    Forked from dumerrill/merge-spmv

    Cuda BSD 3-Clause "New" or "Revised" License Updated Jul 4, 2023
  • hipCUB Public

    Forked from ROCm/hipCUB

    Reusable software components for ROCm developers

    C++ Other Updated Jun 27, 2023
  • cub Public

    Forked from NVIDIA/cub

    Cooperative primitives for CUDA C++.

    Cuda BSD 3-Clause "New" or "Revised" License Updated Jun 24, 2023
  • BladeDISC Public

    Forked from alibaba/BladeDISC

    BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

    C++ Apache License 2.0 Updated Jun 21, 2023
  • CUDA Library Samples

    Cuda Other Updated May 17, 2023
  • cutlass Public

    Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    C++ Other Updated May 8, 2023
  • TensorRT Public

    Forked from NVIDIA/TensorRT

    NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicat…

    C++ Apache License 2.0 Updated May 5, 2023
  • SparseTIR Public

    Forked from uwsampl/SparseTIR

    SparseTIR: Sparse Tensor Compiler for Deep Learning

    Python Apache License 2.0 Updated Apr 21, 2023
  • zju-icicles Public

    Forked from QSCTech/zju-icicles

    Zhejiang University course guide sharing project

    HTML Updated Aug 24, 2022
  • taco Public

    Forked from rohany/taco

    The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

    C++ Other Updated Aug 5, 2022
  • CUSP : A C++ Templated Sparse Matrix Library

    C++ Apache License 2.0 Updated Aug 1, 2022
  • tvm-rfcs Public

    Forked from yelite/tvm-rfcs

    A home for the final text of all TVM RFCs.

    Apache License 2.0 Updated Jul 27, 2022
  • Course guides for the Department of Computer Science and Technology, Tsinghua University

    HTML Creative Commons Attribution Share Alike 4.0 International Updated Jan 22, 2022
  • PPT Public

    Forked from lanl/PPT

    Performance Prediction Toolkit

    Python Updated Nov 15, 2021
  • C Updated Jun 13, 2021
  • Python Apache License 2.0 Updated Jun 13, 2021
  • CMSIS_5 Public

    Forked from ARM-software/CMSIS_5

    CMSIS Version 5 Development Repository

    C Apache License 2.0 Updated Jun 12, 2021
  • ucc162.3 Public

    Forked from sheisc/ucc162.3

    A lightweight open-source C compiler for research and education.

    C Updated May 10, 2021
  • CUDAAdvisor Public

    Forked from sderek/CUDAAdvisor

    CUDAAdvisor: a GPU profiling tool

    Cuda MIT License Updated Aug 24, 2018
  • buddy Public

    Forked from cloudwu/buddy

    Buddy memory allocation

    C Updated Dec 22, 2011