Skip to content
View crcrpar's full-sized avatar
  • NVIDIA
  • Tokyo
  • 15:40 (UTC +09:00)

Block or report crcrpar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
12 results for source starred repositories written in Cuda
Clear filter

Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

Cuda 6,074 624 Updated Aug 2, 2021

Fast parallel CTC.

Cuda 4,071 1,039 Updated Mar 4, 2024

Squeeze-and-Excitation Networks

Cuda 3,459 845 Updated Feb 25, 2019

CUDA Library Samples

Cuda 1,794 365 Updated Feb 27, 2025

Fully Convolutional Instance-aware Semantic Segmentation

Cuda 1,567 413 Updated Sep 27, 2021

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,037 200 Updated Jun 8, 2023

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 850 203 Updated Mar 5, 2025

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 710 257 Updated Aug 19, 2024

CUDA Kernel Benchmarking Library

Cuda 582 72 Updated Nov 20, 2024

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 237 17 Updated Oct 28, 2024

Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memory within GPU thread blocks.

Cuda 16 1 Updated Sep 24, 2017
Cuda 2 Updated Aug 2, 2024