crcrpar

Masaki Kozuki crcrpar

197 followers · 211 following

NVIDIA
Tokyo
15:40 (UTC +09:00)

Achievements

x3 x2 x3

Achievements

x3 x2 x3

Stars

12 results for source starred repositories written in Cuda

Clear filter

luanfujun / deep-painterly-harmonization

Code and data for paper "Deep Painterly Harmonization": https://arxiv.org/abs/1804.03189

Cuda 6,074 624 Updated Aug 2, 2021

baidu-research / warp-ctc

Fast parallel CTC.

Cuda 4,071 1,039 Updated Mar 4, 2024

hujie-frank / SENet

Squeeze-and-Excitation Networks

Cuda 3,459 845 Updated Feb 25, 2019

NVIDIA / CUDALibrarySamples

CUDA Library Samples

Cuda 1,794 365 Updated Feb 27, 2025

msracver / FCIS

Fully Convolutional Instance-aware Semantic Segmentation

Cuda 1,567 413 Updated Sep 27, 2021

openai / blocksparse

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,037 200 Updated Jun 8, 2023

rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 850 203 Updated Mar 5, 2025

olcf / cuda-training-series

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 710 257 Updated Aug 19, 2024

NVIDIA / nvbench

CUDA Kernel Benchmarking Library

Cuda 582 72 Updated Nov 20, 2024

usyd-fsalab / fp6_llm

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Cuda 237 17 Updated Oct 28, 2024

debowin / cuda-tiled-matrix-multiplication

Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memory within GPU thread blocks.

Cuda 16 1 Updated Sep 24, 2017

enp1s0 / mateval

Cuda 2 Updated Aug 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Masaki Kozuki crcrpar

Achievements

Achievements

Block or report crcrpar

Stars

luanfujun / deep-painterly-harmonization

baidu-research / warp-ctc

hujie-frank / SENet

NVIDIA / CUDALibrarySamples

msracver / FCIS

openai / blocksparse

rapidsai / raft

olcf / cuda-training-series

NVIDIA / nvbench

usyd-fsalab / fp6_llm

debowin / cuda-tiled-matrix-multiplication

enp1s0 / mateval