Repositories: 23 · Projects: 0 · Packages: 0 · Stars: 86

Jeng Bai-Cheng jeng1220

Majors in heterogeneous computing, such as CUDA and OpenCL.

25 followers · 1 following

NVIDIA
Taiwan

cuda_examples Public

Simple CUDA Examples

Cuda 3 Apache License 2.0 Updated Jan 5, 2025
flash-attention Public
Forked from PaddlePaddle/flash-attention

Fast and memory-efficient exact attention

C++ BSD 3-Clause "New" or "Revised" License Updated Oct 1, 2024
Paddle Public
Forked from PaddlePaddle/Paddle

PArallel Distributed Deep LEarning (core framework of PaddlePaddle『飞桨』: high-performance single-machine and distributed training, and cross-platform deployment)

C++ 1 Apache License 2.0 Updated Jun 28, 2024
TransformerEngine Public
Forked from NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in bot…

Python Apache License 2.0 Updated Sep 8, 2023
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamples

CUDA Library Samples

Cuda Other Updated May 8, 2023
openacc_fortran_examples Public

Simple OpenACC Fortran Examples

Topics: fortran, cuda, openacc

Fortran 54 11 Apache License 2.0 Updated Aug 1, 2021
gpubootcamp Public
Forked from openhackathons-org/gpubootcamp

This repository consists of GPU bootcamp material for HPC and AI

Jupyter Notebook Apache License 2.0 Updated Jun 23, 2021
tf_keras_example Public

TensorFlow and Keras Examples

Python 1 Updated Feb 25, 2021
cuGemmProf Public

A simple tool to profile the performance of multiple cuBLAS GEMM combinations

C++ 24 7 MIT License Updated Feb 9, 2021
cuFFT_example Public

Simple cuFFT examples

Cuda 1 Updated Feb 5, 2021
gpu_isac_mirror Public

gpu_isac mirror

Python Updated Oct 14, 2020
git_test Public

1 Updated May 16, 2020
amazon-dsstne Public
Forked from amazon-archives/amazon-dsstne

Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon-developed library for building deep learning (DL) machine learning (ML) models

C++ Apache License 2.0 Updated Mar 2, 2020
TensorRT Public
Forked from NVIDIA/TensorRT

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

C++ Apache License 2.0 Updated Dec 25, 2019
FluidDoc Public
Forked from PaddlePaddle/docs

Documentation for PaddlePaddle

Shell Updated Dec 25, 2019
dlrm Public
Forked from facebookresearch/dlrm

An implementation of a deep learning recommendation model (DLRM)

Python MIT License Updated Sep 11, 2019
install_numba_and_pyculib_by_pip Public

Installation instructions for Numba and pyculib via pip, tested on Ubuntu.

Updated Aug 12, 2019
cutlass Public
Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++ 1 1 BSD 3-Clause "New" or "Revised" License Updated Jul 24, 2019
cupy Public
Forked from cupy/cupy

NumPy-like API accelerated with CUDA

Python MIT License Updated Jan 28, 2019
stream_benchmark Public

CUDA stream benchmark

Python 1 Updated Jan 7, 2019
trt-se-resnext Public

A sample that runs SE-ResNeXt on TensorRT

C++ 6 2 Updated Nov 15, 2018
KerasToTensorRT Public

This is a simple demonstration of running a Keras model on TensorFlow with TensorRT integration (TFTRT), or on TensorRT directly, without invoking "freeze_graph.py".

Python 67 23 Updated Jul 17, 2018
Tensorflow_Inception_v3_TensorRT Public

This is a simple demonstration of running the TensorFlow Inception v3 model on TensorRT

C++ 12 7 Updated Jun 5, 2018
