Skip to content
View Elio-yang's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report Elio-yang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Batch convert ppt files to pdf files by Automator on MacOS

AppleScript 109 8 Updated Aug 30, 2024

A low-latency & high-throughput serving engine for LLMs

Python 270 33 Updated Sep 12, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,762 4,830 Updated Dec 12, 2024

A Cycle-level simulator for M2NDP

C++ 17 1 Updated Nov 28, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,586 212 Updated Oct 16, 2024

Practice on cifar100(ResNet, DenseNet, VGG, GoogleNet, InceptionV3, InceptionV4, Inception-ResNetv2, Xception, Resnet In Resnet, ResNext,ShuffleNet, ShuffleNetv2, MobileNet, MobileNetv2, SqueezeNet…

Python 4,332 1,179 Updated Jul 15, 2024

Pepc - Power, Energy, and Performance Configurator

Python 29 8 Updated Nov 29, 2024

ngAP's artifact for ASPLOS'24

C++ 19 Updated Dec 10, 2024

ASCII generator (image to text, image to image, video to video)

Python 7,421 570 Updated Nov 22, 2024

CUDA Python Low-level Bindings

Python 995 81 Updated Dec 12, 2024

gem5-nvmain hybrid simulator supporting simulation of DRAM-NVM hybrid memory system

C++ 74 49 Updated Jul 23, 2019

SHMA: Software-managed Caching for Hybrid DRAM/NVM Memory Architectures, implemented with zsim and nvmain hybrid simulators

C++ 60 34 Updated Aug 26, 2017

Transforming Graphs for Efficient Irregular Graph Processing on GPUs

Cuda 47 16 Updated Nov 15, 2022

This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews should be submitted to https://gem5-review.googlesource.com/. The…

C++ 29 9 Updated Jul 20, 2024

Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators

C++ 59 10 Updated Aug 31, 2024
Python 40 7 Updated Sep 26, 2024
Go 2 Updated Aug 29, 2023

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

Python 1,645 254 Updated Dec 12, 2024

Evaluation code for confidential virtual machines (AMD SEV-SNP / Intel TDX)

Python 4 1 Updated Dec 12, 2024
C++ 4 1 Updated Oct 6, 2024

GPGPU-Sim enabled Turing WMMA API and its benchmark results. Undergraduate study at Yonsei Univ.

C++ 9 6 Updated Feb 21, 2021

llama3.cuda is a pure C/CUDA implementation for Llama 3 model.

Cuda 317 21 Updated Jun 4, 2024

LLM inference in C/C++

C++ 69,142 9,930 Updated Dec 12, 2024
C++ 4 1 Updated Nov 10, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,923 2,141 Updated Dec 5, 2024

Source code for the paper "Encrypted Image Classification with Low Memory Footprint using Fully Homomorphic Encryption"

Jupyter Notebook 43 13 Updated Jul 20, 2024

A reading list for homomorphic encryption

106 7 Updated Aug 1, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,195 622 Updated Dec 12, 2024

Materials about Privacy-Preserving Machine Learning

233 51 Updated Jun 26, 2024
Next