Skip to content
View lsq314's full-sized avatar

Block or report lsq314

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AMD OpenNIC Project Overview

Shell 247 41 Updated Dec 20, 2022

FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]

TeX 126 18 Updated Aug 17, 2023

Implementation of a Tensor Processing Unit for embedded systems and the IoT.

VHDL 437 63 Updated Jan 5, 2019
C++ 14 3 Updated Sep 17, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,116 150 Updated Aug 1, 2024

A pytorch implementation of dorefa quantization

Python 113 11 Updated Dec 30, 2019

A reading list for deep graph learning acceleration.

240 20 Updated Feb 7, 2025

PyTorch implementation of DeltaLSTM and Column-Balanced Targeted Dropout

Python 4 1 Updated Jun 16, 2022

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 1 Updated Mar 7, 2024
C++ 2 Updated Apr 9, 2023

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,324 431 Updated Apr 24, 2023

The code for our ICCAD work "Optimized Data Reuse via Reordering for Sparse Matrix-Vector Multiplication on FPGAs"

C++ 6 Updated May 7, 2022

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,441 2,100 Updated Jul 24, 2024
C++ 36 6 Updated Mar 6, 2019

Vitis HLS LLVM source code and examples

383 58 Updated Oct 11, 2024

A high-level performance analysis tool for FPGA-based accelerators

C++ 19 7 Updated Jun 2, 2017

Useful CMake Examples

CMake 12,680 2,520 Updated Feb 28, 2024
Lua 671 284 Updated Aug 21, 2018

Matplotlib styles for scientific plotting

Python 7,591 731 Updated Feb 21, 2025

Java inefficiency detection tool based on CPU performance monitoring counters and hardware debug register. The tool detects dead writes, silent stores, and redundant loads.

C++ 45 5 Updated Sep 1, 2021

This is a mips simulator I wrote once to help my understanding of pipelines, branch prediction, assembly language, and more.

C++ 65 14 Updated Dec 3, 2019