Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Development repository for the Triton language and compiler
This is simple code of SpikedAttention (Neurips 2024)
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
A open source reimplementation of Google's Tensor Processing Unit (TPU).
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
ABC: System for Sequential Logic Synthesis and Formal Verification
A curated list of awesome hardware/chip design resources for deep learning
A curated list of Computer Architecture and Systems resources
Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.
A general framework for optimizing DNN dataflow on systolic array
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
pku-liang / Sanger
Forked from hatsu3/SangerA co-design architecture on sparse attention
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator
Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
A massively parallel, high-level programming language
A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…
A Collection Of The State-of-the-art Metaheuristic Algorithms In Python (Metaheuristic/Optimizer/Nature-inspired/Biology)
A flexible cross-platform IIR and FIR engine for crossovers, room correction etc.
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.