Starred repositories

s1: Simple test-time scaling

Python 5,837 664 Updated Mar 4, 2025

NeuPIMs Simulator

Jupyter Notebook 71 21 Updated Jun 19, 2024

Development repository for the Triton language and compiler

MLIR 14,721 1,836 Updated Mar 5, 2025

Simple code for SpikedAttention (NeurIPS 2024)

Python 11 Updated Oct 18, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,614 487 Updated Feb 28, 2025

Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.

Batchfile 123,603 12,038 Updated Feb 23, 2025

An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).

C++ 77 13 Updated Jul 26, 2024

An open-source reimplementation of Google's Tensor Processing Unit (TPU).

Python 410 71 Updated Dec 6, 2017

Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts

C++ 104 14 Updated May 10, 2024

ABC: System for Sequential Logic Synthesis and Formal Verification

C 950 608 Updated Mar 4, 2025

A curated list of awesome hardware/chip design resources for deep learning

34 5 Updated May 1, 2018

A curated list of Computer Architecture and Systems resources

494 53 Updated Dec 24, 2024

Python 223 31 Updated Nov 9, 2022

Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.

Verilog 68 17 Updated Apr 30, 2019

A general framework for optimizing DNN dataflow on systolic arrays

Python 33 10 Updated Jan 2, 2021

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,436 526 Updated Mar 3, 2025

A co-design architecture on sparse attention

Python 51 4 Updated Aug 23, 2021

ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference

C++ 93 19 Updated Feb 10, 2025

DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator

C++ 348 149 Updated Aug 3, 2024

SystemC training aimed at TLM.

C++ 27 9 Updated Jul 31, 2020

Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions

Scala 181 31 Updated Jun 25, 2020

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

Python 1,489 258 Updated Jan 19, 2025

A massively parallel, high-level programming language

Rust 18,447 457 Updated Feb 23, 2025

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…

2,801 260 Updated Feb 24, 2025

A Collection Of The State-of-the-art Metaheuristic Algorithms In Python (Metaheuristic/Optimizer/Nature-inspired/Biology)

Python 968 198 Updated Sep 3, 2024

A flexible cross-platform IIR and FIR engine for crossovers, room correction etc.

Rust 624 53 Updated Mar 3, 2025

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 296 67 Updated Dec 11, 2024

[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Scala 82 8 Updated Aug 27, 2024

Deep Learning library for Lava

Jupyter Notebook 154 72 Updated Feb 11, 2025

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,300 631 Updated Mar 5, 2025