Skip to content
View shengwenliang's full-sized avatar

Highlights

  • Pro

Block or report shengwenliang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 31,643 2,108 Updated Feb 22, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,724 145 Updated Feb 16, 2025

High Throughput Batched Tridiagonal solver library for Xilinx Accelerator cards

C++ 5 1 Updated Mar 19, 2023

Network on Chip Simulator

C++ 258 130 Updated Jan 22, 2024

Reproduce R1 Zero on Logic Puzzle

Python 1,668 101 Updated Feb 21, 2025

Machine-Learning Accelerator System Exploration Tools

Python 146 75 Updated Feb 23, 2025

A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU par…

Shell 538 53 Updated Feb 16, 2025
Verilog 1,412 293 Updated Feb 23, 2025
Jupyter Notebook 2,901 619 Updated Feb 19, 2025

Machine Learning Engineering Open Book

Python 12,867 785 Updated Feb 23, 2025

EQueue Dialect

MLIR 40 8 Updated Feb 3, 2022

A unified simulation platform that combines hardware and software, enabling pre-silicon, full-stack, closed-loop evaluation of your robotic system.

Python 39 4 Updated Feb 2, 2025

A Full-System Simulator for CXL-Based SSD Memory System

C++ 15 3 Updated Dec 24, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 58,725 5,974 Updated Aug 24, 2024

Learnings and programs related to CUDA

Cuda 285 10 Updated Feb 20, 2025

Modeling Architectural Platform

C++ 177 59 Updated Feb 21, 2025

gem5-nvmain hybrid simulator supporting simulation of DRAM-NVM hybrid memory system

C++ 75 49 Updated Jul 23, 2019

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 17,589 1,412 Updated Feb 22, 2025

Demo of PyTorch to Verilog with Torch-MLIR, MLIR, and CIRCT for Hot Chips 2022.

MLIR 5 1 Updated Aug 17, 2022

Visualization of cache-optimized matrix multiplication

Python 104 8 Updated Jun 13, 2019
C++ 2 1 Updated Apr 24, 2017

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

C++ 2,821 252 Updated Feb 23, 2025

Code for Sense (NDSS'24)

C++ 9 1 Updated Mar 3, 2024

mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)

C++ 45 7 Updated Dec 10, 2024

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

Jupyter Notebook 9,987 1,430 Updated Nov 9, 2024

EE 628: Analysis and Design of Integrated Circuits (University of Hawaiʻi at Mānoa)

Jupyter Notebook 147 28 Updated Nov 27, 2024

design and verification of asynchronous circuits

Python 17 Updated Feb 23, 2025

A Cycle-level simulator for M2NDP

C++ 23 3 Updated Nov 28, 2024

A course in reinforcement learning in the wild

Jupyter Notebook 6,031 1,707 Updated Dec 23, 2024
Next
Showing results