Highlights
- Pro
- All languages
- ApacheConf
- Assembly
- Bluespec
- C
- C++
- CMake
- CSS
- Coq
- Cuda
- Dart
- Dockerfile
- Emacs Lisp
- FIRRTL
- Go
- HCL
- HTML
- Haskell
- Java
- JavaScript
- Jupyter Notebook
- LLVM
- Lua
- MATLAB
- MLIR
- Makefile
- Markdown
- Mathematica
- PLpgSQL
- Python
- Raku
- Rich Text Format
- Roff
- Rust
- SCSS
- SVG
- Scala
- Shell
- SystemVerilog
- TL-Verilog
- Tcl
- TeX
- TypeScript
- VHDL
- Verilog
- Vim Script
Starred repositories
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Democratizing Reinforcement Learning for LLMs
High Throughput Batched Tridiagonal solver library for Xilinx Accelerator cards
Machine-Learning Accelerator System Exploration Tools
A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Processors”). Features six capstone projects to solidify GPU par…
Machine Learning Engineering Open Book
A unified simulation platform that combines hardware and software, enabling pre-silicon, full-stack, closed-loop evaluation of your robotic system.
A Full-System Simulator for CXL-Based SSD Memory System
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
gem5-nvmain hybrid simulator supporting simulation of DRAM-NVM hybrid memory system
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
Demo of PyTorch to Verilog with Torch-MLIR, MLIR, and CIRCT for Hot Chips 2022.
Visualization of cache-optimized matrix multiplication
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)
[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant
EE 628: Analysis and Design of Integrated Circuits (University of Hawaiʻi at Mānoa)
design and verification of asynchronous circuits
A course in reinforcement learning in the wild