Shanghai University (SHU)
Baoshan, Shanghai (UTC +08:00)
Zhihu: https://www.zhihu.com/people/drew-44-8
Google Scholar: https://scholar.google.com.hk/citations?user=L220uBgAAAAJ&hl=zh-CN
Stars
Hackable and optimized Transformers building blocks, supporting a composable construction.
A curated collection of noteworthy MLSys bloggers (algorithms/systems).
Puzzles for learning Triton; play them with minimal environment configuration!
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Xiao's CUDA Optimization Guide [Actively Adding New Content]
Simplified Chinese translation of the well-known CMake tutorial Modern CMake. Chinese GitBook: https://modern-cmake-cn.github.io/Modern-CMake-zh_CN/
Stepwise optimizations of DGEMM on CPU, eventually reaching performance faster than Intel MKL, even under multithreading.
A complete CMake tutorial: a series of step-by-step tasks that introduce CMake and show how to accomplish concrete goals.
Development repository for the Triton language and compiler
LLM notes, covering model inference, Transformer model structure, and LLM framework code analysis.
Chinese NLP solutions (large models, data, models, training, inference).
Multimodal Transformers are Hierarchical Modal-wise Heterogeneous Graphs
Learning materials for Stanford CS149: Parallel Computing
Official implementation of the paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models'.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNN and Transformer (a toy sketch of the sequential-vs-parallel idea follows after this list).
[NeurIPS'24 Spotlight] To speed up long-context LLM inference, attention is computed with approximate, dynamic sparse patterns, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy (a hedged block-sparse sketch follows after this list).
Tensors and Dynamic neural networks in Python with strong GPU acceleration
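
As noted in the RWKV item above, the attraction is a recurrence that runs step by step at inference time yet can still be evaluated over a whole sequence in parallel for training. The sketch below is only a toy illustration of that general idea using a plain linear recurrence; it is not RWKV's actual time-mixing formulation, and the function names are made up for this example.

```python
# Toy illustration of a recurrence with both a sequential (RNN-style)
# and a parallel (training-friendly) evaluation. This is a generic
# linear recurrence, NOT RWKV's actual equations.
import numpy as np

def sequential_scan(x, a):
    """h_t = a * h_{t-1} + x_t, evaluated one step at a time (inference-style)."""
    h = np.zeros(x.shape[1])
    out = np.empty_like(x)
    for t in range(x.shape[0]):
        h = a * h + x[t]
        out[t] = h
    return out

def parallel_scan(x, a):
    """Same recurrence unrolled to h_t = sum_{s<=t} a^(t-s) * x_s,
    so the whole sequence is computed in one matrix product (training-style)."""
    T = x.shape[0]
    powers = a ** (np.arange(T)[:, None] - np.arange(T)[None, :])  # a^(t-s)
    weights = np.tril(powers)                                      # causal: s <= t
    return weights @ x

x = np.random.default_rng(1).standard_normal((8, 4))
a = 0.9
print(np.allclose(sequential_scan(x, a), parallel_scan(x, a)))  # True
```

Both paths produce identical outputs; the sequential form needs O(1) state per step, while the unrolled form exposes the whole sequence to parallel hardware at once, which is the trade-off the RWKV description is pointing at.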
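The MInference item above mentions approximate, dynamic sparse attention for pre-filling. The sketch below is a rough NumPy illustration of block-sparse causal attention with a made-up "sink block + recent blocks" pattern; it is not MInference's actual method, just a sketch of why skipping most key blocks cuts prefill cost.

```python
# Minimal sketch of block-sparse causal attention for pre-filling.
# The sparsity pattern here (first block + a few recent blocks) is a
# made-up illustration, NOT the pattern MInference actually builds.
import numpy as np

def block_sparse_prefill_attention(q, k, v, block=64, local=4):
    n, d = q.shape
    out = np.zeros_like(v)
    n_blocks = (n + block - 1) // block
    for qb in range(n_blocks):
        q_lo, q_hi = qb * block, min((qb + 1) * block, n)
        # keep the "sink" block 0 and the `local` most recent key blocks
        kept_blocks = sorted({0} | set(range(max(0, qb - local + 1), qb + 1)))
        k_idx = np.concatenate([
            np.arange(kb * block, min((kb + 1) * block, n)) for kb in kept_blocks
        ])
        scores = q[q_lo:q_hi] @ k[k_idx].T / np.sqrt(d)
        # causal mask within the gathered keys
        causal = k_idx[None, :] > np.arange(q_lo, q_hi)[:, None]
        scores = np.where(causal, -np.inf, scores)
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        out[q_lo:q_hi] = weights @ v[k_idx]
    return out

# Toy usage: 512 tokens, 32-dim head.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((512, 32)) for _ in range(3))
print(block_sparse_prefill_attention(q, k, v).shape)  # (512, 32)
```

With block=64 and local=4, each query block touches at most five key blocks instead of all of them, which is where the latency savings in this family of schemes come from.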