Skip to content

Tags: codecat-he/cutlass

Tags

v3.2.2

Toggle v3.2.2's commit message
Doc updates for 3.2.2

v3.2.1

Toggle v3.2.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix Parallel Split-K on Gemm Operation Profiler (NVIDIA#1109)

* Debug and fix for parallel split-k in profiler

* restore debug files and remove prints

v3.2.0

Toggle v3.2.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Add simple hash and eq methods for gemm_operations. (NVIDIA#1053)

v3.1.0

Toggle v3.1.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Update README.md

v3.0.0

Toggle v3.0.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Updates for 3.0 (NVIDIA#857)

Co-authored-by: Aniket Shivam <[email protected]>

v2.11.0

Toggle v2.11.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
New updates for 2.11 (NVIDIA#775)

* New updates.

* Minor profiler updates

Co-authored-by: Aniket Shivam <[email protected]>

v2.10.0

Toggle v2.10.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
CUTLASS 2.10 bug fixes and minor updates. (NVIDIA#626)

v2.9.1

Toggle v2.9.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Update linear_combination_generic.h (NVIDIA#472)

add `skip_elementwise_` to support serial splitk in linear_combination_generic.h`

v2.9.0

Toggle v2.9.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Update CMakeLists.txt (NVIDIA#473)

* Update CMakeLists.txt

Add 128bit int support if using nvc++ to solve NVIDIA#310 

@jeffhammond, would you please give it a try?

* Update CMakeLists.txt

correct copy paste error

v2.8.0

Toggle v2.8.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Updated GEMM performance plot with CUTLASS 2.8 compiled with CUDA 11.…

…5 Toolkit (NVIDIA#375)

Updated GEMM performance plot with CUTLASS 2.8 compiled using CUDA 11.5 Toolkit.

GPUs under test:

    NVIDIA A100
    NVIDIA A2
    NVIDIA TitanV
    NVIDIA GeForce 2080 Ti