Tags: causten/AMDMIGraphX

Tags

rocm-4.3.0

MLIR MIOpen Dialect integration (phase 1) (ROCm#768) (ROCm#769)

* MLIR MIOpen Dialect integration (phase 1) (ROCm#768)

* Added Findmlir.cmake (using environment variables to import)

* Added mlir_conv pass to GPU target

  * Apply to any gpu::convolution if supported by MLIR

  * Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution

  * Capture binary in dictionary for matching convolutions

  * Build a code_object_op with the binary and execution dimensions

  * Substitute for the gpu::convolution

* Changed the parameters for the code_object to reflect the generated MLIR kernel

* Expanded out MemRefDescriptor fields in param list

* Also updated for MLIR C-API changes

* fixed global_size calculation

* Added command line option: --enable_mlir

* fixed command line switch

* updated for new MLIR API changes

* Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile
  * removed cmake Findmlir

* updated for changes in MIIR C-API

* updated CMakeLists.txt to allow disabling the MLIR import

* fixed memory leaks and removed copies

* updated for 5D memrefs

* formatting

* fixed review comments

* fixed merge issues

* hip gcnDeviceName now includes specifiers at the end
  * use major/minor values instead

* disable MLIR by default

* removed command-line switch --enable-mlir

* fix unused when MLIR disabled

* enable jenkins enable/test MLIR

* format

* fixed clang-tidy

* added new type

Co-authored-by: Paul Fultz II
Co-authored-by: mvermeulen

rocm-4.1.0

ROCm 4.1 release

rocm-4.2.0

Bug rocblas int8 (ROCm#767)

* fix the flag in rocblas api for int8 data type

* used different flag for different rocblas versions

* clang format

Co-authored-by: mvermeulen

rocm-4.0.0

Selu operator (ROCm#642)

* code backup

* clang format

* support for selu operator

* clang format

* added an onnx unit test for selu

* clang format

* add more unit tests for the selu operation

Co-authored-by: mvermeulen
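
For reference, the SELU activation that this commit adds support for can be sketched as below. This is a minimal illustration in Python, assuming the default `alpha` and `gamma` constants from the ONNX Selu operator definition; the function name `selu` is hypothetical and this is not MIGraphX's implementation.

```python
import math

# Default constants from the ONNX Selu operator definition (assumed here).
ALPHA = 1.67326319217681884765625
GAMMA = 1.05070102214813232421875


def selu(x, alpha=ALPHA, gamma=GAMMA):
    """Scaled exponential linear unit for a single float (illustrative only)."""
    if x > 0:
        # Positive inputs are scaled linearly.
        return gamma * x
    # Non-positive inputs follow a scaled exponential curve.
    return gamma * alpha * (math.exp(x) - 1.0)
```

Calling `selu(0.0)` returns `0.0`, and negative inputs saturate towards `-gamma * alpha`.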

rocm-3.10.0

Selu operator (ROCm#642)

* code backup

* clang format

* support for selu operator

* clang format

* added an onnx unit test for selu

* clang format

* add more unit tests for the selu operation

Co-authored-by: mvermeulen

rocm-3.9.0

Selu operator (ROCm#642)

* code backup

* clang format

* support for selu operator

* clang format

* added an onnx unit test for selu

* clang format

* add more unit tests for the selu operation

Co-authored-by: mvermeulen

0.8

Add load/save function for program (ROCm#623)

* Add save/load functions

* Formatting

* Add loading and saving to the driver

* Formatting

* Add return

* Serialize the context with the program

* Formatting

* Add python API

* Formatting

* Add c/c++ apis

* Formatting

* Add tests

* Formatting

* Fix tidy error

* Fix python doc

* Restore python code

* Add function name to errors

* Formatting

* Use lvalue for writing

* Serialize context

* Fix convolution and pooling operator for miopen

* Formatting

* Add const ref

* Set target name to gpu

* Add target tests

* Formatting

* Move register target to cpp file

* Fix target test

* Use make_target in driver

* Formatting

* Use make_target for the API

* Formatting

* Add cpu include

* Increase timeout

* Add more tests

* Formatting

Co-authored-by: Shucai Xiao
Co-authored-by: mvermeulen

0.7

Optimize multiply across slices (ROCm#568)

* Add initial optimization when using a mul over a sliced convolution

* Formatting

* Add more tests

* Formatting

* Convert to an assert

* Check if used once

* Formatting

* Add test with horiz fusion

* Formatting

* Optimize nested slice

* Formatting

* Fix test

* Add const refs

* Remove unnecessary assert

Co-authored-by: mvermeulen

rocm-3.5.0

Fix the usage of move_dpp in HIP-Clang. (ROCm#544)

0.6

Add same padding mode for onnx (ROCm#456)

* fix pad calc

* add padding calc and test

* formatting

* made asym generic function

* formatting

Co-authored-by: mvermeulen
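
The "same" padding calculation mentioned in this commit can be sketched as below. This is a minimal illustration assuming the ONNX `auto_pad` semantics (`SAME_UPPER` and `SAME_LOWER`), where the output size is `ceil(input / stride)` and any odd leftover padding goes to one side, which is why the result can be asymmetric. The function name `same_padding` and its parameters are hypothetical and not taken from MIGraphX.

```python
def same_padding(in_size, kernel, stride=1, dilation=1, upper=True):
    """Return (pad_begin, pad_end) for one spatial axis (illustrative only)."""
    # Output size under "same" padding: ceil(in_size / stride).
    out_size = -(-in_size // stride)
    # Kernel extent once dilation is taken into account.
    effective_kernel = (kernel - 1) * dilation + 1
    # Total padding needed so that out_size windows fit over the input.
    total = max((out_size - 1) * stride + effective_kernel - in_size, 0)
    small = total // 2
    large = total - small
    # SAME_UPPER puts the extra element at the end, SAME_LOWER at the beginning.
    return (small, large) if upper else (large, small)
```

For an input of size 8 with a kernel of 3 and a stride of 2, one element of padding is needed in total, so `upper=True` pads only the end and `upper=False` pads only the beginning.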