Tags: causten/AMDMIGraphX
MLIR MIOpen Dialect integration (phase 1) (ROCm#768) (ROCm#769)
* MLIR MIOpen Dialect integration (phase 1) (ROCm#768)
* Added Findmlir.cmake (using environment variables to import)
* Added mlir_conv pass to GPU target
* Apply to any gpu::convolution if supported by MLIR
* Call MLIR C-API to generate iGEMM kernel with configuration from gpu::convolution
* Capture binary in dictionary for matching convolutions
* Build a code_object_op with the binary and execution dimensions
* Substitute for the gpu::convolution
* Changed the parameters for the code_object to reflect the generated MLIR kernel
* Expanded out MemRefDescriptor fields in param list
* Also updated for MLIR C-API changes
* fixed global_size calculation
* Added command line option: --enable_mlir
* fixed command line switch
* updated for new MLIR API changes
* Added cget llvm-project-mlir to import MIIR API libraries into Dockerfile
* removed cmake Findmlir
* updated for changes in MIIR C-API
* updated CMakeLists.txt to allow disable of MLIR import
* fixed memory leaks and removed copies
* updated for 5D memrefs
* formatting
* fixed review comments
* fixed merge issues
* hip gcnDeviceName now includes specifiers at the end
* use major/minor values instead
* disable MLIR by default
* removed command-line switch --enable-mlir
* fix unused when MLIR disabled
* enable jenkins enable/test MLIR
* format
* fixed clang-tidy
* added new type
Co-authored-by: Paul Fultz II
Co-authored-by: mvermeulen
Bug rocblas int8 (ROCm#767)
* fix the flag in rocblas api for int8 data type
* used different flag for different rocblas versions
* clang format
Co-authored-by: mvermeulen
Selu operator (ROCm#642)
* code backup
* clang format
* support for selu operator
* clang format
* added an onnx unit test for selu
* clang format
* add more unit tests for the selu operation
Co-authored-by: mvermeulen
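For reference, the SELU function as the ONNX operator specification defines it can be sketched in a few lines. This is a standalone illustration of the math only, not the MIGraphX implementation; the names `selu`, `ALPHA`, and `GAMMA` are chosen here for illustration, and the constants are the ONNX default attribute values.

```python
import math

# ONNX default attribute values for the Selu operator.
ALPHA = 1.67326319217681884765625
GAMMA = 1.05070102214813232421875


def selu(x, alpha=ALPHA, gamma=GAMMA):
    """Scalar SELU: gamma * x for x > 0, gamma * alpha * (exp(x) - 1) otherwise."""
    if x > 0:
        return gamma * x
    return gamma * alpha * (math.exp(x) - 1.0)


def selu_list(values, alpha=ALPHA, gamma=GAMMA):
    """Apply SELU elementwise to a flat list of numbers."""
    return [selu(v, alpha, gamma) for v in values]
```

For large negative inputs the output saturates near `-ALPHA * GAMMA`, and positive inputs are simply scaled by `GAMMA`.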
Add load/save function for program (ROCm#623)
* Add save/load functions
* Formatting
* Add loading and saving to the driver
* Formatting
* Add return
* Serialize the context with the program
* Formatting
* Add python API
* Formatting
* Add c/c++ apis
* Formatting
* Add tests
* Formatting
* Fix tidy error
* Fix python doc
* Restore python code
* Add function name to errors
* Formatting
* Use lvalue for writing
* Serialize context
* Fix convolution and pooling operator for miopen
* Formatting
* Add const ref
* Set target name to gpu
* Add target tests
* Formatting
* Move register target to cpp file
* Fix target test
* Use make_target in driver
* Formatting
* Use make_target for the API
* Formatting
* Add cpu include
* Increase timeout
* Add more tests
* Formatting
Co-authored-by: Shucai Xiao
Co-authored-by: mvermeulen
Optimize multiply across slices (ROCm#568)
* Add initial optimization when using a mul over a sliced convolution
* Formatting
* Add more tests
* Formatting
* Convert to an assert
* Check if used once
* Formatting
* Add test with horiz fusion
* Formatting
* Optimize nested slice
* Formatting
* Fix test
* Add const refs
* Remove unnecessary assert
Co-authored-by: mvermeulen
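One plausible reading of "a mul over a sliced convolution" is that a per-channel multiply applied to a slice of a convolution's output channels can be folded into the matching rows of the weights. The sketch below checks that identity on a tiny 1x1 convolution in plain Python. It is an assumption-laden illustration of the algebra, not the MIGraphX pass; all helper names (`conv1x1`, `slice_channels`, `mul_channels`) are hypothetical.

```python
def conv1x1(x, w):
    """1x1 convolution: x is a list of pixels (each a list of input channels),
    w[k][c] is the weight from input channel c to output channel k."""
    return [[sum(wk[c] * px[c] for c in range(len(px))) for wk in w] for px in x]


def slice_channels(y, start, stop):
    """Keep output channels start..stop-1 of every pixel."""
    return [px[start:stop] for px in y]


def mul_channels(y, scale):
    """Multiply each channel of every pixel by its per-channel constant."""
    return [[v * s for v, s in zip(px, scale)] for px in y]


x = [[1, 2], [3, 4], [5, 6]]          # 3 pixels, 2 input channels
w = [[1, 0], [2, 1], [0, 3], [1, 1]]  # 4 output channels
scale = [10, 100]                     # constants for the sliced channels
start, stop = 1, 3

# Original graph: conv -> slice -> mul
before = mul_channels(slice_channels(conv1x1(x, w), start, stop), scale)

# Rewritten graph: scale the matching weight rows, then conv -> slice
w_folded = [row[:] for row in w]
for k, s in zip(range(start, stop), scale):
    w_folded[k] = [v * s for v in w_folded[k]]
after = slice_channels(conv1x1(x, w_folded), start, stop)

assert before == after
```

Because the constants move into the weights, the multiply disappears from the runtime graph, which is presumably also why the commit checks that the slice is used only once before rewriting.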