Kernels to add #129

Open
2 of 7 tasks
MrBurmark opened this issue Jun 7, 2021 · 2 comments
@MrBurmark
Member

MrBurmark commented Jun 7, 2021

List of kernels to add to the perf suite:

int scan = 0;
for (int i = ibegin; i < iend; ++i) {
  if (x[i] > 0.5) {
    list[scan++] = i;
  }
}
int len = scan;
@rhornung67
Member

I believe we should be able to do baseline variants of scan kernels for OpenMP and OpenMP target (as of OpenMP 5.0). Is there a reasonable way to do baseline variants of scan kernels for GPU back-ends?
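For illustration, here is a minimal sketch of what an OpenMP baseline of the index-list loop above might look like with the OpenMP 5.0 `scan` directive. The function name `index_list_omp_scan` and the `INDEXLIST_OMP_*` macros are hypothetical, not names from the perf suite. The version check is deliberately conservative: a compiler that does not report OpenMP 5.0 simply runs the loop sequentially, which gives the same result.

```cpp
#include <vector>

// Enable the scan pragmas only when the compiler reports OpenMP 5.0 or newer
// (_OPENMP >= 201811). Otherwise the macros expand to nothing and the loop
// below runs sequentially. These macro names are made up for this sketch.
#if defined(_OPENMP) && (_OPENMP >= 201811)
#define INDEXLIST_OMP_LOOP _Pragma("omp parallel for reduction(inscan, +:count)")
#define INDEXLIST_OMP_SCAN _Pragma("omp scan exclusive(count)")
#else
#define INDEXLIST_OMP_LOOP
#define INDEXLIST_OMP_SCAN
#endif

// Hypothetical helper: writes every index i in [ibegin, iend) with x[i] > 0.5
// into list, in increasing order, and returns how many indices were written.
int index_list_omp_scan(const double* x, int* list, int ibegin, int iend)
{
  int count = 0;
  INDEXLIST_OMP_LOOP
  for (int i = ibegin; i < iend; ++i) {
    // Scan phase: count holds the number of selected indices before i,
    // which is exactly the output slot for index i.
    if (x[i] > 0.5) {
      list[count] = i;
    }
    INDEXLIST_OMP_SCAN
    // Input phase: this iteration's contribution to the prefix sum.
    count += (x[i] > 0.5) ? 1 : 0;
  }
  return count;  // total number of selected indices
}
```

With an exclusive scan, the statements before the directive read the prefix value and the statements after it supply the input, so the body keeps the same shape as the sequential loop.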

@MrBurmark
Member Author

Good call @rhornung67, scan was added in OpenMP 5.0, so we can conditionally add a baseline for OpenMP. We can use CUB and rocPRIM for the baseline variants of the CUDA and HIP scan kernels.
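As a rough sketch of how the index-list loop could map onto a library scan, the same result can be computed in three passes: flag, exclusive scan, scatter. The version below is a host-side C++17 stand-in that uses `std::exclusive_scan`. It is not CUB or rocPRIM code, and the function name `index_list_three_pass` is hypothetical. A GPU baseline would presumably replace the middle pass with a device-wide exclusive scan from the vendor library and run the other two passes as simple kernels.

```cpp
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical helper: same output as the sequential index-list loop,
// restructured so that the only loop-carried dependence sits in one scan.
int index_list_three_pass(const double* x, int* list, int ibegin, int iend)
{
  if (iend <= ibegin) {
    return 0;
  }
  const std::size_t n = static_cast<std::size_t>(iend - ibegin);

  // Pass 1: flag each index that passes the test (independent per element).
  std::vector<int> flags(n);
  for (std::size_t k = 0; k < n; ++k) {
    flags[k] = (x[ibegin + k] > 0.5) ? 1 : 0;
  }

  // Pass 2: an exclusive prefix sum of the flags gives each selected index
  // its output slot. This is the step a library scan would take over.
  std::vector<int> offsets(n);
  std::exclusive_scan(flags.begin(), flags.end(), offsets.begin(), 0);

  // Pass 3: scatter the selected indices to their slots (independent again).
  for (std::size_t k = 0; k < n; ++k) {
    if (flags[k] != 0) {
      list[offsets[k]] = ibegin + static_cast<int>(k);
    }
  }

  // Total count = last exclusive offset plus the last flag.
  return offsets[n - 1] + flags[n - 1];
}
```

The trade-off in this form is two temporary arrays of length `iend - ibegin` in exchange for passes 1 and 3 having no dependence between iterations.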
