Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[QST] The code location where the shared memory write by swizzled layout occurs in cutlass 2.x? #2019

Open
danielhua23 opened this issue Jan 1, 2025 · 3 comments

Comments

@danielhua23
Copy link

danielhua23 commented Jan 1, 2025

What is your question?

Dear cutlass team,

Happy new year!
I wanna consult the question as the title. I only found that the shared memory ldsm read is done by MmaTensorOpMultiplicandTileIterator. To my knowledge, the shared memory write by swizzled layout w/o bank conflict should occur in RegularTileAccessIterator, which but I think did not implement it. So Could you pls guide me the code location where shared memory write by swizzled layout occurs in cutlass 2.x?

Thanks.

@danielhua23
Copy link
Author

cc @hwu36 @jackkosaian

@hwu36
Copy link
Collaborator

hwu36 commented Jan 1, 2025

You are right that shared memory write is implemented in RegularTileAccessIterator

@danielhua23
Copy link
Author

Then Could you pls guide me which lines implemented the swizzling write?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants