You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Happy new year!
I wanna consult the question as the title. I only found that the shared memory ldsm read is done by MmaTensorOpMultiplicandTileIterator. To my knowledge, the shared memory write by swizzled layout w/o bank conflict should occur in RegularTileAccessIterator, which but I think did not implement it. So Could you pls guide me the code location where shared memory write by swizzled layout occurs in cutlass 2.x?
Thanks.
The text was updated successfully, but these errors were encountered:
What is your question?
Dear cutlass team,
Happy new year!
I wanna consult the question as the title. I only found that the shared memory ldsm read is done by MmaTensorOpMultiplicandTileIterator. To my knowledge, the shared memory write by swizzled layout w/o bank conflict should occur in RegularTileAccessIterator, which but I think did not implement it. So Could you pls guide me the code location where shared memory write by swizzled layout occurs in cutlass 2.x?
Thanks.
The text was updated successfully, but these errors were encountered: