Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Suitable tile size for GeForce RTX 2080 #19

Open
Shikherneo2 opened this issue Aug 11, 2020 · 0 comments
Open

Suitable tile size for GeForce RTX 2080 #19

Shikherneo2 opened this issue Aug 11, 2020 · 0 comments

Comments

@Shikherneo2
Copy link

Great work putting this together!

I am trying to run PRNN on a GeForce RTX 2080(46 SM, 7.7 compute).
I've tried the following tile sizes,
TileConfig<24, 1152, 1152, 192, 288, 6, 36, direction, T>
TileConfig<32, 1024, 1024, 128, 256, 4, 32, direction, T>
TileConfig<32, 1024, 1024, 64, 512, 1, 32, direction, T>
TileConfig<40, 640, 640, 80, 128, 5, 4, direction, T>

Running benchmark using any of these with batchsize=4, timesteps=20, and layer sizes max for each tile configuration, the fastest I can get is 0.00478542 TFLOPS/s in the forward run.

Are the tile sizes inappropriate or is the issue something else.

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant