-
Notifications
You must be signed in to change notification settings - Fork 555
Add best practice for warpSize handling #3790
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: docs/develop
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The discussion on warpSize at line 415 refers to "hardware features" but it seems like some mention of the fact that warpSize and Wavefront size are the same would be useful here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did some changes.
LGTM |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great. Thanks.
- Retrieves the warp size of the GPU (warpSizeHost) to determine the optimal | ||
kernel configuration. | ||
|
||
- Allocates device memory (`d_data`` for input, `d_results`` for block-wise |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The backticks around d_data
and d_results
are not properly matched.
accordingly, as shown in the following block reduce example. | ||
|
||
The ``block_reduce`` kernel has a template parameter for warpSize and reduction | ||
operation in two main phases: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The second part of the sentence seems to lack a verb for me, maybe something like "...and performs a reduction operation in two main phases:"?
Co-authored-by: Fabian Ritter <ritter.x2a@gmail.com>
Co-authored-by: Fabian Ritter <ritter.x2a@gmail.com>
No description provided.