forked from jlgreathouse/BabelStream
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Shut up a useless NVCC compiler warning about 'useless comparison against 0' when it sees a constant n in the reducer class template. CUDA dotprod didn't work with num workgroups == 256, so fix it up to base its number of workgroups on the input size. To get performance roughly equzl to what it was before, increase the number of values handled by each thread.
- Loading branch information
1 parent
d2a2e2e
commit f2f65c4
Showing
2 changed files
with
22 additions
and
21 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters