Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add MPI awareness to Omniperf #170

Open
coleramos425 opened this issue Aug 25, 2023 · 0 comments
Open

Add MPI awareness to Omniperf #170

coleramos425 opened this issue Aug 25, 2023 · 0 comments
Labels
Omniperf Revamp Ticket related to the redesign of Omniperf
Milestone

Comments

@coleramos425
Copy link
Collaborator

Describe the suggestion
Add MPI awareness to Omniperf

Justification
Adding MPI awareness is something we've been meaning to address and is highly requested by users. If Omniperf is MPI aware we can also begin to implement some clever ways to reduce the computational load by distributed counter collection (multi gpu scenario).

We've been holding off on this because we wanted to do it right. This seems like an appropriate opportunity to tackle implementation.

Implementation

  1. Brute force approach would be to run our (~14x app replays) on each node. The profiling side of this method is straightforward, but post-processing could introduce some issues.
  2. Alternatively, we could split runs up across nodes, assuming the same kernels are being launched.

Additional Notes

  • A potential gotcha to consider is the number of MPI ranks we advertise as being supported. Launching hundreds of ranks introduces data processing difficulties due to the raw amount of data generated

Originally posted by @coleramos425 in #153 (comment)

@coleramos425 coleramos425 added the Omniperf Revamp Ticket related to the redesign of Omniperf label Aug 25, 2023
@coleramos425 coleramos425 added this to the v1.3.0 milestone Aug 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Omniperf Revamp Ticket related to the redesign of Omniperf
Projects
None yet
Development

No branches or pull requests

1 participant