
Filtering by block doesn't consider cross-block dependencies for metrics #21

Open
skyreflectedinmirrors opened this issue Nov 10, 2022 · 4 comments
Labels: bug (Something isn't working), Profiling (Related to the profiling done in Omniperf)

@skyreflectedinmirrors
Contributor

Specifically, we noticed this while trying to collect the coalescing metric (which lives in the TCP section):

https://github.com/AMDResearch/omniperf/blob/62d130b458a21a2c964da234cf7a24420e01efe1/src/omniperf_cli/configs/gfx90a/1600_L1_cache.yaml#L20

but its expression uses values from the TA block (i.e., TA_TOTAL_WAVEFRONTS_sum).

So, if a user does:

```shell
omniperf profile -b TCP -n bar -- <foo>
omniperf analyze -p workloads/bar/mi200
```

the resulting Buffer Coalescing value in the L1 section will be empty.
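To illustrate the shape of the problem (a simplified, hypothetical sketch; the real definition is at the link above, and the key names and formula here are illustrative, not the actual config):

```yaml
# Hypothetical sketch, NOT the actual 1600_L1_cache.yaml: a metric in the
# L1/TCP section whose expression references a counter from the TA block.
Buffer Coalescing:
  # TA_TOTAL_WAVEFRONTS_sum belongs to the TA block, so profiling with
  # `-b TCP` never collects it and the metric evaluates to empty.
  expr: AVG(TA_TOTAL_WAVEFRONTS_sum / TCP_TOTAL_ACCESSES_sum)  # illustrative formula
```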

@coleramos425
Collaborator

Ah, good catch. Thanks for reporting this.

We'll have to refine the logic for IP block filtering to account for metrics that reference counters from other blocks, such as this one. We'll add this to the next release.
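One way to sketch that refinement (a hypothetical illustration, not Omniperf's actual implementation): scan the metric expressions for counter names, map each counter's prefix to its IP block, and expand the user's `-b` filter with any extra blocks found. The prefix-to-block mapping below is an assumption for illustration only.

```python
import re

# Assumed prefix -> IP-block mapping, for illustration only.
PREFIX_TO_BLOCK = {"TA": "TA", "TCP": "TCP", "TD": "TD", "SQ": "SQ", "SQC": "SQC"}

# Counters look like TA_TOTAL_WAVEFRONTS_sum: the uppercase prefix before
# the first underscore identifies the hardware block.
COUNTER_RE = re.compile(r"\b([A-Z]+)_[A-Za-z0-9_]+")

def blocks_needed(metric_exprs, requested_blocks):
    """Return the set of IP blocks required to evaluate the given metrics."""
    needed = set(requested_blocks)
    for expr in metric_exprs:
        for prefix in COUNTER_RE.findall(expr):
            if prefix in PREFIX_TO_BLOCK:
                needed.add(PREFIX_TO_BLOCK[prefix])
    return needed
```

With the Buffer Coalescing example, filtering on TCP would then transparently pull in TA as well, so the referenced counter is still collected.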

@coleramos425 coleramos425 added this to the v1.0.5 milestone Nov 11, 2022
@coleramos425 coleramos425 self-assigned this Nov 11, 2022
@coleramos425 coleramos425 added bug Something isn't working Profiling Related to the profiling done in Omniperf labels Nov 11, 2022
@coleramos425
Collaborator

Adding this to a future milestone. IP Block, dispatch, and kernel filtering are going to be overhauled when we introduce alternative profiling to users.

This alternative profiling option will introduce a single output CSV, where organizing logical IP blocks is much easier. It will also eliminate the issue we have with metrics that use counters from different blocks, like our Memory Chart. A similar issue is described below.

The issue was that these metrics used SQ_ACCUM_PREV_HIRES, a counter generated in several IP blocks, so we needed to specify which CSV to pull the counter from in the .yaml configs.

Another issue exists with these two cache latencies:

[image: screenshot of the two cache-latency metrics]

The expressions for these metrics use counters from two IP blocks, e.g.:

`L1D Cache Latency = AVG(SQ_ACCUM_PREV_HIRES [from SQ_IFETCH_LEVEL] / SQC_DCACHE_REQ [from pmc_perf])`

The coll_level fix used above won't work for these, as two different CSVs would need to be specified, and coll_level only lets us specify one. We could either:

- Reorder performance counters in the rocprof perfmon config
- Modify the CLI tool's coll_level implementation
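A sketch of what the second option might look like (hypothetical, not the actual coll_level code): instead of a single collection-level file per metric, map each counter to the CSV it was collected in, so one metric expression can mix counters from different files. The file names follow the comment above; the table layout is an assumption for illustration.

```python
# Assumed counter -> source-CSV mapping, per the expression quoted above.
COUNTER_SOURCE = {
    "SQ_ACCUM_PREV_HIRES": "SQ_IFETCH_LEVEL.csv",  # accumulate collection
    "SQC_DCACHE_REQ": "pmc_perf.csv",              # default collection
}

def resolve(counter, tables):
    """Look up a counter value in the table loaded from its source CSV.

    tables: {csv_name: {counter_name: value}} -- one aggregated row per file.
    """
    return tables[COUNTER_SOURCE[counter]][counter]

def l1d_cache_latency(tables):
    # Follows the L1D Cache Latency expression quoted above; the AVG over
    # kernels is omitted for brevity.
    return resolve("SQ_ACCUM_PREV_HIRES", tables) / resolve("SQC_DCACHE_REQ", tables)
```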

@coleramos425 coleramos425 modified the milestones: v1.0.5, v1.0.6 Dec 12, 2022
@koomie koomie modified the milestones: v1.0.6, v.1.0.7 Dec 21, 2022
@coleramos425 coleramos425 modified the milestones: v.1.0.7, v.1.1.0 Feb 21, 2023
@coleramos425 coleramos425 modified the milestones: v.1.0.9, v1.1.0 Aug 14, 2023
@ppanchad-amd

Closing since ticket is no longer relevant. Thanks!

@ppanchad-amd ppanchad-amd closed this as not planned Oct 4, 2024
@skyreflectedinmirrors
Contributor Author

This is definitely still relevant.
