add esmf summary parser by edoyango · Pull Request #12 · ACCESS-NRI/access-profiling

edoyango · 2025-10-09T00:21:22Z

This add's a parser for esmf's text summary which can be parsed "flat" or hierarchically. We probably only need flat for now, but the latter might be useful at some point.

src/access/profiling/esmf_parser.py

tests/test_esmf_summary_parser.py

micaeljtoliveira · 2025-10-09T02:37:20Z

@edoyango I would suggest you activate the pre-commit hook in your development environment. It will likely save you some time.

micaeljtoliveira

@edoyango Thanks for the changes. All good from my side.

manodeep · 2025-10-09T22:34:37Z

@edoyango The linter check is failing - will you please fix that. I am looking through the rest of the code in the meantime

manodeep

I made some suggestions trying to reduce the hardcoding. This looks quite substantial and great overall.

The other bit would be that the code quality checks for the linter and coverage need to pass

src/access/profiling/esmf_parser.py

manodeep · 2025-10-09T22:52:44Z

tests/test_esmf_summary_parser.py

+    # check that all keys in correct_dict are in input_dict
+    if input_dict.keys() < correct_dict.keys():
+        raise ValueError(f"Missing keys for {region} (depth: {depth}): {set(correct_dict.keys()) - input_dict.keys()}")
+    extra_keys = set(correct_dict.keys()) - metric_keys


What happens if metric_keys contains extra keys? (Is that even possible for valid data?)

metric_keys is the expected "metric" keys in both input_dict and correct_dict. The function will error if input_dict and correct_dict don't have all the keys in metric_keys. extra_keys would be the region keys.

renamed extra_keys to region_keys so hopefully it's a bit clearer what they are

minghangli-uni

Sorry I haven’t looked into the code yet, but I’d suggest keeping only hierarchical, since some region names might appear in multiple locations (such as in Runphase or in Init). Otherwise it could lead to unexpected results.

edoyango · 2025-10-10T00:22:17Z

some region names might appear in multiple locations (such as in Runphase or in Init). Otherwise it could lead to unexpected results.

Are you talking about regions like [ensemble] RunPhase1, [ESM0001] RunPhase1 etc.? In the OM3 output I saw RunPhase1 many times, but they seemed to be prefixed uniquely. Are you saying that they could also have the same prefix too? Do you have an example handy?

minghangli-uni · 2025-10-10T00:29:35Z

For example, [ATM-TO-MED] RunPhase1 exists in both [ensemble] Init 1 and [ensemble] RunPhase1.

micaeljtoliveira · 2025-10-10T00:36:27Z

For example, [ATM-TO-MED] RunPhase1 exists in both [ensemble] Init 1 and [ensemble] RunPhase1.

I think that's fine. It's very common case when profiling a code that the same block of code is called from different places, with different data and, therefore, with very different timings. The hierarchical data gives you more information, which is good, but the flat one is still useful in many cases.

edoyango · 2025-10-10T00:36:38Z

True. Thanks for pointing out the issue.

@manodeep @micaeljtoliveira to either of you have a suggestion for how to handle this? I'm thinking for the flat case, we simply add a _N suffix for now. We could add the bracketed prefixes, but that would probably be too verbose since there's around 10 levels?

micaeljtoliveira · 2025-10-10T00:40:33Z

@manodeep @micaeljtoliveira to either of you have a suggestion for how to handle this? I'm thinking for the flat case, we simply add a _N suffix for now. We could add the bracketed prefixes, but that would probably be too verbose since there's around 10 levels?

As I wrote above, I think it's fine to aggregate calls to the same region but from different places in the call-tree together when working with the flat case. We loose some information, but what is left can still be useful.

codecov · 2025-10-10T00:54:05Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (815072f) to head (0d6b41c).
⚠️ Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff            @@
##              main       #12   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files            9        10    +1     
  Lines          285       345   +60     
=========================================
+ Hits           285       345   +60

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

edoyango · 2025-10-14T06:16:31Z

some region names might appear in multiple locations (such as in Runphase or in Init). Otherwise it could lead to unexpected results.

@minghangli-uni @micaeljtoliveira @manodeep I've added some code to aggregate regions that occur more than one, if PETs and PEs are the same (errors if they're not). This assumes regions with identical names is referring to the same code. Can you all have a look before I merge?

micaeljtoliveira

@edoyango LGTM, just one very small thing to fix.

src/access/profiling/esmf_parser.py

minghangli-uni

LGTM, thanks @edoyango

Co-authored-by: Manodeep Sinha <manodeep@gmail.com> Co-authored-by: Micael Oliveira <micael.oliveira@anu.edu.au>

edoyango requested review from manodeep and micaeljtoliveira October 9, 2025 00:21

edoyango force-pushed the esmf-summary branch from 52c7814 to 8cd05fa Compare October 9, 2025 01:47

micaeljtoliveira reviewed Oct 9, 2025

View reviewed changes

src/access/profiling/esmf_parser.py Outdated Show resolved Hide resolved

tests/test_esmf_summary_parser.py Show resolved Hide resolved

micaeljtoliveira approved these changes Oct 9, 2025

View reviewed changes

manodeep reviewed Oct 9, 2025

View reviewed changes

minghangli-uni reviewed Oct 10, 2025

View reviewed changes

edoyango force-pushed the esmf-summary branch from 23b7ac7 to c3076ef Compare October 10, 2025 00:29

edoyango force-pushed the esmf-summary branch from f9f00a9 to df34da1 Compare October 10, 2025 00:53

edoyango force-pushed the esmf-summary branch from df34da1 to dc77c57 Compare October 10, 2025 05:06

micaeljtoliveira approved these changes Oct 14, 2025

View reviewed changes

src/access/profiling/esmf_parser.py Outdated Show resolved Hide resolved

minghangli-uni approved these changes Oct 14, 2025

View reviewed changes

add esmf summary parser

0d6b41c

Co-authored-by: Manodeep Sinha <manodeep@gmail.com> Co-authored-by: Micael Oliveira <micael.oliveira@anu.edu.au>

edoyango force-pushed the esmf-summary branch from fcd4d96 to 0d6b41c Compare October 14, 2025 22:27

edoyango merged commit 580517e into main Oct 14, 2025
8 checks passed

edoyango deleted the esmf-summary branch October 14, 2025 22:46

Conversation

edoyango commented Oct 9, 2025

Uh oh!

Uh oh!

Uh oh!

micaeljtoliveira commented Oct 9, 2025

Uh oh!

micaeljtoliveira left a comment

Choose a reason for hiding this comment

Uh oh!

manodeep commented Oct 9, 2025

Uh oh!

manodeep left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

manodeep Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

edoyango Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

edoyango Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

minghangli-uni left a comment

Choose a reason for hiding this comment

Uh oh!

edoyango commented Oct 10, 2025

Uh oh!

minghangli-uni commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

micaeljtoliveira commented Oct 10, 2025

Uh oh!

edoyango commented Oct 10, 2025

Uh oh!

micaeljtoliveira commented Oct 10, 2025

Uh oh!

codecov bot commented Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

edoyango commented Oct 14, 2025

Uh oh!

micaeljtoliveira left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

minghangli-uni left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

minghangli-uni commented Oct 10, 2025 •

edited

Loading

codecov bot commented Oct 10, 2025 •

edited

Loading