Allow profiling ranks in string format with comma as separator by juntaowww · Pull Request #855 · NVIDIA/cloudai

juntaowww · 2026-03-31T13:50:57Z

Summary

Allow profiling ranks in string format with comma as separator. The list format in cmd_args would make it DSE target.

Test Plan

manual run

coderabbitai · 2026-03-31T13:53:27Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: ec3b3454-94e5-4c30-9696-3901f9e94ae7

📥 Commits

Reviewing files that changed from the base of the PR and between 89287e8 and 8054958.

📒 Files selected for processing (1)

tests/workloads/megatron_bridge/test_command_gen_strategy_slurm.py

📝 Walkthrough

Walkthrough

Updated the MegatronBridgeCmdArgs.profiling_ranks field type to accept str in addition to int and List[int], and added a Field description documenting comma-separated rank IDs (e.g. "0,4,8"). Added a unit test verifying string-format input is propagated into generated Slurm wrapper content.

Changes

Cohort / File(s)	Summary
Megatron Bridge Configuration `src/cloudai/workloads/megatron_bridge/megatron_bridge.py`	Expanded `profiling_ranks` field type from `Optional[Union[int, List[int]]]` to `Optional[Union[int, str, List[int]]]` and updated the `Field(...)` description to document comma-separated string input support.
Unit Tests — Megatron Bridge `tests/workloads/megatron_bridge/test_command_gen_strategy_slurm.py`	Added `test_profiling_ranks_string_format` to assert that a string `"0,1,2,3"` for `profiling_ranks` with `enable_nsys` produces a Slurm wrapper containing `--profiling_ranks 0,1,2,3` and that the `TestRun` is not a DSE job before generation.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 I nibble on commas, hop in delight,

"0,1,2,3" shines in the wrapper's light,
No more strict ints to bind my paws,
Flexibility hops in, without a pause.

🚥 Pre-merge checks | ✅ 2

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately describes the main change: allowing profiling ranks to be specified as a comma-separated string format in addition to existing formats.
Description check	✅ Passed	The description is related to the changeset, explaining the motivation (avoiding DSE target designation) and test approach, though it lacks technical depth.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/cloudai/workloads/megatron_bridge/megatron_bridge.py`:
- Line 116: The type annotation for the profiling_ranks Field should use modern
PEP 604 syntax: replace Optional[Union[int, str, List[int]]] with int | str |
list[int] | None for clarity and consistency; remove any now-unused typing
imports (Optional, Union, List) or update imports accordingly and ensure the
annotation is applied on the profiling_ranks Field definition in
megatron_bridge.py.
- Around line 116-119: Add a Pydantic field validator for profiling_ranks to
validate and normalize comma-separated string input: when profiling_ranks is a
str, ensure it matches only digits separated by commas (e.g. "0,4,8"), split and
convert to a list[int] (or raise a ValidationError on invalid format); keep
existing support for int and list[int]. Attach the validator to the
profiling_ranks field (use the model's validator/@field_validator for
"profiling_ranks") so downstream command generation always receives a validated
list[int] or a single int.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 21a82a8b-87a4-4cbf-bc66-1a1d6a2de461

📥 Commits

Reviewing files that changed from the base of the PR and between f0ffef9 and 89287e8.

📒 Files selected for processing (1)

src/cloudai/workloads/megatron_bridge/megatron_bridge.py

src/cloudai/workloads/megatron_bridge/megatron_bridge.py

podkidyshev · 2026-04-01T10:06:47Z

@juntaowww let me know when to merge

juntaowww · 2026-04-01T10:08:08Z

good to merge now, thanks!

Allow profiling ranks in string format with comma as separator

89287e8

juntaowww requested review from amaslenn, jeffnvidia and srivatsankrishnan as code owners March 31, 2026 13:50

coderabbitai bot reviewed Mar 31, 2026

View reviewed changes

src/cloudai/workloads/megatron_bridge/megatron_bridge.py Show resolved Hide resolved

src/cloudai/workloads/megatron_bridge/megatron_bridge.py Show resolved Hide resolved

amaslenn requested review from podkidyshev and removed request for amaslenn March 31, 2026 14:21

podkidyshev reviewed Mar 31, 2026

View reviewed changes

src/cloudai/workloads/megatron_bridge/megatron_bridge.py Show resolved Hide resolved

Add a test for string format of profiling ranks

8054958

podkidyshev approved these changes Apr 1, 2026

View reviewed changes

podkidyshev merged commit 4ea5899 into NVIDIA:main Apr 1, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow profiling ranks in string format with comma as separator#855

Allow profiling ranks in string format with comma as separator#855
podkidyshev merged 2 commits intoNVIDIA:mainfrom
juntaowww:nsys_ranks

juntaowww commented Mar 31, 2026

Uh oh!

coderabbitai bot commented Mar 31, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

podkidyshev commented Apr 1, 2026

Uh oh!

juntaowww commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

juntaowww commented Mar 31, 2026

Summary

Test Plan

Uh oh!

coderabbitai bot commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

podkidyshev commented Apr 1, 2026

Uh oh!

juntaowww commented Apr 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coderabbitai bot commented Mar 31, 2026 •

edited

Loading