Skip to content
This repository was archived by the owner on Jan 15, 2026. It is now read-only.

Commit 4abd481

Browse files
authored
Merge pull request #456 from zswerth/gpu_metrics_support
Add GPU metrics support for nsight profiling
2 parents f82ac5b + c5a2e9a commit 4abd481

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

launcher_scripts/nemo_launcher/core/stages.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -290,6 +290,9 @@ def _make_nsys_command_prefix(self, results_dir: str) -> str:
290290
f"--capture-range-end=stop "
291291
f"--cuda-graph-trace=node "
292292
)
293+
if nsys_cfg.get("gpu_metrics", False):
294+
slurm_local_rank = "\${SLURM_LOCALID}"
295+
nsys_prefix += f"--gpu-metrics-device={slurm_local_rank} "
293296
return nsys_prefix
294297

295298
def _make_container_mounts_string(self) -> str:

0 commit comments

Comments
 (0)