Thanks for sharing such an excellent work: Now I want to use eight GPUs for DDP training on a server. How should I use the script? I am a little confused about the usage of nr. Looking forward to your reply! thanks!