Hello, thank you for the work :) May I ask how many V100s you used to train the model? I'm trying to estimate the total batch size you used (I understand that it's 22 per V100 GPU).