In train_loader, you set pair_constraint=not(args.use_batch_norm). Why is pair_constraint set to False when batch normalization is used? Also, in your implementation, are the images not normalized to [0.0, 1.0]?