I used official data splits and the pre-trained weights you provided to evaluate on the Zind dataset. The experimental results are as follows:

I find these results to be significantly different from those reported in the paper. Could you tell me why? Thank you.
