Thanks again for your work! As mentioned in the question, I found that in the pre-trained crowdhuman_hybrid_branch.pth (for detection, I guess), the dim_feedforward parameter is 2048. And it is also mentioned in the instructions. However, in the weights file val/test.pth (for tracking), this parameter is 1024. I don't know why this is happening, if you have time, dealing with these, I would be very appreciative.