How should the seq_length in the training config be set, and how does it affect the training results?