Why not use strong perturbation when training the teacher model?

Thanks for your work. Making full use of large-scale unlabeled data is highly valuable and deserves attention.
 I'm curious about the following:
 1. Why not use strong perturbation augmentation when training the teacher model? As the paper mentions, the labeled images are already sufficient (1.5M). Perhaps a teacher trained this way would generalize as well as, or even better than, the model obtained with semi-supervised training.
 2. Why not apply strong perturbation to the labeled data while training the student model?
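
To make question 2 concrete, here is a minimal sketch of what I have in mind for labeled data. It assumes the strong augmentation includes a CutMix-style spatial mix, which is my assumption and not something I have verified against the training code. The function name `cutmix_labeled` and the `ratio` parameter are hypothetical. Since ground-truth depth exists for labeled images, the same mask can be applied to the labels:

```python
import numpy as np

def cutmix_labeled(img_a, depth_a, img_b, depth_b, rng, ratio=0.5):
    """Hypothetical helper: paste a random rectangle from sample b into
    sample a, applying the same mask to the images and their depth labels.

    img_*:   arrays of shape (H, W, C)
    depth_*: arrays of shape (H, W)
    """
    h, w = depth_a.shape
    cut_h, cut_w = int(h * ratio), int(w * ratio)
    # Top-left corner chosen so the rectangle stays inside the image.
    top = int(rng.integers(0, h - cut_h + 1))
    left = int(rng.integers(0, w - cut_w + 1))
    mask = np.zeros((h, w), dtype=bool)
    mask[top:top + cut_h, left:left + cut_w] = True
    # Inside the mask take sample b, elsewhere keep sample a.
    img = np.where(mask[..., None], img_b, img_a)
    depth = np.where(mask, depth_b, depth_a)
    return img, depth, mask
```

The supervised loss would then be computed on the mixed image against the mixed depth map, so no pseudo labels are needed for this branch.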
@LiheYoung 
 

Issue #250
