Could classify mother tongues into roughly similar groups for use as e.g. confounder in analyses. Not sure what the best way to group is, need the groups to be big enough, but some suggestion:
- Finnish
- Swedish
- Eastern bloc countries: Russian/Ukrainian/Estonian/Polish/Romanian/Lithuanian/Latvian/Hungarian/...
- Asian: Chinese languages/Nepali/Indian languages/Vietnamese/Japanese/Malaysian ...
- European well-off countries: Spanish/English/German/French/Portuguese/Italian/Norwegian/Danish ...
- Other: Somali/Turkish/Swahili/Sami...