-
Notifications
You must be signed in to change notification settings - Fork 17
Open
Labels
Description
I'm thinking of using polyglotdb with a multilingual corpus (that is, doing comparative work across multiple languages, though most files have a single language each). I'm trying to work out if it would be better to have each language as its own database, or a single large database. I'm leaving towards multiple databases with combining data after export, since the enrichment rules might be different for different languages. However, if others have thought through this question and come to a difference answer that would be helpful!