Skip to content

multilingual vs monolingual corpora? #244

@chirila

Description

@chirila

I'm thinking of using polyglotdb with a multilingual corpus (that is, doing comparative work across multiple languages, though most files have a single language each). I'm trying to work out if it would be better to have each language as its own database, or a single large database. I'm leaving towards multiple databases with combining data after export, since the enrichment rules might be different for different languages. However, if others have thought through this question and come to a difference answer that would be helpful!

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions