Corpus of Apalaí (Cariban) Texts and Narratives
Authors: Fernando Orphão de Carvalho & Fabrício Ferraz Gerardi
This repository contains a corpus of texts and narratives in Apalaí, a Cariban language spoken in northern Brazil. The materials include transcriptions, translations, and annotations of spoken and written texts collected in linguistic fieldwork. The corpus aims to support linguistic analysis, language documentation, and revitalization efforts.
texts/: Directory with raw and annotated Apalaí texts.metadata/: Metadata files in OLAC and/or CMDI format.audio/(optional): Audio recordings associated with the texts.tools/: Scripts and tools used for text processing and formatting.README.md: Project description and documentation.
- To provide high-quality, accessible documentation of the Apalaí language.
- To support research in Cariban linguistics.
- To aid in language revitalization efforts for Apalaí communities.
- To contribute to the growing set of digital resources on South American Indigenous languages.
If you use this corpus in your research, please cite it as:
de Carvalho, F.O. and Ferraz Gerardi, F. (2025) ApalaiCorpus: Apalai Corpus (v0.1). Zenodo. https://doi.org/10.5281/zenodo.15518117
This corpus is provided for non-commercial research and educational purposes only. All rights reserved by the authors unless otherwise stated. For questions about reuse, please contact the authors.
- Fernando Orphão de Carvalho
- Fabrício Ferraz Gerardi