Author:
Bossaglia Giulia, Ferrari Lúcia de Almeida
Abstract
In this paper we present different resources for the study of spoken Brazilian Portuguese, developed within the C-ORAL-BRASIL project. C-ORAL-BRASIL stemmed from the European C-ORAL-ROM project (Cresti & Moneglia, 2005), which compiled spoken corpora of Italian, French, Spanish, and European Portuguese. The corpora of the C-ORAL family are well suited to the analysis of spoken language, since they provide not only the transcripts of the recorded sessions (annotated for prosodic breaks), but also the audio files and the text-to-speech alignment. So far, the C-ORAL-BRASIL project has published C-ORAL-BRASIL I (Informal corpus: Raso & Mello, 2012), while C-ORAL-BRASIL II (to be published by 2019) comprises a Formal corpus (Natural context), a Media corpus, and a Telephonic corpus. Besides these resources, a set of informationally tagged comparable minicorpora (representative samples of the aforementioned corpora) is already available or in preparation, enabling (cross-linguistic) studies focussed on information structure.
Publisher
Universidade Estadual de Campinas
Cited by
1 article.
1. Topic unit detection in spontaneous speech; CHIMERA: Revista de Corpus de Lenguas Romances y Estudios Lingüísticos; 2020-09-28