Author:
Meister Ko. Freitag Raquel
Abstract
Purpose
This paper provides context, from a sociolinguistic perspective, for the documentation of Brazilian Portuguese and for the data collection needed to establish linguistic repositories.
Design/methodology/approach
The main sociolinguistic projects that have generated collections of Brazilian Portuguese language data are presented.
Findings
Comparisons with another kind of repository (seed vaults) and with the accounting concept of assets are evoked to map the challenges to be overcome in proposing a standardized and professional language repository to host the collections of linguistic data arising from the reported projects and others, in accordance with the principles of the open science movement.
Originality/value
With regard to the sustainability of projects to build linguistic documentation repositories, partnerships with the information technology sector, or even with private companies, could minimize problems of data obsolescence and safeguarding by promoting the circulation of data and the automation of analysis through natural language processing algorithms. Such planning may help to promote the longevity of the linguistic documentation repositories of Brazilian sociolinguistic research.
Subject
Library and Information Sciences, Computer Science Applications