Author:
García Jesús Enrique,Gholizadeh Ramin,González-López Verónica Andrea
Abstract
In this paper we use a distance d between sequences of N-grams to identify N-grams that show a different performance when comparing two sequences of N-grams. With this tool, we inspect written texts of European Portuguese dated between 16th century and 19th century. We identify the most voluble N-grams throughout the period and we also identify N-grams that should be considered when studying the linguistic changes from Classical Portuguese to Modern Portuguese. We find that 2-grams composed by unstressed monosyllables followed by paroxytone words (and viceversa) change markedly, from one text to the next, during the whole period. Stressed monosyllabic words (SMW) reveal discrepancies between written texts of the 16th century when compared with texts from the beginning of the 17th century. 2-grams of (i) SMW followed by paroxytone or oxytone word and (ii) paroxytone dissyllabic word or oxytone word followed by a SMW are some of them.
Publisher
Universidade Estadual de Campinas
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Comparison of Stochastic Processes;Data Analysis and Applications 4;2020-04-15
2. Partition Markov model for multiple processes;Mathematical Methods in the Applied Sciences;2020-01-20
3. Sample selection procedure in daily trading volume processes;Mathematical Methods in the Applied Sciences;2019-06-23
4. A BIC‐based consistent metric between Markovian processes;Applied Stochastic Models in Business and Industry;2018-05-29