Comparative Analysis of Preference in Contemporary and Earlier Texts Using Entropy Measures
Author:
Mohseni Mahdi12, Redies Christoph2ORCID, Gast Volker1ORCID
Affiliation:
1. Department of English and American Studies, University of Jena, 07743 Jena, Germany 2. Experimental Aesthetics Group, Institute of Anatomy I, Jena University Hospital, University of Jena, 07740 Jena, Germany
Abstract
Research in computational textual aesthetics has shown that there are textual correlates of preference in prose texts. The present study investigates whether textual correlates of preference vary across different time periods (contemporary texts versus texts from the 19th and early 20th centuries). Preference is operationalized in different ways for the two periods, in terms of canonization for the earlier texts, and through sales figures for the contemporary texts. As potential textual correlates of preference, we measure degrees of (un)predictability in the distributions of two types of low-level observables, parts of speech and sentence length. Specifically, we calculate two entropy measures, Shannon Entropy as a global measure of unpredictability, and Approximate Entropy as a local measure of surprise (unpredictability in a specific context). Preferred texts from both periods (contemporary bestsellers and canonical earlier texts) are characterized by higher degrees of unpredictability. However, unlike canonicity in the earlier texts, sales figures in contemporary texts are reflected in global (text-level) distributions only (as measured with Shannon Entropy), while surprise in local distributions (as measured with Approximate Entropy) does not have an additional discriminating effect. Our findings thus suggest that there are both time-invariant correlates of preference, and period-specific correlates.
Funder
German Research Foundation Open Access Publication Fund of the Thueringer Universitaets und Landesbibliothek Jena
Subject
General Physics and Astronomy
Reference73 articles.
1. Kao, J., and Jurafsky, D. A Computational Analysis of Style, Affect, and Imagery in Contemporary Poetry. Proceedings of the Workshop on Computational Linguistics for Literature. 2. Ashok, V., Feng, S., and Choi, Y. (2013, January 18–21). Success with Style: Using Writing Style to Predict the Success of Novels. Proceedings of the Conference on Empirical Methods in Natural Language Processing. 3. Maharjan, S., Arevalo, J., Montes, M., González, F., and Solorio, T. (2017, January 3–7). A Multi-task Approach to Predict Likability of Books. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, Valencia, Spain. 4. Quantifying Structure Differences in Literature Using Symbolic Diversity and Entropy Criteria;Febres;J. Quant. Linguist.,2017 5. Maharjan, S., Kar, S., Montes, M., González, F.A., and Solorio, T. (2018, January 1–6). Letting Emotions Flow: Success Prediction by Modeling the Flow of Emotions in Books. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), New Orleans, LA, USA.
|
|