Sociolinguistic Analysis with Missing Metadata? Leveraging Linguistic and Semiotic Resources Through Deep Learning to Investigate English Variation and Change on Twitter

Author:

Gonzales Wilkinson Daniel Wong1ORCID

Affiliation:

1. Department of English, The Chinese University of Hong Kong , Hong Kong SAR , People’s Republic of China

Abstract

Abstract This paper highlights a language and sign-based computational solution to the problem of missing social metadata on Twitter (now, ‘X’): demographic prediction using Deep Learning. It aims to apply this method to variationist sociolinguistics research, illustrating how the approach can facilitate analyses with missing metadata (i.e. stylistic age and sex/gender) by deriving this metadata solely from publicly available linguistic and semiotic resources on Twitter profiles (e.g. display pictures and biographies). I use my investigations of English tweets from the Philippines and Hong Kong as case examples, examining the extent to which the use of the copula and the use of will-shall modals on social media are conditioned by diachronic factors as well as factors internal and external to language (e.g. social factors). The results reveal the influence of stylistic gender and age as well as other factors on patterns of variation. They offer a glimpse into the nuanced sociolinguistic aspects of language usage on social media, highlighting the advantages of utilizing AI-powered Deep Learning to tackle data-related challenges. The discoveries and methodology hold the possibility of influencing other fields and practical situations beyond the study of language and society.

Publisher

Oxford University Press (OUP)

Reference78 articles.

1. Co-training for demographic classification using deep learning from label proportions;Ardehaly,2017

2. ‘quanteda: An R package for the quantitative analysis of textual data,’;Benoit;Journal of Open Source Software,2018

3. ‘The sociolinguistics of Hong Kong and the space for Hong Kong English,’;Bolton;World Englishes,2000

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3