Affiliation:
1. Sociology, University of Oxford
2. Sociology, Columbia University
Abstract
Abstract
While applications of machine learning (ML) and natural language processing (NLP) methods have grown in the social sciences within diverse international contexts, these methods remain inherently Eurocentric and strikingly challenging to apply to non-European languages. There is an urgent need for decolonial reconstructions of ML and NLP applications within the social sciences, particularly in meaningfully reconfigured applications of ML and NLP to non-Western contexts. Endeavoring to answer such a cogent calling, this chapter reviews the existing literature that applies machine learning and NLP methods to sociological analyses of Chinese-language contexts. Synthesizing existing research, the authors introduce and establish a new and cutting-edge subfield situated at the intersections of China studies, computer science, and sociology: ‘Chinese computational sociology.’ Contributing the first review of the state of Chinese computational sociology, this chapter outlines the various challenges facing applications of ML and NLP methods for studying social issues in Chinese-language contexts, and suggests possible solutions to such limitations. Beyond existing challenges and solutions, this chapter critically considers future trajectories for Chinese computational sociology as a subfield in its own right.
Reference52 articles.
1. Bao, P., Shen, H. W., Huang, J., & Cheng, X. Q. (2013, May). Popularity prediction in microblogging network: a case study on sina weibo. In Proceedings of the 22nd international conference on world wide web (pp. 177–178).
2. Chinese computational propaganda: Automation, algorithms and the manipulation of information about Chinese politics on Twitter and Weibo.;Information, Communication & Society,2019
3. Borch, C. (2023). Machine learning and postcolonial critique: Homologous challenges to sociological notions of human agency. Sociology, Article 00380385221146877. https://doi.org/10.1177/00380385221146877
4. Che, W., Feng, Y., Qin, L., & Liu, T. (2021). N-LTP: An open-source neural language technology platform for Chinese. arXiv. http://arxiv.org/abs/2009.11616
5. Cheng, L., Lu, Y., & Zhang, H. (2024). Gendered Erasure and Manufactured Passivity: Gender Bias in Government and Media Attention to Protests in China. Mobilization, Forthcoming.