Understanding political polarization using language models: A dataset and method

Author:

Gode Samiran (1), Bare Supreeth (1), Raj Bhiksha (1, 2), Yoo Hyungon (1)

Affiliation:

1. Carnegie Mellon University, Pittsburgh, Pennsylvania, USA

2. Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates

Abstract

Our paper aims to analyze political polarization in the US political system using language models, and thereby help candidates make an informed decision. The availability of this information will help voters understand their candidates' views on the economy, healthcare, education, and other social issues. Our main contributions are a dataset extracted from Wikipedia that spans the past 120 years and a language model-based method that helps analyze how polarized a candidate is. Our data are divided into two parts, background information and political information about a candidate, since our hypothesis is that the political views of a candidate should be based on reason and be independent of factors such as birthplace, alma mater, and so forth. We further split these data chronologically into four phases to help understand if and how the polarization amongst candidates changes. The data have been cleaned to remove biases. To understand the polarization, we begin by showing results from two classical language models, Word2Vec and Doc2Vec. We then use more powerful techniques such as the Longformer, a transformer-based encoder, to assimilate more information and find the nearest neighbors of each candidate based on their political views and their background. The code and data for the project will be available here: https://github.com/samirangode/Understanding_Polarization
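The nearest-neighbor step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate names, the three-dimensional vectors, and the helper functions `cosine_similarity` and `nearest_neighbors` are all hypothetical, and cosine similarity is assumed as the distance measure. In practice the vectors would come from an encoder such as Doc2Vec or the Longformer, computed separately for the political text and the background text.

```python
import math

def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def nearest_neighbors(embeddings, name, k=2):
    """Return the k candidates most similar to `name`, as (candidate, similarity) pairs."""
    query = embeddings[name]
    scored = [
        (other, cosine_similarity(query, vec))
        for other, vec in embeddings.items()
        if other != name
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Hypothetical toy embeddings; real ones would be produced by a language model.
political = {
    "candidate_a": [0.9, 0.1, 0.0],
    "candidate_b": [0.8, 0.2, 0.1],
    "candidate_c": [0.0, 0.1, 0.9],
}
background = {
    "candidate_a": [0.1, 0.9, 0.2],
    "candidate_b": [0.9, 0.1, 0.1],
    "candidate_c": [0.2, 0.8, 0.3],
}

if __name__ == "__main__":
    # Compare who is closest to candidate_a in each embedding space.
    print("political: ", nearest_neighbors(political, "candidate_a", k=1))
    print("background:", nearest_neighbors(background, "candidate_a", k=1))
```

Comparing the two neighbor lists for the same candidate is one way to probe the stated hypothesis: if political views were independent of background, a candidate's political neighbors would not be expected to coincide systematically with their background neighbors.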

Publisher

Wiley

Subject

Artificial Intelligence

