The political preferences of LLMs

Published:2024-07-31 Issue:7 Volume:19 Page:e0306621
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Rozado David

Abstract

I report here a comprehensive analysis about the political preferences embedded in Large Language Models (LLMs). Namely, I administer 11 political orientation tests, designed to identify the political preferences of the test taker, to 24 state-of-the-art conversational LLMs, both closed and open source. When probed with questions/statements with political connotations, most conversational LLMs tend to generate responses that are diagnosed by most political test instruments as manifesting preferences for left-of-center viewpoints. This does not appear to be the case for five additional base (i.e. foundation) models upon which LLMs optimized for conversation with humans are built. However, the weak performance of the base models at coherently answering the tests’ questions makes this subset of results inconclusive. Finally, I demonstrate that LLMs can be steered towards specific locations in the political spectrum through Supervised Fine-Tuning (SFT) with only modest amounts of politically aligned data, suggesting SFT’s potential to embed political orientation in LLMs. With LLMs beginning to partially displace traditional information sources like search engines and Wikipedia, the societal implications of political biases embedded in LLMs are substantial.

Funder

Institute for Cultural Evolution

Publisher

Public Library of Science (PLoS)

References: 35 articles.

1. GPT-4 Technical Report;OpenAI et al.;arXiv,2023. doi: 10.48550/arXiv.2303.08774

2. Fair is Better than Sensational: Man is to Doctor as Woman is to Doctor;M. Nissim;arXiv:1905.09866 [cs],2019

3. Semantics derived automatically from language corpora contain human-like biases;A. Caliskan;Science,2017

4. Accuracy Comparison Across Face Recognition Algorithms: Where Are We on Measuring Race Bias?;J. G. Cavazos;IEEE Transactions on Biometrics, Behavior, and Identity Science,2021

5. Algorithmic Bias in Recidivism Prediction: A Causal Perspective (Student Abstract);A. Khademi;Proceedings of the AAAI Conference on Artificial Intelligence,2020

Cited by 1 article.

1. Understanding model power in social AI;AI & SOCIETY;2024-08-14