Cognitive Network Science Reveals Bias in GPT-3, GPT-3.5 Turbo, and GPT-4 Mirroring Math Anxiety in High-School Students

Authors:

Katherine Abramski 1, Salvatore Citraro 2, Luigi Lombardi 3, Giulio Rossetti 2, Massimo Stella 3

Affiliation:

1. Department of Computer Science, University of Pisa, 56127 Pisa, Italy

2. Institute of Information Science and Technologies—National Research Council, 56124 Pisa, Italy

3. Department of Psychology and Cognitive Science, University of Trento, 38122 Trento, Italy

Abstract

Large Language Models (LLMs) are becoming increasingly integrated into our lives. Hence, it is important to understand the biases present in their outputs in order to avoid perpetuating harmful stereotypes, which originate in our own flawed ways of thinking. This challenge requires developing new benchmarks and methods for quantifying affective and semantic bias, keeping in mind that LLMs act as psycho-social mirrors that reflect the views and tendencies prevalent in society. One such tendency with harmful effects is the global phenomenon of anxiety toward math and STEM subjects. In this study, we introduce a novel application of network science and cognitive psychology to understand biases towards math and STEM fields in three OpenAI LLMs: GPT-3, GPT-3.5 Turbo, and GPT-4. Specifically, we use behavioral forma mentis networks (BFMNs) to understand how these LLMs frame math and STEM disciplines in relation to other concepts. We use data obtained by probing the three LLMs in a language generation task that has previously been applied to humans. Our findings indicate that LLMs hold negative perceptions of math and STEM fields, associating math with negative concepts in 6 cases out of 10. We observe significant differences across OpenAI's models: newer versions (i.e., GPT-4) produce perceptions that are 5× semantically richer and more emotionally polarized, with fewer negative associations, compared both to older versions and to a sample of N = 159 high-school students. These findings suggest that advances in the architecture of LLMs may lead to increasingly less biased models that could perhaps one day even help reduce harmful stereotypes in society rather than perpetuate them.
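
For readers unfamiliar with the method, the sketch below illustrates the basic idea of a behavioral forma mentis network under simplifying assumptions: cue words and their free associations become nodes and edges, each node carries a valence label, and the framing of a concept such as "math" is read off the valence of its neighbors. The association lists, the valence scores, and the choice of the networkx library are illustrative assumptions, not the authors' data or code.

```python
# Minimal sketch of a behavioral forma mentis network (BFMN), not the authors' pipeline.
# All association and valence data below are invented for illustration only.
import networkx as nx

# Hypothetical free-association responses to cue words (cue -> associated concepts).
associations = {
    "math": ["anxiety", "numbers", "logic", "boring", "difficult"],
    "science": ["discovery", "curiosity", "experiment"],
    "physics": ["difficult", "formulas", "nature"],
}

# Hypothetical valence labels (-1 negative, 0 neutral, +1 positive),
# standing in for valence ratings gathered alongside the associations.
valence = {
    "math": 0, "anxiety": -1, "numbers": 0, "logic": 1, "boring": -1,
    "difficult": -1, "science": 1, "discovery": 1, "curiosity": 1,
    "experiment": 0, "physics": 0, "formulas": 0, "nature": 1,
}

# Build the BFMN: an undirected graph linking each cue to its associates,
# with valence stored as a node attribute.
G = nx.Graph()
for cue, responses in associations.items():
    for response in responses:
        G.add_edge(cue, response)
for node in G.nodes:
    G.nodes[node]["valence"] = valence.get(node, 0)

# How is "math" framed? Count negative concepts among its direct neighbors.
neighbors = list(G.neighbors("math"))
negative = [n for n in neighbors if G.nodes[n]["valence"] < 0]
print("math associates:", neighbors)
print(f"negative share: {len(negative)}/{len(neighbors)}")
```

In the study, networks of this kind are built from the responses of GPT-3, GPT-3.5 Turbo, GPT-4, and the high-school students, and the neighborhoods of "math" and other STEM-related cues are compared across groups.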

Publisher

MDPI AG

Subject

Artificial Intelligence, Computer Science Applications, Information Systems, Management Information Systems

Cited by 4 articles.

1. Enhancing Imbalanced Sentiment Analysis: A GPT-3-Based Sentence-by-Sentence Generation Approach. Applied Sciences, 2024-01-11.

2. A systematic review of ChatGPT use in K-12 education. European Journal of Education, 2023-12-07.

3. Using cognitive psychology to understand GPT-like models needs to extend beyond human biases. Proceedings of the National Academy of Sciences, 2023-10-16.

4. Integrating generative AI in knowledge building. Computers and Education: Artificial Intelligence, 2023.
