Large language models are able to downplay their cognitive abilities to fit the persona they simulate

Published:2024-03-13 Issue:3 Volume:19 Page:e0298522
ISSN:1932-6203
Container-title:PLOS ONE
Language:en
Short-container-title:PLoS ONE

Author:

Milička Jiří, Marklová Anna, VanSlambrouck Klára, Pospíšilová Eva, Šimsová Jana, Harvan Samuel, Drobil Ondřej

Abstract

This study explores the capabilities of large language models to replicate the behavior of individuals with underdeveloped cognitive and language skills. Specifically, we investigate whether these models can simulate child-like language and cognitive development while solving false-belief tasks, namely, change-of-location and unexpected-content tasks. GPT-3.5-turbo and GPT-4 models by OpenAI were prompted to simulate children (N = 1296) aged one to six years. This simulation was instantiated through three types of prompts: plain zero-shot, chain-of-thoughts, and primed-by-corpus. We evaluated the correctness of responses to assess the models’ capacity to mimic the cognitive skills of the simulated children. Both models displayed a pattern of increasing correctness in their responses and rising language complexity. That is in correspondence with a gradual enhancement in linguistic and cognitive abilities during child development, which is described in the vast body of research literature on child development. GPT-4 generally exhibited a closer alignment with the developmental curve observed in ‘real’ children. However, it displayed hyper-accuracy under certain conditions, notably in the primed-by-corpus prompt type. Task type, prompt type, and the choice of language model influenced developmental patterns, while temperature and the gender of the simulated parent and child did not consistently impact results. We conducted analyses of linguistic complexity, examining utterance length and Kolmogorov complexity. These analyses revealed a gradual increase in linguistic complexity corresponding to the age of the simulated children, regardless of other variables. These findings show that the language models are capable of downplaying their abilities to achieve a faithful simulation of prompted personas.

Funder

Czech Science Foundation

Publisher

Public Library of Science (PLoS)

Cited by 2 articles.

1. The use of ChatGPT for personality research: Administering questionnaires using generated personas;Personality and Individual Differences;2024-10

2. Why ‘Computational’ Learning Theories?;Advances in Analytics for Learning and Teaching;2024