Effect of Private Deliberation: Deception of Large Language Models in Game Play

Published: 2024-06-18 Issue: 6 Volume: 26 Page: 524
ISSN: 1099-4300
Container-title: Entropy
Language: en

Author:

Poje Kristijan¹, Brcic Mario¹, Kovac Mihael¹, Babac Marina Bagic¹

Affiliation:

1. Faculty of Electrical Engineering and Computing, University of Zagreb, 10000 Zagreb, Croatia

Abstract

Integrating large language model (LLM) agents within game theory demonstrates their ability to replicate human-like behaviors through strategic decision making. In this paper, we introduce an augmented LLM agent, called the private agent, which engages in private deliberation and employs deception in repeated games. Utilizing the partially observable stochastic game (POSG) framework and incorporating in-context learning (ICL) and chain-of-thought (CoT) prompting, we investigated the private agent’s proficiency in both competitive and cooperative scenarios. Our empirical analysis demonstrated that the private agent consistently achieved higher long-term payoffs than its baseline counterpart and performed similarly or better in various game settings. However, we also found inherent deficiencies of LLMs in certain algorithmic capabilities crucial for high-quality decision making in games. These findings highlight the potential for enhancing LLM agents’ performance in multi-player games using information-theoretic approaches of deception and communication with complex environments.

Publisher

MDPI AG

Link

https://www.mdpi.com/1099-4300/26/6/524/pdf

Cited by 1 article.

1. Quantitative analysis of the relationship between expressing gratitude and forgiveness and user sentiment on social media;Global Knowledge, Memory and Communication;2024-08-08