Qualitative Research Methods for Large Language Models: Conducting Semi-Structured Interviews with ChatGPT and BARD on Computer Science Education

Published: 2023-10-12, Issue: 4, Volume: 10, Page: 78
ISSN: 2227-9709
Container-title: Informatics
Language: en
Short-container-title: Informatics

Author:

Dengel Andreas¹, Gehrlein Rupert¹, Fernes David¹, Görlich Sebastian¹, Maurer Jonas¹, Pham Hai Hoang¹, Großmann Gabriel¹, Eisermann Niklas Dietrich genannt¹

Affiliation:

1. Institute for the Didactics of Mathematics and Computer Science, Goethe University Frankfurt, 63025 Frankfurt, Germany

Abstract

In the current era of artificial intelligence, large language models such as ChatGPT and BARD are being increasingly used for various applications, such as language translation, text generation, and human-like conversation. The fact that these models consist of large amounts of data, including many different opinions and perspectives, could introduce the possibility of a new qualitative research approach: Due to the probabilistic character of their answers, “interviewing” these large language models could give insights into public opinions in a way that otherwise only interviews with large groups of subjects could deliver. However, it is not yet clear if qualitative content analysis research methods can be applied to interviews with these models. Evaluating the applicability of qualitative research methods to interviews with large language models could foster our understanding of their abilities and limitations. In this paper, we examine the applicability of qualitative content analysis research methods to interviews with ChatGPT in English, ChatGPT in German, and BARD in English on the relevance of computer science in K-12 education, which was used as an exemplary topic. We found that the answers produced by these models strongly depended on the provided context, and the same model could produce heavily differing results for the same questions. From these results and the insights throughout the process, we formulated guidelines for conducting and analyzing interviews with large language models. Our findings suggest that qualitative content analysis research methods can indeed be applied to interviews with large language models, but with careful consideration of contextual factors that may affect the responses produced by these models. The guidelines we provide can aid researchers and practitioners in conducting more nuanced and insightful interviews with large language models. 
From an overall view of our results, we generally do not recommend using interviews with large language models for research purposes, due to their highly unpredictable results. However, we suggest using these models as exploration tools for gaining different perspectives on research topics and for testing interview guidelines before conducting real-world interviews.

Publisher

MDPI AG

Subject

Computer Networks and Communications, Human-Computer Interaction, Communication

Link

https://www.mdpi.com/2227-9709/10/4/78/pdf

References: 53 articles.

1. OpenAI (2023, July 18). GPT-4 Technical Report. Available online: http://xxx.lanl.gov/abs/2303.08774.

2. Google (2023, July 18). Bard Experiment. Available online: https://bard.google.com.

3. Shen (2023). ChatGPT and other large language models are double-edged swords. Radiology.

4. Rillig (2023). Risks and benefits of large language models for the environment. Environ. Sci. Technol.

5. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., and Polosukhin, I. (2017, January 4–9). Attention is all you need. Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA.

Cited by 8 articles.

1. Carbon emission reduction in construction industry: qualitative insights on procurement, policies and artificial intelligence. Built Environment Project and Asset Management, 2024-07-19.

2. Voices from the algorithm: Large language models in social research. Energy Research & Social Science, 2024-07.

3. Examining How the Large Language Models Impact the Conceptual Design with Human Designers: A Comparative Case Study. International Journal of Human–Computer Interaction, 2024-07.

4. Societal impacts of chatbot and mitigation strategies for negative impacts: A large-scale qualitative survey of ChatGPT users. Technology in Society, 2024-06.

5. Disruptive Factors in Product Portfolio Management: An Exploratory Study in B2B Manufacturing for Sustainable Transition. Sustainability, 2024-05-23.