The imitation game: Detecting human and AI-generated texts in the era of ChatGPT and BARD

Published: 2024-02-14
ISSN: 0165-5515
Container-title: Journal of Information Science
Language: en
Short-container-title: Journal of Information Science

Author:

Hayawi Kadhim¹, Shahriar Sakib¹, Mathew Sujith Samuel¹

Affiliation:

1. Computational Systems, College of Interdisciplinary Studies, Zayed University, United Arab Emirates

Abstract

The potential of artificial intelligence (AI)-based large language models (LLMs) holds considerable promise in revolutionising education, research and practice. However, distinguishing between human-written and AI-generated text has become a significant task. This article presents a comparative study, introducing a novel dataset of human-written and LLM-generated texts in different genres: essays, stories, poetry and Python code. We employ several machine learning models to classify the texts. Results demonstrate the efficacy of these models in discerning between human and AI-generated text, despite the dataset’s limited sample size. However, the task becomes more challenging when classifying GPT-generated text, particularly in story writing. The results indicate that the models exhibit superior performance in binary classification tasks, such as distinguishing human-generated text from a specific LLM, compared with the more complex multiclass tasks that involve discerning among human-generated and multiple LLMs. Our findings provide insightful implications for AI text detection, while our dataset paves the way for future research in this evolving area.

Publisher

SAGE Publications

Link

http://journals.sagepub.com/doi/pdf/10.1177/01655515241227531

References: 47 articles.

1. Chatting and cheating: Ensuring academic integrity in the era of ChatGPT

Cited by 5 articles.

1. Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency; Applied Sciences; 2024-09-03

2. The art of deception: humanizing AI to outsmart detection; Global Knowledge, Memory and Communication; 2024-08-16

3. Can human intelligence safeguard against artificial intelligence? Exploring individual differences in the discernment of human from AI texts; 2024-04-29

4. Decoding the AI’s Gaze: Unraveling ChatGPT’s Evaluation of Poetic Creativity; Communications in Computer and Information Science; 2024

5. Generative AI and large language models: A new frontier in reverse vaccinology; Informatics in Medicine Unlocked; 2024