Emerging trends: When can users trust GPT, and when should they intervene?

Author:

Church, Kenneth

Abstract

Usage of large language models and chat bots will almost surely continue to grow, since they are so easy to use, and so (incredibly) credible. I would be more comfortable with this reality if we encouraged more evaluations with humans-in-the-loop to come up with a better characterization of when the machine can be trusted and when humans should intervene. This article will describe a homework assignment, where I asked my students to use tools such as chat bots and web search to write a number of essays. Even after considerable discussion in class on hallucinations, many of the essays were full of misinformation that should have been fact-checked. Apparently, it is easier to believe ChatGPT than to be skeptical. Fact-checking and web search are too much trouble.

Publisher

Cambridge University Press (CUP)

Cited by 3 articles.

1. Google or ChatGPT: Who is the better helper for university students;Education and Information Technologies;2024-09-10

2. Large language models: Expectations for semantics-driven systems engineering;Data & Knowledge Engineering;2024-07

3. They Prefer Humans! Experimental Measurement of Student Trust in ChatGPT;Extended Abstracts of the CHI Conference on Human Factors in Computing Systems;2024-05-02
