Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models

Published: 2023-10-21
Container-title: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management

Author:

Chen Yuyan¹, Fu Qiang², Yuan Yichen³, Wen Zhihao⁴, Fan Ge⁵, Liu Dayiheng⁶, Zhang Dongmei⁷, Li Zhixu⁸, Xiao Yanghua⁹

Affiliation:

1. Shanghai Key Laboratory of Data Science & School of Computer Science, Fudan University, Shanghai, China

2. Microsoft, Beijing, China

3. Shanghai Key Laboratory of Data Science, Beijing, China

4. Singapore Management University, Singapore, Singapore

5. Tencent, Shenzhen, China

6. DAMO Academy, Hangzhou, China

7. Microsoft, Shanghai, China

8. Shanghai Key Laboratory of Data Science & School of Computer Science, Fudan University & Fudan-Aishu Cognitive Intelligence Joint Research Center, Shanghai, China

9. Shanghai Key Laboratory of Data Science & School of Computer Science, Fudan University & Fudan-Aishu Cognitive Intelligence Joint Research Center, Beijing, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3583780.3614905

References (83 articles; first 5 shown):

1. Rachith Aiyappa, Jisun An, Haewoon Kwak, and Yong-Yeol Ahn. 2023. Can we trust the evaluation on ChatGPT? arXiv preprint arXiv:2303.12767 (2023).

2. Amos Azaria and Tom Mitchell. 2023. The Internal State of an LLM Knows When its Lying. arXiv preprint arXiv:2304.13734 (2023).

3. Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, et al. 2023. A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023 (2023).

4. Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. 2015. Scheduled sampling for sequence prediction with recurrent neural networks. Advances in neural information processing systems, Vol. 28 (2015).

5. Ali Borji. 2023. A categorical archive of ChatGPT failures. arXiv preprint arXiv:2302.03494 (2023).

Cited by 15 articles.

1. Design and Implementation of an Interactive Question-Answering System with Retrieval-Augmented Generation for Personalized Databases; Applied Sciences; 2024-09-06

2. LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education; Computers and Education: Artificial Intelligence; 2024-09

3. Proposal of User Interface Based on Heavy User Usage Analysis in LLM Service; Archives of Design Research; 2024-08-31

4. The Combined Use of GIS and Generative Artificial Intelligence in Detecting Potential Geodiversity Sites and Promoting Geoheritage; Resources; 2024-08-27

5. Cost-efficient prompt engineering for unsupervised entity resolution in the product matching domain; Discover Artificial Intelligence; 2024-08-16