GeneTuring tests GPT models in genomics-Reference-Cited by-同舟云学术

GeneTuring tests GPT models in genomics

Published:2023-03-13 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Hou Wenpin,Ji Zhicheng

Abstract

ABSTRACTGenerative Pre-trained Transformers (GPT) are powerful language models that have great potential to transform biomedical research. However, they are known to suffer from artificial hallucinations and provide false answers that are seemingly correct in some situations. We developed GeneTuring, a comprehensive QA database with 600 questions in genomics, and manually scored 10,800 answers returned by six GPT models, including GPT-3, ChatGPT, and New Bing. New Bing has the best overall performance and significantly reduces the level of AI hallucination compared to other models, thanks to its ability to recognize its incapacity in answering questions. We argue that improving incapacity awareness is equally important as improving model accuracy to address AI hallucination.

Publisher

Cold Spring Harbor Laboratory

Reference19 articles.

1. Language models are unsupervised multitask learners;OpenAI blog,2019

2. Luo, R. et al. Biogpt: generative pre-trained transformer for biomedical text generation and mining. Briefings Bioinforma. 23 (2022).

3. Venigalla, A. , Frankle, J. & Carbin, M. Biomedlm: a domain-specific large language model for biomedical text. https://www.mosaicml.com/blog/introducing-pubmed-gpt.

4. Language models are few-shot learners;Adv. neural information processing systems,2020

5. ChatGPT: five priorities for research

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large language model to multimodal large language model: A journey to shape the biological macromolecules to biological sciences and medicine;Molecular Therapy - Nucleic Acids;2024-09

2. Foundation models for bioinformatics;Quantitative Biology;2024-07-24

3. Bioinformatics and biomedical informatics with ChatGPT: Year one review;Quantitative Biology;2024-06-27

4. Artificial Intelligence in Newborn Medicine;Newborn;2024-06-21

5. Scientific figures interpreted by ChatGPT: strengths in plot recognition and limits in color perception;npj Precision Oncology;2024-04-05