How Can the Clinical Aptitude of AI Assistants Be Assayed?

Published: 2023-12-05 Volume: 25 Page: e51603
ISSN: 1438-8871
Container-title: Journal of Medical Internet Research
Language: en
Short-container-title: J Med Internet Res

Author:

Thirunavukarasu Arun James

Abstract

Large language models (LLMs) are exhibiting remarkable performance in clinical contexts, with exemplar results ranging from expert-level attainment in medical examination questions to superior accuracy and relevance when responding to patient queries compared to real doctors replying to queries on social media. The deployment of LLMs in conventional health care settings is yet to be reported, and there remains an open question as to what evidence should be required before such deployment is warranted. Early validation studies use unvalidated surrogate variables to represent clinical aptitude, and it may be necessary to conduct prospective randomized controlled trials to justify the use of an LLM for clinical advice or assistance, as potential pitfalls and pain points cannot be exhaustively predicted. This viewpoint states that as LLMs continue to revolutionize the field, there is an opportunity to improve the rigor of artificial intelligence (AI) research to reward innovation, conferring real benefits to real patients.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference: 16 articles.

1. Large language models will not replace healthcare professionals: curbing popular fears and hype

2. Foundation models for generalist medical artificial intelligence

3. Large language models in medicine

4. Trialling a Large Language Model (ChatGPT) in General Practice With the Applied Knowledge Test: Observational Study Demonstrating Opportunities and Limitations in Primary Care

Cited by 3 articles.

1. Large language models approach expert-level clinical knowledge and reasoning in ophthalmology: A head-to-head cross-sectional study; PLOS Digital Health; 2024-04-17

2. Clinical performance of automated machine learning: A systematic review; Annals of the Academy of Medicine, Singapore; 2024-03-27