Artificial Intelligence in Postoperative Care: Assessing Large Language Models for Patient Recommendations in Plastic Surgery
-
Published:2024-05-24
Issue:11
Volume:12
Page:1083
-
ISSN:2227-9032
-
Container-title:Healthcare
-
language:en
-
Short-container-title:Healthcare
Author:
Gomez-Cabello Cesar A. (1), Borna Sahar (1), Pressman Sophia M. (1), Haider Syed Ali (1), Sehgal Ajai (2), Leibovich Bradley C. (2, 3), Forte Antonio J. (1, 2)
Affiliation:
1. Division of Plastic Surgery, Mayo Clinic, Jacksonville, FL 32224, USA
2. Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
3. Department of Urology, Mayo Clinic, Rochester, MN 55905, USA
Abstract
Since the release of large language models (LLMs), the medical community has been actively exploring their capabilities, which show promise in providing accurate medical knowledge. One potential application is as a patient resource. This study analyzes and compares the ability of three currently available LLMs, ChatGPT-3.5, GPT-4, and Gemini, to provide postoperative care recommendations to plastic surgery patients. We presented each model with 32 questions addressing common patient concerns after cosmetic surgical procedures and evaluated the medical accuracy, readability, understandability, and actionability of the models’ responses. The three LLMs did not differ significantly in accuracy (p = 0.849), with ChatGPT-3.5 averaging the highest score on the Likert scale (LS) (4.18 ± 0.93), while Gemini provided significantly more readable (p = 0.001) and understandable responses (p = 0.014; p = 0.001). There was no difference in the actionability of the models’ responses (p = 0.830). Although LLMs have shown potential as adjunctive tools in postoperative patient care, further refinement and research are needed before they can serve as comprehensive standalone resources.