Analysis of ‘One in a Million’ primary care consultation conversations using natural language processing-Reference-Cited by-同舟云学术

Analysis of ‘One in a Million’ primary care consultation conversations using natural language processing

Published:2023-04 Issue:1 Volume:30 Page:e100659
ISSN:2632-1009
Container-title:BMJ Health & Care Informatics
language:en
Short-container-title:BMJ Health Care Inform

Author:

Pyne Yvette^ORCID,Wong Yik Ming,Fang Haishuo,Simpson Edwin

Abstract

BackgroundModern patient electronic health records form a core part of primary care; they contain both clinical codes and free text entered by the clinician. Natural language processing (NLP) could be employed to generate these records through ‘listening’ to a consultation conversation.ObjectivesThis study develops and assesses several text classifiers for identifying clinical codes for primary care consultations based on the doctor–patient conversation. We evaluate the possibility of training classifiers using medical code descriptions, and the benefits of processing transcribed speech from patients as well as doctors. The study also highlights steps for improving future classifiers.MethodsUsing verbatim transcripts of 239 primary care consultation conversations (the ‘One in a Million’ dataset) and novel additional datasets for distant supervision, we trained NLP classifiers (naïve Bayes, support vector machine, nearest centroid, a conventional BERT classifier and few-shot BERT approaches) to identify the International Classification of Primary Care-2 clinical codes associated with each consultation.ResultsOf all models tested, a fine-tuned BERT classifier was the best performer. Distant supervision improved the model’s performance (F1 score over 16 classes) from 0.45 with conventional supervision with 191 labelled transcripts to 0.51. Incorporating patients’ speech in addition to clinician’s speech increased the BERT classifier’s performance from 0.45 to 0.55 F1 (p=0.01, paired bootstrap test).ConclusionsOur findings demonstrate that NLP classifiers can be trained to identify clinical area(s) being discussed in a primary care consultation from audio transcriptions; this could represent an important step towards a smart digital assistant in the consultation room.

Funder

National Institute for Health Research

Wellcome Trust

Publisher

BMJ

Reference32 articles.

1. Topol EJ . The topol review: preparing the healthcare workforce to deliver the digital future. 2019.

2. A Time-Motion Study of Primary Care Physicians’ Work in the Electronic Health Record Era

3. Allocation of Physician Time in Ambulatory Practice: A Time and Motion Study in 4 Specialties

4. Natural language processing in oncology: A review;Yim;JAMA Oncol,2016

5. Applying natural language processing and machine learning techniques to patient experience feedback: a systematic review;Khanbhai;BMJ Health Care Inform,2021

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Integrating deep learning and multi-attention for joint extraction of entities and relationships in engineering consulting texts;Automation in Construction;2024-12

2. Artificial Intelligence and Primary Care: A Scoping Review (Preprint);2024-08-30

3. Deep Learning and Vision Transformer for Medical Image Analysis;Journal of Imaging;2023-07-21