A Natural Language Processing–Assisted Extraction System for Gleason Scores: Development and Usability Study-Reference-Cited by-同舟云学术

A Natural Language Processing–Assisted Extraction System for Gleason Scores: Development and Usability Study

Published:2021-07-02 Issue:3 Volume:7 Page:e27970
ISSN:2369-1999
Container-title:JMIR Cancer
language:en
Short-container-title:JMIR Cancer

Author:

Yu Shun^ORCID,Le Anh^ORCID,Feld Emily^ORCID,Schriver Emily^ORCID,Gabriel Peter^ORCID,Doucette Abigail^ORCID,Narayan Vivek^ORCID,Feldman Michael^ORCID,Schwartz Lauren^ORCID,Maxwell Kara^ORCID,Mowery Danielle^ORCID

Abstract

Background Natural language processing (NLP) offers significantly faster variable extraction compared to traditional human extraction but cannot interpret complicated notes as well as humans can. Thus, we hypothesized that an “NLP-assisted” extraction system, which uses humans for complicated notes and NLP for uncomplicated notes, could produce faster extraction without compromising accuracy. Objective The aim of this study was to develop and pilot an NLP-assisted extraction system to leverage the strengths of both human and NLP extraction of prostate cancer Gleason scores. Methods We collected all available clinical and pathology notes for prostate cancer patients in an unselected academic biobank cohort. We developed an NLP system to extract prostate cancer Gleason scores from both clinical and pathology notes. Next, we designed and implemented the NLP-assisted extraction system algorithm to categorize notes into “uncomplicated” and “complicated” notes. Uncomplicated notes were assigned to NLP extraction and complicated notes were assigned to human extraction. We randomly reviewed 200 patients to assess the accuracy and speed of our NLP-assisted extraction system and compared it to NLP extraction alone and human extraction alone. Results Of the 2051 patients in our cohort, the NLP system extracted a prostate surgery Gleason score from 1147 (55.92%) patients and a prostate biopsy Gleason score from 1624 (79.18%) patients. Our NLP-assisted extraction system had an overall accuracy rate of 98.7%, which was similar to the accuracy of human extraction alone (97.5%; P=.17) and significantly higher than the accuracy of NLP extraction alone (95.3%; P<.001). Moreover, our NLP-assisted extraction system reduced the workload of human extractors by approximately 95%, resulting in an average extraction time of 12.7 seconds per patient (vs 256.1 seconds per patient for human extraction alone). Conclusions We demonstrated that an NLP-assisted extraction system was able to achieve much faster Gleason score extraction compared to traditional human extraction without sacrificing accuracy.

Publisher

JMIR Publications Inc.

Subject

Cancer Research,Oncology

Reference10 articles.

1. Using Natural Language Processing to Improve Efficiency of Manual Chart Abstraction in Research: The Case of Breast Cancer Recurrence

2. Validation of a contemporary prostate cancer grading system using prostate cancer death as outcome

3. New Prostate Cancer Grading System Predicts Long-term Survival Following Surgery for Gleason Score 8–10 Prostate Cancer

4. The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population-based to a more “personalized” approach to cancer staging

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Protocol for a Systematic Review (Preprint);2024-09-03

2. Natural language processing pipeline to extract prostate cancer-related information from clinical notes;European Radiology;2024-06-06

3. Digital Health Applications in Oncology: An Opportunity to Seize;JNCI: Journal of the National Cancer Institute;2022-05-31