Hierarchical attention networks for information extraction from cancer pathology reports-Reference-Cited by-同舟云学术

Hierarchical attention networks for information extraction from cancer pathology reports

Published:2017-11-16 Issue:3 Volume:25 Page:321-330
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Gao Shang¹,Young Michael T¹,Qiu John X¹,Yoon Hong-Jun¹,Christian James B¹,Fearn Paul A²,Tourassi Georgia D¹,Ramanthan Arvind¹

Affiliation:

1. Computational Science and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA

2. Surveillance Informatics Branch, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, USA

Abstract

Abstract Objective We explored how a deep learning (DL) approach based on hierarchical attention networks (HANs) can improve model performance for multiple information extraction tasks from unstructured cancer pathology reports compared to conventional methods that do not sufﬁciently capture syntactic and semantic contexts from free-text documents. Materials and Methods Data for our analyses were obtained from 942 deidentiﬁed pathology reports collected by the National Cancer Institute Surveillance, Epidemiology, and End Results program. The HAN was implemented for 2 information extraction tasks: (1) primary site, matched to 12 International Classification of Diseases for Oncology topography codes (7 breast, 5 lung primary sites), and (2) histological grade classiﬁcation, matched to G1–G4. Model performance metrics were compared to conventional machine learning (ML) approaches including naive Bayes, logistic regression, support vector machine, random forest, and extreme gradient boosting, and other DL models, including a recurrent neural network (RNN), a recurrent neural network with attention (RNN w/A), and a convolutional neural network. Results Our results demonstrate that for both information tasks, HAN performed signiﬁcantly better compared to the conventional ML and DL techniques. In particular, across the 2 tasks, the mean micro and macroF-scores for the HAN with pretraining were (0.852,0.708), compared to naive Bayes (0.518, 0.213), logistic regression (0.682, 0.453), support vector machine (0.634, 0.434), random forest (0.698, 0.508), extreme gradient boosting (0.696, 0.522), RNN (0.505, 0.301), RNN w/A (0.637, 0.471), and convolutional neural network (0.714, 0.460). Conclusions HAN-based DL models show promise in information abstraction tasks within unstructured clinical pathology reports.

Funder

NIH

Lawrence Livermore National Laboratory

Los Alamos National Laboratory

Oak Ridge National Laboratory

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/25/3/321/34150614/ocx131.pdf

Reference29 articles.

1. Aiming high—changing the trajectory for cancer;Lowy;New Engl J Med.,2016

2. Ask me anything: dynamic memory networks for natural language processing;Kumar;Proc Int Conf Mach Learn.,2016

3. Convolutional neural networks for sentence classiﬁcation;Kim;arXiv preprint arXiv:14085882.,2014

4. A critical review of recurrent neural networks for sequence learning;Lipton;arXiv preprint arXiv:150600019.,2015

Cited by 85 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Integrating predictive coding and a user-centric interface for enhanced auditing and quality in cancer registry data;Computational and Structural Biotechnology Journal;2024-12

2. Investigating quantitative histological characteristics in renal pathology using HistoLens;Scientific Reports;2024-07-30

3. Classifying Cancer Stage with Open-Source Clinical Large Language Models;2024 IEEE 12th International Conference on Healthcare Informatics (ICHI);2024-06-03

4. Replicating Current Procedural Terminology code assignment of rhinology operative notes using machine learning;World Journal of Otorhinolaryngology - Head and Neck Surgery;2024-05-28

5. Aspect based hotel recommendation system using dilated multichannel CNN and BiGRU with hyperbolic linear unit;International Journal of Machine Learning and Cybernetics;2024-05-05