Cohort selection for clinical trials using hierarchical neural network

Published:2019-07-15 Issue:11 Volume:26 Page:1203-1208
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en

Author:

Xiong Ying¹,Shi Xue¹,Chen Shuai¹,Jiang Dehuan¹,Tang Buzhou¹,Wang Xiaolong¹,Chen Qingcai¹,Yan Jun²

Affiliation:

1. Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China

2. Yidu Cloud (Beijing) Technology Co., Ltd, Beijing, China

Abstract

Objective: Cohort selection for clinical trials is a key step in clinical research. We proposed a hierarchical neural network to determine whether a patient satisfies the selection criteria.

Materials and Methods: We designed a hierarchical neural network (denoted CNN-Highway-LSTM or LSTM-Highway-LSTM) for track 1 of the 2018 national natural language processing (NLP) clinical challenge (n2c2) on cohort selection for clinical trials. The network is composed of 5 components: (1) sentence representation using a convolutional neural network (CNN) or a long short-term memory (LSTM) network; (2) a highway network to adjust information flow; (3) a self-attention neural network to reweight sentences; (4) document representation using an LSTM, which takes sentence representations in chronological order as input; (5) a fully connected neural network to determine whether each criterion is met. We compared the proposed method with its variants: methods that use only the first component to represent documents directly, followed by the fully connected neural network for classification (denoted CNN-only or LSTM-only), and methods without the highway network (denoted CNN-LSTM or LSTM-LSTM). The performance of all methods was measured by micro-averaged precision, recall, and F1 score.

Results: The micro-averaged F1 scores of CNN-only, LSTM-only, CNN-LSTM, LSTM-LSTM, CNN-Highway-LSTM, and LSTM-Highway-LSTM were 85.24%, 84.25%, 87.27%, 88.68%, 88.48%, and 90.21%, respectively. The highest micro-averaged F1 score (90.21%) is higher than our submitted result of 88.55%, which was one of the top-ranked results in the challenge. The results indicate that the proposed method is effective for cohort selection for clinical trials.

Discussion: Although the proposed method achieved promising results, some errors were caused by word ambiguity, negation, number analysis, and an incomplete dictionary. Moreover, imbalanced data is another challenge that needs to be tackled in the future.

Conclusion: In this article, we proposed a hierarchical neural network for cohort selection. Experimental results show that the method is effective for cohort selection.
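
The abstract reports micro-averaged precision, recall, and F1 but does not spell out the computation. The following is a minimal sketch of generic micro-averaging, assuming each criterion is a binary label per patient and that counts for the "met" class are pooled over all criteria before the scores are computed. The function name `micro_prf`, the label encoding, and the toy criterion names are illustrative assumptions; this is not the official n2c2 scorer, which may average differently.

```python
from typing import Dict, List, Tuple

# Assumed encoding: True = criterion "met", False = "not met".
# Maps a criterion name to one label per patient.
Labels = Dict[str, List[bool]]


def micro_prf(gold: Labels, pred: Labels) -> Tuple[float, float, float]:
    """Pool TP/FP/FN over all criteria, then compute precision, recall, F1."""
    tp = fp = fn = 0
    for criterion, gold_labels in gold.items():
        for g, p in zip(gold_labels, pred[criterion]):
            if p and g:
                tp += 1  # predicted met, actually met
            elif p and not g:
                fp += 1  # predicted met, actually not met
            elif g and not p:
                fn += 1  # predicted not met, actually met
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy data; the criterion names are hypothetical, not the challenge's label set.
gold = {"CRITERION_A": [True, False, True], "CRITERION_B": [False, False, True]}
pred = {"CRITERION_A": [True, True, False], "CRITERION_B": [False, False, True]}
precision, recall, f1 = micro_prf(gold, pred)
```

Because the counts are pooled before dividing, criteria with many positive patients weigh more heavily in the score than rare ones, which is one reason the imbalanced data mentioned in the Discussion matters for this metric.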

Funder

NIH

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/26/11/1203/36089025/ocz099.pdf

References: 31 articles.

1. Symbolic rule-based classification of lung cancer stages from free-text pathology reports;Nguyen;J Am Med Inform Assoc,2010

2. A machine learning-based framework to identify type 2 diabetes through electronic health records;Zheng;Int J Med Inform,2017

3. Automated extraction of ejection fraction for quality measurement using regular expressions in Unstructured Information Management Architecture (UIMA) for heart failure;Garvin;J Am Med Inform Assoc,2012

4. Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system;Zeng;BMC Med Inform Decis Mak,2006

5. Recognizing obesity and comorbidities in sparse data;Uzuner;J Am Med Inform Assoc,2009

Cited by 24 articles.

1. Artificial intelligence for optimizing recruitment and retention in clinical trials: a scoping review;Journal of the American Medical Informatics Association;2024-09-11

2. Advancing Chinese biomedical text mining with community challenges;Journal of Biomedical Informatics;2024-09

3. Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model;2024

4. LeafAI: query generator for clinical cohort discovery rivaling a human programmer;Journal of the American Medical Informatics Association;2023-08-07

5. Tracking persistent postoperative opioid use: a proof-of-concept study demonstrating a use case for natural language processing;Regional Anesthesia & Pain Medicine;2023-07-06