ML-Net: multi-label classification of biomedical texts with deep neural networks-Reference-Cited by-同舟云学术

ML-Net: multi-label classification of biomedical texts with deep neural networks

Published:2019-06-24 Issue:11 Volume:26 Page:1279-1285
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Du Jingcheng¹²,Chen Qingyu¹,Peng Yifan¹^ORCID,Xiang Yang²,Tao Cui²,Lu Zhiyong¹

Affiliation:

1. National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, Maryland, USA

2. The University of Texas School of Biomedical Informatics, Houston, Texas, USA

Abstract

Abstract Objective In multi-label text classification, each textual document is assigned 1 or more labels. As an important task that has broad applications in biomedicine, a number of different computational methods have been proposed. Many of these methods, however, have only modest accuracy or efficiency and limited success in practical use. We propose ML-Net, a novel end-to-end deep learning framework, for multi-label classification of biomedical texts. Materials and Methods ML-Net combines a label prediction network with an automated label count prediction mechanism to provide an optimal set of labels. This is accomplished by leveraging both the predicted confidence score of each label and the deep contextual information (modeled by ELMo) in the target document. We evaluate ML-Net on 3 independent corpora in 2 text genres: biomedical literature and clinical notes. For evaluation, we use example-based measures, such as precision, recall, and the F measure. We also compare ML-Net with several competitive machine learning and deep learning baseline models. Results Our benchmarking results show that ML-Net compares favorably to state-of-the-art methods in multi-label classification of biomedical text. ML-Net is also shown to be robust when evaluated on different text genres in biomedicine. Conclusion ML-Net is able to accuractely represent biomedical document context and dynamically estimate the label count in a more systematic and accurate manner. Unlike traditional machine learning methods, ML-Net does not require human effort for feature engineering and is a highly efficient and scalable approach to tasks with a large set of labels, so there is no need to build individual classifiers for each separate label.

Funder

NIH

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/26/11/1279/36089060/ocz085.pdf

Reference38 articles.

1. Recommending MeSH terms for annotating biomedical articles;Huang;J Am Med Inform Assoc,2011

2. DeepMeSH: Deep semantic representation for improving large-scale MeSH indexing;Peng;Bioinformatics,2016

3. Diagnosis code assignment: models and evaluation metrics;Perotte;J Am Med Inform Assoc,2014

Cited by 81 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Updating Correlation-Enhanced Feature Learning for Multi-Label Classification;Mathematics;2024-07-07

2. CoocNet: a novel approach to multi-label text classification with improved label co-occurrence modeling;Applied Intelligence;2024-07-02

3. MCICT: Graph convolutional network-based end-to-end model for multi-label classification of imbalanced clinical text;Biomedical Signal Processing and Control;2024-05

4. Semantic features analysis for biomedical lexical answer type prediction using ensemble learning approach;Knowledge and Information Systems;2024-04-25

5. A Hybrid Principal Label Space Transformation-Based Binary Relevance Support Vector Machine and Q-Learning Algorithm for Multi-label Classification;Arabian Journal for Science and Engineering;2024-04-20