Integrating PubMed Label Hierarchy Knowledge into a Complex Hierarchical Deep Neural Network-Reference-Cited by-同舟云学术

Integrating PubMed Label Hierarchy Knowledge into a Complex Hierarchical Deep Neural Network

Published:2023-12-09 Issue:24 Volume:13 Page:13117
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Silvestri Stefano¹^ORCID,Gargiulo Francesco¹^ORCID,Ciampi Mario¹^ORCID

Affiliation:

1. Institute for High Performance Computing and Networking, National Research Council of Italy (ICAR-CNR), Via Pietro Castellino 111, 80131 Naples, Italy

Abstract

This paper proposes an innovative method that exploits a complex deep learning network architecture, called Hierarchical Deep Neural Network (HDNN), specifically developed for the eXtreme Multilabel Text Classification (XMTC) task, when the label set is hierarchically organized, such as the case of the PubMed article labeling task. In detail, the topology of the proposed HDNN architecture follows the exact hierarchical structure of the label set to integrate this knowledge directly into the DNN. We assumed that if a label set hierarchy is available, as in the case of the PubMed Dataset, forcing this information into the network topology could enhance the classification performances and the interpretability of the results, especially related to the hierarchy. We performed an experimental assessment of the PubMed article classification task, demonstrating that the proposed HDNN provides performance improvement for a baseline based on a classic flat Convolution Neural Network (CNN) deep learning architecture, in particular in terms of hierarchical measures. These results provide useful hints for integrating previous and innate knowledge in a deep neural network. The drawback of the HDNN is the high computational time required to train the neural network, which can be addressed with a parallel implementation planned as a future work.

Funder

European Union

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/24/13117/pdf

Reference44 articles.

1. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017, January 4–9). Attention is All you Need. Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA.

2. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding;Devlin;Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies,2019

3. Transformers in medical imaging: A survey;Shamshad;Med. Image Anal.,2023

4. Clinical concept extraction using transformers;Yang;J. Am. Med. Inform. Assoc.,2020

5. Xiao, H., Li, L., Liu, Q., Zhu, X., and Zhang, Q. (2023). Transformers in medical image segmentation: A review. Biomed. Signal Process. Control, 84.