Natural Language Processing Techniques for Text Classification of Biomedical Documents: A Systematic Review

Author:

Kesiku Cyrille YetuYetu, Chaves-Villota Andrea, Garcia-Zapirain Begonya

Abstract

The classification of biomedical literature bears on a number of critical questions that physicians are expected to answer, many of which are extremely difficult. It can support tasks such as diagnosis and treatment, the efficient representation of concepts such as medications, procedure codes, and patient visits, and the quick retrieval of a document or the classification of diseases. Pathologies are identified from clinical notes, among other sources. The goal of this systematic review is to analyze the literature on classification problems involving patients' medical texts according to criteria such as the quality of the evaluation metrics used, the machine learning methods applied, and the data sets employed, in order to highlight the best-performing methods for this type of problem and to identify the associated challenges. The study covers the period from 1 January 2016 to 10 July 2022. We searched multiple databases and archives of research articles, including Web of Science, Scopus, MDPI, arXiv, IEEE, and ACM, and found 894 articles on text classification, which we filtered using inclusion and exclusion criteria. Following a thorough review, we selected 33 articles dealing with biomedical text classification. Our investigation revealed two major challenges linked to the methodology and data used for biomedical text classification: first, the data-centric challenge, and second, the data quality challenge.

Publisher

MDPI AG

Subject

Information Systems


Cited by 3 articles.

1. Advancing Preauthorization Task in Healthcare: An Application of Deep Active Incremental Learning for Medical Text Classification;Engineering, Technology & Applied Science Research;2023-12-05

2. Biomedical Text Document Classification;International Journal of Engineering Technology and Management Sciences;2023

3. Systematic review of natural language processing for recurrent cancer detection from electronic medical records;Informatics in Medicine Unlocked;2023
