Abstract
Background: Hypertension is a global health concern, and it generates a vast body of unstructured data, such as clinical notes, diagnosis reports, and discharge summaries, that can provide valuable insights. Natural Language Processing (NLP) has emerged as a powerful tool for extracting knowledge from unstructured data. This scoping review aims to explore the development and application of NLP on unstructured clinical data in hypertension, synthesizing existing research to identify trends, gaps, and underexplored areas for future investigation.

Methods: We conducted a systematic search of electronic databases, including PubMed/MEDLINE, Embase, Cochrane Library, Scopus, Web of Science, ACM Digital Library, and IEEE Xplore Digital Library, to identify relevant studies published until the end of 2022. The search strategy included keywords related to hypertension, NLP, and unstructured data. Data extraction covered study characteristics, NLP methods, types of unstructured data sources, and key findings and limitations.

Results: The initial search yielded 951 articles, of which 45 met the inclusion criteria. The selected studies spanned various aspects of hypertension, including diagnosis, treatment, epidemiology, and clinical decision support. NLP was primarily used for extracting clinical information from unstructured electronic health record (EHR) documents and for text classification. Clinical notes were the most common source of unstructured data. Key findings included improved diagnostic accuracy and the ability to comprehensively identify hypertensive patients by combining structured and unstructured data. However, the review revealed a lack of more advanced NLP techniques applied to hypertension, a lack of generalization of NLP beyond benchmark datasets, and a limited focus on the integration of NLP tools into clinical practice.

Discussion: This scoping review highlights the diverse applications of NLP in hypertension research, emphasizing its potential to transform the field by harnessing valuable insights from unstructured data sources. There is a need to adopt and customize more advanced NLP methods for hypertension research. Future research should prioritize the development of NLP tools that can be seamlessly integrated into clinical settings to enhance hypertension management.

Conclusion: NLP demonstrates considerable promise in gleaning meaningful insights from the vast expanse of unstructured data within the field of hypertension, shedding light on diagnosis, treatment, and the identification of patient cohorts. As the field advances, there is a critical need to promote the use and development of advanced NLP methodologies that are tailored to hypertension and validated on real-world unstructured data.
Publisher
Cold Spring Harbor Laboratory