Data Processing and Text Mining Technologies on Electronic Medical Records: A Review-Reference-Cited by-同舟云学术

Data Processing and Text Mining Technologies on Electronic Medical Records: A Review

Published:2018 Issue: Volume:2018 Page:1-9
ISSN:2040-2295
Container-title:Journal of Healthcare Engineering
language:en
Short-container-title:Journal of Healthcare Engineering

Author:

Sun Wencheng¹,Cai Zhiping¹^ORCID,Li Yangyang²,Liu Fang³,Fang Shengqun¹,Wang Guoyan⁴

Affiliation:

1. College of Computer, National University of Defense Technology, Changsha 410073, China

2. Innovation Center, China Academy of Electronics and Information Technology, Beijing 100041, China

3. School of Data and Computer Science, Sun Yat-sen University, Guangzhou 510006, China

4. Xuzhou University of Technology, Xuzhou 221002, China

Abstract

Currently, medical institutes generally use EMR to record patient’s condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Health Informatics,Biomedical Engineering,Surgery,Biotechnology

Link

http://downloads.hindawi.com/journals/jhe/2018/4302425.pdf

Reference29 articles.

1. A Time and Location Correlation Incentive Scheme for Deep Data Gathering in Crowdsourcing Networks

2. An Aggregate Signature Based Trust Routing for Data Gathering in Sensor Networks

3. Green Data Gathering under Delay Differentiated Services Constraint for Internet of Things