TCM Constitution Analysis Method Based on Parallel FP-Growth Algorithm in Hadoop Framework

Author:

Li Mingzheng12ORCID,Lv Xiaojuan3ORCID,Liu Ye1,Wang Lin14ORCID,Song Jianqiang1ORCID

Affiliation:

1. School of Information Engineering, Henan University of Science and Technology, Luoyang 471003, China

2. Lushi Chinese Medicine Hospital, Lushi County, Sanmenxia 472100, China

3. Information Department of PLA Rocket Force Characteristic Medical Center, Beijing 100120, China

4. Henan Qunzhi Information Technology Co. Ltd., Luoyang 471003, China

Abstract

This work is devoted to establishing a comparatively accurate classification model between symptoms, constitutions, and regimens for traditional Chinese medicine (TCM) constitution analysis to provide preliminary screening and decision support for clinical diagnosis. However, for the analysis of massive distributed medical data in a cloud platform, the traditional data mining methods have the problems of low mining efficiency and large memory consumption, and long tuning time, an association rules method for TCM constitution analysis (ARA-TCM) is proposed that based on FP-growth algorithm and the open-source distributed file system in Hadoop framework (HDFS) to make full use of its powerful parallel processing capability. Firstly, the proposed method was used to explore the association rules between the 9 kinds of TCM constitutions and symptoms, as well as the regimen treatment plans, so as to discover the rules of typical clinical symptoms and treatment rules of different constitutions and to conduct an evidence-based medical evaluation of TCM effects in constitution-related chronic disease health management. Secondly, experiments were applied on a self-built TCM clinical records database with a total of 30,071 entries and it is found that the top three constitutions are mid constitution (42.3%), hot and humid constitution (31.3%), and inherited special constitution (26.2%), respectively. What is more, there are obvious promotions in the precision and recall rate compared with the Apriori algorithm, which indicates that the proposed method is suitable for the classification of TCM constitutions. This work is mainly focused on uncovering the rules of “disease symptoms constitution regimen” in TCM medical records, but tongue image and pulse signal are also very important to TCM constitution analysis. Therefore, this additional information should be considered into further studies to be more in line with the actual clinical needs.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Health Informatics,Biomedical Engineering,Surgery,Biotechnology

Reference27 articles.

1. Deep learning and medical image processing for coronavirus (COVID-19) pandemic: A survey

2. Research Progress of Data Mining Algorithm in Traditional Chinese Medicine;X. Zhang;Journal of Jiangxi University of Traditional Chinese Medicine,2015

3. Integration of Data Mining and Complex Networks and its Application in Traditional Chinese Medicine;L. V. Qing-Li;Chinese Traditional & Herbal Drugs,2016

4. Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis

5. FDSMO: Frequent DNA Sequence Mining Using FBSB and Optimization

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3