Optimal Policy Learning for Disease Prevention Using Reinforcement Learning

Author:

Alam Khan Zahid1, Feng Zhengyong1, Uddin M. Irfan2, Mast Noor2, Ali Shah Syed Atif3,4, Imtiaz Muhammad5, Al-Khasawneh Mahmoud Ahmad4, Mahmoud Marwan6

Affiliation:

1. China West Normal University, Nanchong, China

2. Institute of Computing, Kohat University of Science and Technology, Kohat, Pakistan

3. Faculty of Engineering and Information Technology, Northern University, Nowshera, Pakistan

4. Faculty of Computer and Information Technology, Al-Madinah International University, Kuala Lumpur, Malaysia

5. Faculty of Computer Science, University of Swabi, Swabi, Pakistan

6. King Abdulaziz University, Jeddah, Saudi Arabia

Abstract

Diseases can have a huge impact on the quality of human life. Humans have always sought strategies to avoid diseases that are life-threatening or that degrade quality of life. Effective use of the available resources to control different diseases has always been critical. Owing to the overwhelming popularity of deep learning, researchers have recently become more interested in AI-based solutions for protecting the human population from diseases. Many supervised techniques have been applied to disease diagnosis. However, the main problem with supervised solutions is the availability of data, which are not always obtainable or complete. For instance, we do not have enough data showing the different states of humans and of environments, and how the actions taken by humans or viruses ultimately result in a disease that eventually takes human lives. Therefore, there is a need for unsupervised solutions or techniques that do not depend on an underlying dataset. In this paper, we explore the reinforcement learning approach. We try different reinforcement learning algorithms to investigate solutions for disease prevention in a simulated human population. We explore different techniques for controlling the transmission of diseases and their effects on health in a human population simulated in an environment. Our algorithms find policies that best protect the human population from the transmission and infection of malaria. The paper concludes that deep learning-based algorithms such as Deep Deterministic Policy Gradient (DDPG) outperform traditional algorithms such as Q-Learning and SARSA.
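For readers unfamiliar with the two traditional algorithms named in the abstract, the following is a minimal sketch of the standard tabular Q-Learning and SARSA update rules. It is not the paper's implementation: the function names, the list-of-lists Q-table, and the default values of `alpha` (learning rate) and `gamma` (discount factor) are assumptions chosen for illustration, and the abstract does not describe the states, actions, or rewards of the malaria simulation.

```python
from typing import List

QTable = List[List[float]]  # q[state][action]; hypothetical layout for illustration


def q_learning_update(q: QTable, state: int, action: int, reward: float,
                      next_state: int, alpha: float = 0.1, gamma: float = 0.9) -> None:
    """Off-policy update: bootstrap from the best action available in next_state."""
    target = reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])


def sarsa_update(q: QTable, state: int, action: int, reward: float,
                 next_state: int, next_action: int,
                 alpha: float = 0.1, gamma: float = 0.9) -> None:
    """On-policy update: bootstrap from the action the agent actually takes next."""
    target = reward + gamma * q[next_state][next_action]
    q[state][action] += alpha * (target - q[state][action])
```

The only difference between the two rules is the bootstrap term: Q-Learning uses the maximum value over next actions, while SARSA uses the value of the next action chosen by the current policy. DDPG, by contrast, replaces the table with neural-network actor and critic models and is not sketched here.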

Funder

King Abdulaziz University

Publisher

Hindawi Limited

Subject

Computer Science Applications, Software

Cited by 5 articles.

1. The application of artificial intelligence in health policy: a scoping review;BMC Health Services Research;2023-12-15

2. Change Detection in Water Body Areas Through Optimization Algorithm Using High- and Low-Resolution Satellite Images;Recent Developments in Machine and Human Intelligence;2023-09-11

3. A Privacy-Preserving Untraceable Group Data-Sharing Technique;Recent Developments in Machine and Human Intelligence;2023-09-11

4. Enhancing Prostate Cancer Diagnosis with Multi-class Classification of CT Scan Images;2023 Third International Conference on Secure Cyber Computing and Communication (ICSCCC);2023-05-26

5. Simple Deterministic Selection-Based Genetic Algorithm for Hyperparameter Tuning of Machine Learning Models;Applied Sciences;2022-01-24
