From predicting to decision making: Reinforcement learning in biomedicine

Author:

Liu Xuhan12ORCID,Zhang Jun3ORCID,Hou Zhonghuai2ORCID,Yang Yi Isaac1ORCID,Gao Yi Qin1456ORCID

Affiliation:

1. Institute of Systems and Physics Biology Shenzhen Bay Laboratory Shenzhen China

2. Hefei National Laboratory for Physical Sciences at the Microscale University of Science & Technology of China Hefei China

3. Beijing Changping Laboratory Beijing China

4. Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering Peking University Beijing China

5. Beijing Advanced Innovation Center for Genomics Peking University Beijing China

6. Biomedical Pioneering Innovation Center Peking University Beijing China

Abstract

AbstractReinforcement learning (RL) is one important branch of artificial intelligence (AI), which intuitively imitates the learning style of human beings. It is commonly derived from solving game playing problems and is extensively used for decision‐making, control and optimization problems. It has been extensively applied for solving complicated problems with the property of Markov decision‐making processes. With data accumulation and comprehensive analysis, researchers are not only satisfied with predicting the results for experimental systems but also hope to design or control them for the sake of obtaining the desired properties or functions. RL is potentially facilitated to solve a large number of complicated biological and chemical problems, because they could be decomposed into multi‐step decision‐making process. In practice, substantial progress has been made in the application of RL to the field of biomedicine. In this paper, we will first give a brief description about RL, including its definition, basic theory and different type of methods. Then we will review some detailed applications in various domains, for example, molecular design, reaction planning, molecular simulation and etc. In the end, we will summarize the essentialities of RL approaches to solve more diverse problems compared with other machine learning methods and also outlook the possible trends to overcome their limitations in the future.This article is categorized under: Data Science > Chemoinformatics Data Science > Computer Algorithms and Programming Data Science > Artificial Intelligence/Machine Learning

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3