Learning in continuous action space for developing high dimensional potential energy models

Published:2022-01-18 Issue:1 Volume:13
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Manna Sukriti, Loeffler Troy D., Batra Rohit, Banik Suvo, Chan Henry, Varughese Bilvin, Sasikumar Kiran, Sternberg Michael, Peterka Tom, Cherukara Mathew J., Gray Stephen K., Sumpter Bobby G., Sankaranarayanan Subramanian K. R. S.

Abstract

Reinforcement learning (RL) approaches that combine a tree search with deep learning have found remarkable success in searching exorbitantly large, albeit discrete, action spaces, as in chess, Shogi and Go. Many real-world materials discovery and design applications, however, involve multi-dimensional search problems and learning domains that have continuous action spaces. Exploring high-dimensional potential energy models of materials is an example. Traditionally, these searches are time-consuming (often several years for a single bulk system) and driven by human intuition and/or expertise, and more recently by global/local optimization searches that have issues with convergence and/or do not scale well with the search dimensionality. Here, in a departure from discrete action and other gradient-based approaches, we introduce an RL strategy based on decision trees that incorporates modified rewards for improved exploration, efficient sampling during playouts and a "window scaling scheme" for enhanced exploitation, to enable efficient and scalable search for continuous action space problems. Using high-dimensional artificial landscapes and control RL problems, we successfully benchmark our approach against popular global optimization schemes and state-of-the-art policy gradient methods, respectively. We demonstrate its efficacy in parameterizing potential models (physics-based and high-dimensional neural networks) for 54 different elemental systems across the periodic table as well as alloys. We analyze error trends across different elements in the latent space and trace their origin to elemental structural diversity and the smoothness of the element energy surface. Broadly, our RL strategy will be applicable to many other physical science problems involving search over continuous action spaces.

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy; General Biochemistry, Genetics and Molecular Biology; General Chemistry; Multidisciplinary

Link

https://www.nature.com/articles/s41467-021-27849-6.pdf

References: 51 articles.

1. Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction (MIT Press, 2018).

2. Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484–489 (2016).

3. Silver, D. et al. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science 362, 1140–1144 (2018).

4. Wang, X. et al. Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning. Chem. Sci. 11, 10959–10972 (2020).

5. Batra, R., Song, L. & Ramprasad, R. Emerging materials intelligence ecosystems propelled by machine learning. Nat. Rev. Mater. 6, 655–678 (2020).

Cited by 17 articles.

1. Learning the stable and metastable phase diagram to accelerate the discovery of metastable phases of boron;APL Machine Learning;2024-01-08

2. Data Efficient and Stability Indicated Sampling for Developing Reactive Machine Learning Potential to Achieve Ultralong Simulation in Lithium-Metal Batteries;The Journal of Physical Chemistry C;2023-12-13

3. Construction of High Accuracy Machine Learning Interatomic Potential for Surface/Interface of Nanomaterials—A Review;Advanced Materials;2023-11-30

4. Identifying the communication of burnout syndrome on the Twitter platform from the individual, organizational, and environmental perspective;Frontiers in Psychology;2023-10-19

5. A Continuous Action Space Tree search for INverse desiGn (CASTING) framework for materials discovery;npj Computational Materials;2023-09-30