Affiliation:
1. Department of Chemistry, University of Pittsburgh , 219 Parkman Avenue, Pittsburgh, Pennsylvania 15260, USA
Abstract
Genetic algorithms (GAs) are a powerful tool to search large chemical spaces for inverse molecular design. However, GAs have multiple hyperparameters that have not been thoroughly investigated for chemical space searches. In this tutorial, we examine the general effects of a number of hyperparameters, such as population size, elitism rate, selection method, mutation rate, and convergence criteria, on key GA performance metrics. We show that using a self-termination method with a minimum Spearman’s rank correlation coefficient of 0.8 between generations maintained for 50 consecutive generations along with a population size of 32, a 50% elitism rate, three-way tournament selection, and a 40% mutation rate provides the best balance of finding the overall champion, maintaining good coverage of elite targets, and improving relative speedup for general use in molecular design GAs.
Subject
Physical and Theoretical Chemistry,General Physics and Astronomy
Reference58 articles.
1. A.
Nigam
, R.Pollice, G.Tom, K.Jorner, L. A.Thiede, A.Kundaje, and A.Aspuru-Guzik, “Tartarus: A benchmarking platform for realistic and practical inverse molecular design,” arXiv:2209.12487 (2022).
2. Parallel tempered genetic algorithm guided by deep neural networks for inverse molecular design;Digital Discovery,2022
3. Computational evolution of high-performing unfused non-fullerene acceptors for organic solar cells;J. Chem. Phys.,2022
4. Virtual screening of norbornadiene-based molecular solar thermal energy storage systems using a genetic algorithm;J. Chem. Phys.,2021
5. Inverse molecular design using machine learning: Generative models for matter engineering;Science,2018
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献