Synergy of semiempirical models and machine learning in computational chemistry

Author:

Fedik Nikita12ORCID,Nebgen Benjamin1ORCID,Lubbers Nicholas3ORCID,Barros Kipton12ORCID,Kulichenko Maksim1ORCID,Li Ying Wai3ORCID,Zubatyuk Roman4ORCID,Messerly Richard1ORCID,Isayev Olexandr4ORCID,Tretiak Sergei125ORCID

Affiliation:

1. Theoretical Division, Los Alamos National Laboratory 1 , Los Alamos, New Mexico 87545, USA

2. Center for Nonlinear Studies, Los Alamos National Laboratory 2 , Los Alamos, New Mexico 87545, USA

3. Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory 3 , Los Alamos, New Mexico 87545, USA

4. Department of Chemistry, Mellon College of Science, Carnegie Mellon University 4 , Pittsburgh, Pennsylvania 15213, USA

5. Center for Integrated Nanotechnologies Los Alamos National Laboratory 5 , Los Alamos, New Mexico 87545, USA

Abstract

Catalyzed by enormous success in the industrial sector, many research programs have been exploring data-driven, machine learning approaches. Performance can be poor when the model is extrapolated to new regions of chemical space, e.g., new bonding types, new many-body interactions. Another important limitation is the spatial locality assumption in model architecture, and this limitation cannot be overcome with larger or more diverse datasets. The outlined challenges are primarily associated with the lack of electronic structure information in surrogate models such as interatomic potentials. Given the fast development of machine learning and computational chemistry methods, we expect some limitations of surrogate models to be addressed in the near future; nevertheless spatial locality assumption will likely remain a limiting factor for their transferability. Here, we suggest focusing on an equally important effort—design of physics-informed models that leverage the domain knowledge and employ machine learning only as a corrective tool. In the context of material science, we will focus on semi-empirical quantum mechanics, using machine learning to predict corrections to the reduced-order Hamiltonian model parameters. The resulting models are broadly applicable, retain the speed of semiempirical chemistry, and frequently achieve accuracy on par with much more expensive ab initio calculations. These early results indicate that future work, in which machine learning and quantum chemistry methods are developed jointly, may provide the best of all worlds for chemistry applications that demand both high accuracy and high numerical efficiency.

Funder

Los Alamos National Laboratory

Center for Integrated Nanotechnologies

Center for Nonlinear Studies

Office of Science

Basic Energy Sciences

Chemical Sciences, Geosciences, and Biosciences Division

National Science Foundation

Publisher

AIP Publishing

Subject

Physical and Theoretical Chemistry,General Physics and Astronomy

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3