Impact of non-normal error distributions on the benchmarking and ranking of quantum machine learning models

Published: 2020-08-18 Issue: 3 Volume: 1 Page: 035011
ISSN: 2632-2153
Container-title: Machine Learning: Science and Technology
Short-container-title: Mach. Learn.: Sci. Technol.

Author:

Pernot Pascal, Huang Bing, Savin Andreas

Abstract

Quantum machine learning models have been gaining significant traction within atomistic simulation communities. Conventionally, relative model performances are being assessed and compared using learning curves (prediction error vs. training set size). This article illustrates the limitations of using the Mean Absolute Error (MAE) for benchmarking, which is particularly relevant in the case of non-normal error distributions. We analyze more specifically the prediction error distribution of the kernel ridge regression with SLATM representation and L2 distance metric (KRR-SLATM-L2) for effective atomization energies of QM7b molecules calculated at the level of theory CCSD(T)/cc-pVDZ. Error distributions of HF and MP2 at the same basis set referenced to CCSD(T) values were also assessed and compared to the KRR model. We show that the true performance of the KRR-SLATM-L2 method over the QM7b dataset is poorly assessed by the Mean Absolute Error, and can be notably improved after adaptation of the learning set.
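The abstract's point about the MAE and non-normal error distributions can be illustrated with a minimal sketch on synthetic data. Nothing below comes from the article: the two error samples are random draws (a normal distribution and a heavy-tailed Student's t distribution with 3 degrees of freedom), both rescaled to the same MAE, and the 95th percentile of the absolute errors (called `q95` here) is an assumed example of a tail-sensitive statistic, chosen only for illustration.

```python
import numpy as np


def mae(errors):
    """Mean absolute error of a sample of signed errors."""
    return float(np.mean(np.abs(errors)))


def q95(errors):
    """95th percentile of the absolute errors (illustrative tail statistic)."""
    return float(np.percentile(np.abs(errors), 95))


def rescale_to_unit_mae(errors):
    """Rescale a sample so that its MAE is exactly 1."""
    return errors / mae(errors)


def compare(n=100_000, seed=0):
    """Build two synthetic error samples with identical MAE and return
    (MAE, Q95) for each. The distributions are assumptions for illustration."""
    rng = np.random.default_rng(seed)
    normal = rescale_to_unit_mae(rng.normal(size=n))
    # Student's t with 3 degrees of freedom: heavier tails than the normal.
    heavy = rescale_to_unit_mae(rng.standard_t(df=3, size=n))
    return {
        "normal": (mae(normal), q95(normal)),
        "heavy_tailed": (mae(heavy), q95(heavy)),
    }


if __name__ == "__main__":
    for name, (m, q) in compare().items():
        print(f"{name:>12}: MAE = {m:.3f}, Q95 = {q:.3f}")
```

Both samples report the same MAE by construction, yet the heavy-tailed sample has a larger 95th-percentile absolute error, so a ranking based on the MAE alone would not distinguish them.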

Publisher

IOP Publishing

Subject

Artificial Intelligence, Human-Computer Interaction, Software

Link

https://iopscience.iop.org/article/10.1088/2632-2153/aba184/pdf

References: 32 articles.

1. Prediction uncertainty of density functional approximations for properties of crystals with cubic symmetry;Pernot;J. Phys. Chem. A,2015

2. Probabilistic performance estimators for computational chemistry methods: the empirical cumulative distribution function of absolute errors;Pernot;J. Chem. Phys.,2018

3. Intensive atomization energy: Re-thinking a metric for electronic structure theory methods;Perdew;Z. Phys. Chem.,2016

4. Is the error on first-principles volume predictions absolute or relative?;Lejaeghere;Comput. Mater. Sci.,2016

5. Prediction errors of molecular machine learning models lower than hybrid DFT error;Faber;J. Chem. Theory Comput.,2017

Cited by 17 articles.

1. An ensemble‐based approach to estimate confidence of predicted protein–ligand binding affinity values;Molecular Informatics;2024-02-15

2. The central role of density functional theory in the AI age;Science;2023-07-14

3. Statistical Equivalence of Quantum Chemical Methods for Energy Distribution in Water Clusters;ChemPhysChem;2023-02-28

4. Prediction uncertainty validation for computational chemists;The Journal of Chemical Physics;2022-10-14

5. QDataSet, quantum datasets for machine learning;Scientific Data;2022-09-23