Affiliation:
1. Department of Chemistry Ben Gurion University of the Negev P.O.B 653 Beer-Sheva 8410501 Israel
Abstract
AbstractThe chemistry community is currently witnessing a surge of scientific discoveries in organic chemistry supported by machine learning (ML) techniques. Whereas many of these techniques were developed for big data applications, the nature of experimental organic chemistry often confines practitioners to small datasets. Herein, we touch upon the limitations associated with small data in ML and emphasize the impact of bias and variance on constructing reliable predictive models. We aim to raise awareness to these possible pitfalls, and thus, provide an introductory guideline for good practice. Ultimately, we stress the great value associated with statistical analysis of small data, which can be further boosted by adopting a holistic data‐centric approach in chemistry.
Funder
Israel Science Foundation
Subject
General Chemistry,Catalysis
Reference95 articles.
1. C. Magee The Age of Imagination: Coming Soon to a Civilization Near You 1993.
2. B. Saha D. Srivastava 2014 IEEE 30thinternational conference on data engineering IEEE 2014 pp. 1294–1297.
3.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献