Abstract
AbstractThere is a paradigm shift towards data-centric AI, where model efficacy relies on quality, unified data. The common research analytics and data lifecycle environment (CRADLE™) is an infrastructure and framework that supports a data-centric paradigm and materials data science at scale through heterogeneous data management, elastic scaling, and accessible interfaces. We demonstrate CRADLE’s capabilities through five materials science studies: phase identification in X-ray diffraction, defect segmentation in X-ray computed tomography, polymer crystallization analysis in atomic force microscopy, feature extraction from additive manufacturing, and geospatial data fusion. CRADLE catalyzes scalable, reproducible insights to transform how data is captured, stored, and analyzed.
Graphical abstract
Funder
National Science Foundation
National Nuclear Security Administration
Publisher
Springer Science and Business Media LLC
Reference50 articles.
1. C. Draxl, M. Scheffler, NOMAD: the FAIR concept for big data-driven materials science. MRS Bull. 43, 676–682 (2018). https://doi.org/10.1557/mrs.2018.208
2. L. Himanen, A. Geurts, A.S. Foster, P. Rinke, Data-driven materials science: status, challenges, and perspectives. Adv. Sci. 6, 1900808 (2019). https://doi.org/10.1002/advs.201900808
3. The Minerals, Metals & Materials Society, Building a Materials Data Infrastructure: Opening New Pathways to Discovery and Innovation in Science and Engineering (The Minerals, Metals & Materials Society, 2017). https://doi.org/10.7449/mdistudy_1
4. R. Stevens, V. Taylor, J. Nichols, A.B. Maccabe, K. Yelick, D. Brown, AI for Science: Report on the Department of Energy (DOE) Town Halls on Artificial Intelligence (AI) for Science (Argonne National Lab (ANL), Argonne, 2020)
5. J. Carter, J. Feddema, D. Kothe, R. Neely, J. Pruet, R. Stevens, AI for Science, Energy, and Security Report (Argonne National Lab (ANL), Argonne, 2020)