Predicting High‐Performance Thermoelectric Materials With StarryData2

Author:

Parse Nuttawat1,Recatala‐Gomez Jose2,Zhu Ruiming23,Low Andre KY23,Hippalgaonkar Kedar23,Mato Tomoya4,Katsura Yukari56,Pinitsoontorn Supree17ORCID

Affiliation:

1. Department of Physics Faculty of Science Khon Kaen University Khon Kaen 40002 Thailand

2. School of Materials Science and Engineering Nanyang Technological University Singapore 639798 Singapore

3. Institute of Materials Research and Engineering Agency for Science Technology and Research (A*STAR) Singapore 138634 Republic of Singapore

4. Materials Modelling Group Data‐driven Materials Research Field Centre for Basic Research on Materials NIMS Tsukuba 305–0047 Japan

5. National Institute for Materials Science Ibaraki 305–0047 Japan

6. Department of Advanced Materials Science The University of Tokyo Kashiwa 277–8561 Japan

7. Institute of Nanomaterials Research and Innovation for Energy (IN‐RIE) Khon Kaen University Khon Kaen 40002 Thailand

Abstract

AbstractIn recent years, machine learning (ML) has emerged as a potential tool in the exploration of thermoelectric (TE) materials. This study exploits the StarryData2 public database to construct an ML model for predicting the figure‐of‐merit ZT of TE materials. The original dataset from StarryData2 (372,480 datapoints) underwent systematic cleaning, resulting in a refined dataset of 18,126 instances with 2,761 unique compounds. The cleaned data is employed to train an XGBoost regressor model, utilizing chemical formulas of TE compounds as features to predict ZT at given temperatures. The XGBoost regressor exhibited high prediction accuracy, achieving the coefficient of determination (R2) scores of 0.815 and mean absolute error (MAE) of 0.103 for the test set, further evaluated through cross‐validation across 5 folds. The learning curve analysis demonstrated improved model performance with increased training data. Furthermore, the contributions of different chemical descriptors to ZT are analyzed based on feature importance analysis. Beyond conventional TE families in the training set, the trained model is applied to predict ZT for promising unexplored TE materials and estimate optimal doping concentrations. This comprehensive study shows the impact of ML on TE material research, offering valuable insights and accelerating the discovery of materials with enhanced TE properties.

Funder

Khon Kaen University

National Research Council of Thailand

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3