Affiliation:
1. Chemical Data‐Driven Research Center Korea Research Institute of Chemical Technology Daejeon 34114 Republic of Korea
2. Interface Materials and Engineering Laboratory Korea Research Institute of Chemical Technology Daejeon 34114 Republic of Korea
Abstract
AbstractMachine learning can be used to predict the properties of polymers and explore vast chemical spaces. However, the limited number of available experimental datasets hinders the enhancement of the predictive performance of a model. This study proposes a machine learning approach that leverages transfer learning and ensemble modeling to efficiently predict the glass transition temperature (Tg) of fluorinated polymers and guide the design of high Tg copolymers. Initially, the quantum machine 9 (QM9) dataset is employed for model pretraining, thus providing robust molecular representations for the subsequent fine‐tuning of a specialized copolymer dataset. Ensemble modeling is used to further enhance prediction robustness and reliability, effectively addressing the problems owing to the limited and unevenly distributed nature of the copolymer dataset. Finally, a fine‐tuned ensemble model is used to navigate a vast chemical space comprising 61 monomers and identify promising candidates for high Tg fluorinated polymers. The model predicts 247 entries capable of achieving a Tg over 390 K, of which 14 are experimentally validated. This study demonstrates the potential of machine learning in material design and discovery, highlighting the effectiveness of transfer learning and ensemble modeling strategies for overcoming the challenges posed by small datasets in complex copolymer systems.
Funder
Korea Research Institute of Chemical Technology