Abstract
Abstract
Graph Neural Networks (GNNs) exhibit potential in predicting the properties of molecules, but computational analyses with the GNNs often encounter the problem of data imbalance or overfitting. Augmentation techniques have emerged as a popular solution, and adversarial perturbation to node features achieves a significant improvement in enhancing the model's generalization capacity. Despite remarkable advancement, there is scarce research about systematically tuning the adversarial augmentation. We propose a new framework for an adversarial perturbation with influential graph features. Our method selects the data to apply adversarial augmentation based on the one-step influence function that measures the influence of each training sample on prediction in each iteration. In particular, the approximation of the one-step influence function has wide applicability to evaluate a model's validity on the observation level for a large-scale neural network. Selected data using the one-step influence function are likely to be located near the decision boundary, and experimental results demonstrated that augmentation of such data has improved the model's performance.
Publisher
Research Square Platform LLC
Reference51 articles.
1. Chen, Hongming and Engkvist, Ola and Wang, Yinhai and Olivecrona, Marcus and Blaschke, Thomas (2018) The rise of deep learning in drug discovery. Drug discovery today 23(6): 1241--1250 https://doi.org/10.1016/j.drudis.2018.01.039
2. Gajewicz-Skretna, Agnieszka and Kar, Supratik and Piotrowska, Magdalena and Leszczynski, Jerzy (2021) The kernel-weighted local polynomial regression (KwLPR) approach: an efficient, novel tool for development of QSAR/QSAAR toxicity extrapolation models. Journal of cheminformatics 13: 1--20 Springer
3. Andy H. Vo and Terry R Van Vleet and Rishi Raj Gupta and Michael J. Liguori and Mohan S. Rao (2020) An Overview of Machine Learning and Big Data for Drug Toxicity Evaluation.. Chemical research in toxicology 33(1): 20--37 https://doi.org/10.1021/acs.chemrestox.9b00227
4. Ren, Gao-Peng and Yin, Yi-Jian and Wu, Ke-Jun and He, Yuchen (2023) Force field-inspired molecular representation learning for property prediction. Journal of Cheminformatics 15(1): 17 Springer
5. Gilmer, Justin and Schoenholz, Samuel S and Riley, Patrick F and Vinyals, Oriol and Dahl, George E (2017) Neural message passing for quantum chemistry. PMLR, 1263--1272, 70, International Conference on Machine Learning