Abstract
AbstractIntelligent systems are applied in a wide range of areas, and computer-aided drug design is a highly important one. One major approach to drug design is the inverse QSAR/QSPR (quantitative structure-activity and structure-property relationship), for which a method that uses both artificial neural networks (ANN) and mixed integer linear programming (MILP) has been proposed recently. This method consists of two phases: a forward prediction phase, and an inverse, inference phase. In the prediction phase, a feature function f over chemical compounds is defined, whereby a chemical compound G is represented as a vector f(G) of descriptors. Following, for a given chemical property $$\pi$$, using a dataset of chemical compounds with known values for property $$\pi$$, a regressive prediction function $$\psi$$ is computed by an ANN. It is desired that $$\psi (f(G))$$ takes a value that is close to the true value of property $$\pi$$ for the compound G for many of the compounds in the dataset. In the inference phase, one starts with a target value $$y^*$$ of the chemical property $$\pi$$, and then a chemical structure $$G^*$$ such that $$\psi (f(G^*))$$ is within a certain tolerance level of $$y^*$$ is constructed from the solution to a specially formulated MILP. This method has been used for the case of inferring acyclic chemical compounds. With this paper, we propose a new concept on acyclic chemical graphs, called a skeleton tree, and based on it develop a new MILP formulation for inferring acyclic chemical compounds. Our computational experiments indicate that our newly proposed method significantly outperforms the existing method when the diameter of graphs is up to 8. In a particular example where we inferred acyclic chemical compounds with 38 non-hydrogen atoms from the set {C, O, S} times faster.
Funder
Japan Society for the Promotion of Science
Publisher
Springer Science and Business Media LLC
Reference26 articles.
1. Akutsu T, Fukagawa D, Jansson J, Sadakane K (2012) Inferring a graph from path frequency. Discrete Applied Mathematics 160(10–11):1416–1428
2. Akutsu, T, Nagamochi, H: A novel method for inference of chemical compounds with prescribed topological substructures based on integer programming, arXiv: 2010.09203 (2020)
3. Akutsu, T, Nagamochi, H: A mixed integer linear programming formulation to artificial neural networks. In: Proceedings of the 2nd international conference on information science and systems. pp 215–220. ACM (2019)
4. Azam, NA, Chiewvanichakorn, R, Zhang, F, Shurbevski, A, Nagamochi, H, Akutsu, T: A method for the inverse QSAR/QSPR based on artificial neural networks and mixed integer linear programming with guaranteed admissibility. In: Proceedings of the 13th international joint conference on biomedical engineering systems and technologies – Volume 3: BIOINFORMATICS. pp 101–108 (2020)
5. Azam, NA, Zhu, J, Sun, Y, Shi, Y, Shurbevski, A, Zhao, L, Nagamochi, H, Akutsu, T: A novel method for inference of acyclic chemical compounds with bounded branch-height based on artificial neural networks and integer programming, arXiv:2009.09646 (2020)