Abstract
AbstractBackgroundAnomaly detection in graphs is critical in various domains, notably in medicine and biology, where anomalies often encapsulate pivotal information. Here, we focused on network analysis of molecular interactions between proteins, which is commonly used to study and infer the impact of proteins on health and disease. In such a network, an anomalous protein might indicate its impact on the organism’s health.ResultsWe propose Weighted Graph Anomalous Node Detection (WGAND), a novel machine learning-based method for detecting anomalies in weighted graphs. WGAND is based on the observation that edge patterns of anomalous nodes tend to deviate significantly from expected patterns. We quantified these deviations to generate features, and utilized the resulting features to model the anomaly of nodes, resulting in node anomaly scores. We created four variants of the WGAND methods and compared them to two previously-published (baseline) methods. We evaluated WGAND on data of protein interactions in 17 human tissues, where anomalous nodes corresponded to proteins with major roles in tissue contexts. In 13 of the tissues, WGAND obtained higher AUC and P@K than baseline methods. We demonstrate that WGAND effectively identified proteins that participate in tissue-specific processes and diseases.ConclusionWe present WGAND, a new approach to anomaly detection in weighted graphs. Our results underscore its capability to highlight critical proteins within protein-protein interaction networks. WGAND holds the promise to enhance our understanding of intricate biological processes and might pave the way for novel therapeutic strategies targeting tissue-specific diseases. Its versatility ensures its applicability across diverse weighted graphs, making it a robust tool for detecting anomalous nodes.
Publisher
Cold Spring Harbor Laboratory
Reference35 articles.
1. Fatima Mohamad Dakalbab. Machine learning for anomaly detection: A systematic review;Ieee Access,2021
2. Graph based anomaly detection and description: a survey
3. Meng-Chieh Lee , Hung T Nguyen , Dimitris Berberidis , Vincent S Tseng , and Leman Akoglu . Gawd: graph anomaly detection in weighted directed graph databases. In Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, pages 143–150, 2021.
4. Leman Akoglu , Mary McGlohon , and Christos Faloutsos . Oddball: Spotting anomalies in weighted graphs. In Advances in Knowledge Discovery and Data Mining, pages 410–421. Springer, 2010.