Abstract
ABSTRACTUnderstanding the patterns of viral disease transmissions helps establish public health policies and aids in controlling and ending a disease outbreak. Classical methods for studying disease transmission dynamics that rely on epidemiological data, such as times of sample collection and duration of exposure intervals, struggle to provide desired insight due to limited informativeness of such data. A more precise characterization of disease transmissions may be acquired from sequencing data that reveals genetic distance between viral populations in patient samples. Indeed, genetic distance between viral strains present in hosts contains valuable information about transmission history, thus motivating the design of methods that rely on genomic data to reconstruct a directed disease transmission network, detect transmission clusters, and identify significant network nodes (e.g., super-spreaders). In this paper, we present a novel end-to-end framework for the analysis of viral transmissions utilizing viral genomic (sequencing) data. The proposed framework groups infected hosts into transmission clusters based on reconstructed viral quasispecies; the genetic distance between a pair of hosts is calculated using Earth Mover’s Distance, and further used to infer transmission direction between the hosts. To quantify the significance of a host in the transmission network, the importance score is calculated by a graph convolutional auto-encoder. The viral transmission network is represented by a directed minimum spanning tree utilizing the Edmond’s algorithm modified to incorporate constraints on the importance scores of the hosts. Results on realistic synthetic as well as experimental data demonstrate that the proposed framework outperforms state-of-the-art techniques for the analysis of viral transmission dynamics.CCS CONCEPTSApplied computing → Bioinformatics.ACM Reference FormatZiqi Ke and Haris Vikalo. 2022. Graph-Based Reconstruction and Analysis of Disease Transmission Networks using Viral Genomic Data. In Proceedings of The Seventh International Workshop on Computational Network Biology (CNB-MAC 2022). ACM, New York, NY, USA, 10 pages. https://doi.org/XXXXXXX.XXXXXXX
Publisher
Cold Spring Harbor Laboratory
Reference49 articles.
1. Gundlapalli Adi , Ma Xiulian , Benuzillo Jose , Pettey Warren , Greenberg Richard , Hales Joseph , Leecaster Molly , and Samore Matthew . 2009. Social network analyses of patient-healthcare worker interactions: implications for disease transmission. AMIA Annu Symp Proc (2009), 213/7.
2. Viral quasispecies reconstruction via tensor factorization with successive read removal
3. Inferring epidemiological links from deep sequencing data: a statistical learning approach for human, animal and plant diseases;Phil. Trans.R. Soc,2019
4. Modelling the spread of infectious salmon anaemia among salmon farms based on seaway distances between farms and genetic relationships between infectious salmon anaemia virus isolates;Journal of the Royal Society, Interface,2011
5. Bayesian inference of transmission chains using timing of symptoms, pathogen genomes and contact data;PLoS Comput Biol,2019