A bioinformatics tool for identifying intratumoral microbes from the ORIEN dataset
Author:
Wang CankunORCID, Ma AnjunORCID, McNutt Megan E., Hoyd RebeccaORCID, Wheeler Caroline E.ORCID, Robinson Lary A.ORCID, Chan Carlos H.F.ORCID, Zakharia Yousef, Dodd Rebecca D., Ulrich Cornelia M.ORCID, Hardikar SheetalORCID, Churchman Michelle L., Tarhini Ahmad A.ORCID, Singer Eric A., Ikeguchi Alexandra P.ORCID, McCarter Martin D.ORCID, Denko NicholasORCID, Tinoco GabrielORCID, Husain MariumORCID, Jin NingORCID, Osman Afaf E.G.ORCID, Eljilany IslamORCID, Tan Aik ChoonORCID, Coleman Samuel S., Denko Louis, Riedlinger GregoryORCID, Schneider Bryan P.ORCID, Spakowicz DanielORCID, Ma QinORCID
Abstract
ABSTRACTEvidence supports significant interactions among microbes, immune cells, and tumor cells in at least 10–20% of human cancers, emphasizing the importance of further investigating these complex relationships. However, the implications and significance of tumor-related microbes remain largely unknown. Studies have demonstrated the critical roles of host microbes in cancer prevention and treatment responses. Understanding interactions between host microbes and cancer can drive cancer diagnosis and microbial therapeutics (bugs as drugs). Computational identification of cancer-specific microbes and their associations is still challenging due to the high dimensionality and high sparsity of intratumoral microbiome data, which requires large datasets containing sufficient event observations to identify relationships, and the interactions within microbial communities, the heterogeneity in microbial composition, and other confounding effects that can lead to spurious associations. To solve these issues, we present a bioinformatics tool, MEGA, to identify the microbes most strongly associated with 12 cancer types. We demonstrate its utility on a dataset from a consortium of 9 cancer centers in the Oncology Research Information Exchange Network (ORIEN). This package has 3 unique features: species-sample relations are represented in a heterogeneous graph and learned by a graph attention network; it incorporates metabolic and phylogenetic information to reflect intricate relationships within microbial communities; and it provides multiple functionalities for association interpretations and visualizations. We analyzed 2704 tumor RNA-seq samples and MEGA interpreted the tissue-resident microbial signatures of each of 12 cancer types. MEGA can effectively identify cancer-associated microbial signatures and refine their interactions with tumors.SIGNIFICANCEStudying the tumor microbiome in high-throughput sequencing data is challenging because of the extremely sparse data matrices, heterogeneity, and high likelihood of contamination. We present a new deep-learning tool, microbial graph attention (MEGA), to refine the organisms that interact with tumors.
Publisher
Cold Spring Harbor Laboratory
Reference25 articles.
1. The human microbiome: at the interface of health and disease 2. The Role of The Tumor Microbiome in Tumor Development and Its Treatment;Front Immunol,2022 3. Hoyd, R. , Wheeler, C.E. , Liu, Y. , Singh, M.J. , Muniak, M. , Denko, N. , Carbone, D. , Mo, X. and Spakowicz, D. (2022) Exogenous sequences in tumors and immune cells (exotic): a tool for estimating the microbe abundances in tumor RNAseq data. 4. Hu, Z. , Dong, Y. , Wang, K. and Sun, Y. (2020) Heterogeneous Graph Transformer. 5. Single-cell biological network inference using a heterogeneous graph transformer
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|