Author:
Muralidharan Sachin,Ali Sarah,Yang Lilin,Badshah Joshua,Zahir Syeda Farah,Ali Rubbiya A.,Chandra Janin,Frazer Ian H.,Thomas Ranjeny,Mehdi Ahmed M.
Abstract
AbstractThe purpose of this study is to manually and semi-automatically curate a database and develop an R package that will act as a comprehensive resource to understand how biological processes are dysregulated due to interactions with environmental factors. The initial database search run on the Gene Expression Omnibus and the Molecular Signature Database retrieved a total of 90,018 articles. After title and abstract screening against pre-set criteria, a total of 237 datasets were selected and 522 gene modules were manually annotated. We then curated a database containing four environmental factors, cigarette smoking, diet, infections and toxic chemicals, along with a total of 25,789 genes that had an association with one or more of gene modules. The database and statistical analysis package was then tested with the differentially expressed genes obtained from the published literature related to type 1 diabetes, rheumatoid arthritis, small cell lung cancer, COVID-19, cobalt exposure and smoking. On testing, we uncovered statistically enriched biological processes, which revealed pathways associated with environmental factors and the genes. The curated database and enrichment tool are available as R packages at https://github.com/AhmedMehdiLab/E.PATH and https://github.com/AhmedMehdiLab/E.PAGE respectively.
Publisher
Springer Science and Business Media LLC
Reference43 articles.
1. Skinner, M. K. Environmental epigenomics and disease susceptibility. EMBO Rep. 12(7), 620–622 (2011).
2. Vojdani, A., Pollard, K. M. & Campbell, A. W. Environmental triggers and autoimmunity. Autoimmune Dis. 2014, 798029 (2014).
3. Rappaport, S. M. Discovering environmental causes of disease. J. Epidemiol. Community Health 66(2), 99–102 (2012).
4. Vojdani, A. A potential link between environmental triggers and autoimmunity. Autoimmune Dis. 2014, 437231 (2014).
5. He, K. Y., Ge, D. & He, M. M. Big data analytics for genomic medicine. Int. J. Mol. Sci. 18(2), 412 (2017).
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献