Author:
Nazer Naghme,Sepehri Mohammad Hossein,Mohammadzade Hoda,Mehrmohamadi Mahya
Abstract
AbstractDNA methylation is a major epigenetic modification involved in many physiological processes. Normal methylation patterns are disrupted in many diseases and methylation-based biomarkers have shown promise in several contexts. Marker discovery typically involves the analysis of publicly available DNA methylation data from high-throughput assays. Numerous methods for identification of differentially methylated biomarkers have been developed, making the need for best practices guidelines and context-specific analyses workflows exceedingly high. To this end, here we propose TASA, a novel method for simulating methylation array data in various scenarios. We then comprehensively assess different data analysis workflows using real and simulated data and suggest optimal start-to-finish analysis workflows. Our study demonstrates that the choice of analysis pipeline for DNA methylation-based marker discovery is crucial and different across different contexts.
Funder
Converging Technologies Development Center of Vice Presidency for Science Technology and Knowledge-based Economy
Research and Technology Office of SUT
Iran National Science Foundation
Kazemi-Ashtiani from BMN
Publisher
Springer Science and Business Media LLC