Author:
Zhang Shuangquan,Yang Lili,Wu Xiaotian,Sheng Nan,Fu Yuan,Ma Anjun,Wang Yan
Abstract
AbstractAssay for Transposase-Accessible Chromatin sequencing (ATAC-seq) utilizes hyperactive Tn5 transposase to cut open chromatin and reveal chromatin accessibility at a genome-wide level. ATAC-seq can reveal more kinds of transcription factor binding regions than Chromatin immunoprecipitation sequencing (ChIP-seq) and DNase I hypersensitive sites sequencing (DNase-seq). Transcription factor binding sites (TFBSs) prediction is a crucial step to reveal the functions of TFs from the high throughput sequencing data. TFBSs of the same TF tend to be conserved in the sequence level, which is named motif. Several deep learning models based on the convolutional neural networks are used to find motifs from ATAC-seq data. However, these methods didn’t take into account that multiple TFs bind to a given sequence and the probability that a fragment of a given sequence is a TFBS. To find binding sites of multiple TFs, we developed a novel GNN model named GraphPred for TFBSs prediction and finding multiple motifs using the coexisting probability of k-mers. In the light of the experiment results, GraphPred can find more and higher quality motifs from 88 ATAC-seq datasets than comparison tools. Meanwhile, GraphPred achieved an area of eight metrics radar (AEMR) score of 2.31.
Publisher
Cold Spring Harbor Laboratory
Reference49 articles.
1. M. Bajic , K.A. Maher , R.B. Deal , Identification of open chromatin regions in plant genomes using ATAC-Seq, in: Plant Chromatin Dynamics, Springer, 2018, pp. 183–201.
2. Y. Wang , S. Zhang , A. Ma , C. Wang , Q. Ma , Assessing deep learning algorithms in cis -regulatory motif finding based on genomic sequencing data, (2020).
3. S. Zhang , A. Ma , J. Zhao , D. Xu , Q. Ma , Y. Wang , Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data, Briefings in Bioinformatics, (2021).
4. H. John , G. Charles , N.W. Stafford , T.L. Bailey , Assessing phylogenetic motif models for predicting transcription factor binding sites, Bioinformatics, i339–i347.
5. K. Vishnoi , N. Viswakarma , A. Rana , B. Rana , Transcription Factors in Cancer Development and Therapy, Cancers, 12.