Affiliation:
1. Federal State Institution Federal Research Centre «Fundamentals of Biotechnology», Russian Academy of Sciences, 119071 Moscow, Russia
Abstract
The exact identification of promoter sequences remains a serious problem in computational biology, as the promoter prediction algorithms under development continue to produce false-positive results. Therefore, to fully assess the validity of predicted sequences, it is necessary to perform a comprehensive test of their properties, such as the presence of downstream transcribed DNA regions behind them, or chromatin accessibility for transcription factor binding. In this paper, we examined the promoter sequences of chromosome 1 of the rice Oryza sativa genome from the Database of Potential Promoter Sequences predicted using a mathematical algorithm based on the derivation and calculation of statistically significant promoter classes. In this paper TATA motifs and cis-regulatory elements were identified in the predicted promoter sequences. We also verified the presence of potential transcription start sites near the predicted promoters by analyzing CAGE-seq data. We searched for unannotated transcripts behind the predicted sequences by de novo assembling transcripts from RNA-seq data. We also examined chromatin accessibility in the region of the predicted promoters by analyzing ATAC-seq data. As a result of this work, we identified the predicted sequences that are most likely to be promoters for further experimental validation in an in vivo or in vitro system.
Subject
Plant Science,Ecology,Ecology, Evolution, Behavior and Systematics
Reference30 articles.
1. The rice genome revolution: From an ancient grain to Green Super Rice;Wing;Nat. Rev. Genet.,2018
2. The map-based sequence of the rice genome;Khurana;Nature,2005
3. The Rice Annotation Project Database (RAP-DB): 2008 update;Tsuyoshi;Nucleic Acids Res.,2008
4. The TIGR Rice Genome Annotation Resource: Improvements and new features;Ouyang;Nucleic Acids Res.,2007
5. Yella, V.R., and Bansal, M. (2015). Systems and Synthetic Biology, Springer.