Phylogenomic Identification of Regulatory Sequences in Bacteria: an Analysis of Statistical Power and an Application to Borrelia burgdorferi Sensu Lato

Author:

Martin Che I.1,Sukarna Tika Y.1,Akther Saymon2,Ramrattan Girish2,Pagan Pedro2,Di Lia2,Mongodin Emmanuel F.3,Fraser Claire M.3,Schutzer Steven E.4,Luft Benjamin J.5,Casjens Sherwood R.6,Qiu Wei-Gang12ORCID

Affiliation:

1. Department of Biology, The Graduate Center, City University of New York, New York, USA

2. Department of Biological Sciences and Center for Translational and Basic Research, Hunter College, City University of New York, New York, USA

3. Institute for Genome Sciences, University of Maryland BioPark, Baltimore, Maryland, USA

4. Department of Medicine, New Jersey Medical School, Rutgers, the State University of New Jersey, Newark, New Jersey, USA

5. Department of Medicine, Health Science Center, Stony Brook University, Stony Brook, New York, USA

6. Department of Pathology, Division of Molecular Cell Biology and Immunology, University of Utah School of Medicine, Salt Lake City, Utah, USA

Abstract

ABSTRACT Phylogenomic footprinting is an approach for ab initio identification of genome-wide regulatory elements in bacterial species based on sequence conservation. The statistical power of the phylogenomic approach depends on the degree of sequence conservation, the length of regulatory elements, and the level of phylogenetic divergence among genomes. Building on an earlier model, we propose a binomial model that uses synonymous tree lengths as neutral expectations for determining the statistical significance of conserved intergenic spacer (IGS) sequences. Simulations show that the binomial model is robust to variations in the value of evolutionary parameters, including base frequencies and the transition-to-transversion ratio. We used the model to search for regulatory sequences in the Lyme disease species group ( Borrelia burgdorferi sensu lato ) using 23 genomes. The model indicates that the currently available set of Borrelia genomes would not yield regulatory sequences shorter than five bases, suggesting that genome sequences of additional B. burgdorferi sensu lato species are needed. Nevertheless, we show that previously known regulatory elements are indeed strongly conserved in sequence or structure across these Borrelia species. Further, we predict with sufficient confidence two new RpoS binding sites, 39 promoters, 19 transcription terminators, 28 noncoding RNAs, and four sets of coregulated genes. These putative cis - and trans -regulatory elements suggest novel, Borrelia -specific mechanisms regulating the transition between the tick and host environments, a key adaptation and virulence mechanism of B. burgdorferi . Alignments of IGS sequences are available on BorreliaBase.org, an online database of orthologous open reading frame (ORF) and IGS sequences in Borrelia . IMPORTANCE While bacterial genomes contain mostly protein-coding genes, they also house DNA sequences regulating the expression of these genes. Gene regulatory sequences tend to be conserved during evolution. By sequencing and comparing related genomes, one can therefore identify regulatory sequences in bacteria based on sequence conservation. Here, we describe a statistical framework by which one may determine how many genomes need to be sequenced and at what level of evolutionary relatedness in order to achieve a high level of statistical significance. We applied the framework to Borrelia burgdorferi , the Lyme disease agent, and identified a large number of candidate regulatory sequences, many of which are known to be involved in regulating the phase transition between the tick vector and mammalian hosts.

Publisher

American Society for Microbiology

Subject

Virology,Microbiology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3