TSS-Captur: A User-Friendly Characterization Pipeline for Transcribed but Unclassified RNA transcripts

Author:

Paz Mathias WitteORCID,Vogel Thomas,Nieselt KayORCID

Abstract

AbstractRNA-seq and its 5’-enrichment-based methods for prokaryotes have enabled the base-exact identification of transcription starting sites (TSSs) and have improved gene expression analysis. Computational methods analyze this experimental data to identify TSSs and classify them based on proximal annotated genes. While some TSSs cannot be classified at all (orphan TSSs), other TSSs are found on the reverse strand of known genes (antisense TSSs), but are not associated with the direct transcription of any known gene. Here, we introduceTSS-Captur, a novel pipeline, that uses computational approaches to characterize genomic regions starting from experimentally confirmed, but unclassified TSSs. By analyzing experimental TSS data,TSS-Capturcharacterizes unclassified signals, hence complementing prokaryotic genome annotation tools and enhancing the bacterial transcriptome understanding.TSS-Capturclassifies extracted transcripts into coding or non-coding genes and predicts for each putative transcript its transcription termination site. For non-coding genes, the secondary structure is computed. Furthermore, putative promoter regions are analyzed to identify enriched motifs. An interactive report allows a seamless data exploration. We validatedTSS-Capturwith aCampylobacter jejunidataset and characterized unlabeled non-coding RNAs inStreptomyces coelicolor. Besides its usage over the command-line,TSS-Capturis available as a web-application to enhance its user accessibility and explorative capabilities.

Publisher

Cold Spring Harbor Laboratory

Reference34 articles.

1. MEME SUITE: tools for motif discovery and searching

2. Fitting a mixture model by expectation maximization to discover motifs in biopolymers;Proceedings. International Conference on Intelligent Systems for Molecular Biology,1994

3. Differential RNA-seq (dRNA-seq) for annotation of transcriptional start sites and small RNAs in Helicobacter pylori

4. Inverse folding based pre-training for the reliable identification of intrinsic transcription terminators;PLOS Computational Biology,2022

5. Non-coding RNAs: what are we missing?

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3