PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments-Reference-Cited by-同舟云学术

PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments

Published:2020-09-21 Issue:11 Volume:30 Page:1655-1666
ISSN:1088-9051
Container-title:Genome Research
language:en
Short-container-title:Genome Res.

Author:

Liu Peng,Soukup Alexandra A.,Bresnick Emery H.,Dewey Colin N.^ORCID,Keleş Sündüz

Abstract

Publicly available RNA-seq data is routinely used for retrospective analysis to elucidate new biology. Novel transcript discovery enabled by joint analysis of large collections of RNA-seq data sets has emerged as one such analysis. Current methods for transcript discovery rely on a ‘2-Step’ approach where the first step encompasses building transcripts from individual data sets, followed by the second step that merges predicted transcripts across data sets. To increase the power of transcript discovery from large collections of RNA-seq data sets, we developed a novel ‘1-Step’ approach named Pooling RNA-seq and Assembling Models (PRAM) that builds transcript models from pooled RNA-seq data sets. We demonstrate in a computational benchmark that 1-Step outperforms 2-Step approaches in predicting overall transcript structures and individual splice junctions, while performing competitively in detecting exonic nucleotides. Applying PRAM to 30 human ENCODE RNA-seq data sets identified unannotated transcripts with epigenetic and RAMPAGE signatures similar to those of recently annotated transcripts. In a case study, we discovered and experimentally validated new transcripts through the application of PRAM to mouse hematopoietic RNA-seq data sets. We uncovered new transcripts that share a differential expression pattern with a neighboring gene Pik3cg implicated in human hematopoietic phenotypes, and we provided evidence for the conservation of this relationship in human. PRAM is implemented as an R/Bioconductor package.

Funder

National Institutes of Health

NIH

National Heart, Lung, and Blood Institute

National Institute of Diabetes and Digestive and Kidney Diseases

Carbone Cancer Center

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics(clinical),Genetics

Reference51 articles.

1. MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive

2. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses

3. BLAST+: architecture and applications

4. Reproducible RNA-seq analysis using recount2

5. Landscape of transcription in human cells

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transcriptomic and metabolomic analyses to study the key role by which Ralstonia insidiosa induces Listeria monocytogenes to form suspended aggregates;Frontiers in Microbiology;2023-10-12

2. Graph pangenome captures missing heritability and empowers tomato breeding;Nature;2022-06-08