Improved definition of the mouse transcriptome via targeted RNA sequencing-Reference-Cited by-同舟云学术

Improved definition of the mouse transcriptome via targeted RNA sequencing

Published:2016-05 Issue:5 Volume:26 Page:705-716
ISSN:1088-9051
Container-title:Genome Research
language:en
Short-container-title:Genome Res.

Author:

Bussotti Giovanni^ORCID,Leonardi Tommaso^ORCID,Clark Michael B.,Mercer Tim R.,Crawford Joanna,Malquori Lorenzo,Notredame Cedric,Dinger Marcel E.,Mattick John S.,Enright Anton J.^ORCID

Abstract

Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources.

Funder

National Health and Medical Research Council

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics (clinical),Genetics

Reference78 articles.

1. A novel 2 bp deletion in the TM4SF2 gene is associated with MRX58

2. Complex architecture and regulated expression of the Sox2ot locus during vertebrate development

3. lncRNAdb: a reference database for long noncoding RNAs

4. HTSeq--a Python framework to work with high-throughput sequencing data

Cited by 32 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. RNaseH-based ribodepletion of total planarian RNA improves detection of longer and non-polyadenylated transcripts;2024-07-21

2. IsoVis – a webserver for visualization and annotation of alternative RNA isoforms;Nucleic Acids Research;2024-05-06

3. A pan-tissue, pan-disease compendium of human orphan genes;2024-02-23

4. Fear extinction is regulated by the activity of long noncoding RNAs at the synapse;Nature Communications;2023-11-22

5. A High-Efficiency Capture-Based NGS Approach for Comprehensive Analysis of Mitochondrial Transcriptome;Analytical Chemistry;2023-11-08