A long-read RNA-seq approach to identify novel transcripts of very large genes

Published: 2020-06 Issue: 6 Volume: 30 Page: 885-897
ISSN: 1088-9051
Container-title: Genome Research
Language: en
Short-container-title: Genome Res.

Author:

Uapinyoying Prech, Goecks Jeremy, Knoblach Susan M., Panchapakesan Karuna, Bonnemann Carsten G., Partridge Terence A., Jaiswal Jyoti K., Hoffman Eric P.

Abstract

RNA-seq is widely used for studying gene expression, but commonly used sequencing platforms produce short reads that only span up to two exon junctions per read. This makes it difficult to accurately determine the composition and phasing of exons within transcripts. Although long-read sequencing improves this issue, it is not amenable to precise quantitation, which limits its utility for differential expression studies. We used long-read isoform sequencing combined with a novel analysis approach to compare alternative splicing of large, repetitive structural genes in muscles. Analysis of muscle structural genes that produce medium (Nrap: 5 kb), large (Neb: 22 kb), and very large (Ttn: 106 kb) transcripts in cardiac muscle, and fast and slow skeletal muscles identified unannotated exons for each of these ubiquitous muscle genes. This also identified differential exon usage and phasing for these genes between the different muscle types. By mapping the in-phase transcript structures to known annotations, we also identified and quantified previously unannotated transcripts. Results were confirmed by endpoint PCR and Sanger sequencing, which revealed muscle-type-specific differential expression of these novel transcripts. The improved transcript identification and quantification shown by our approach removes previous impediments to studies aimed at quantitative differential expression of ultralong transcripts.

Funder

National Institutes of Health

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics (clinical), Genetics

References: 59 articles.

1. Detecting differential usage of exons from RNA-seq data

2. HTSeq--a Python framework to work with high-throughput sequencing data

3. Roles of Nebulin Family Members in the Heart

4. The Complete Gene Sequence of Titin, Expression of an Unusual ≈700-kDa Titin Isoform, and Its Interaction With Obscurin Identify a Novel Z-Line to I-Band Linking System

5. Trimmomatic: a flexible trimmer for Illumina sequence data

Cited by 30 articles.

1. Differential inclusion of NEB exons 143 and 144 provides insight into NEB-related myopathy variant interpretation and disease manifestation;2024-03-26

2. Using long-read CAGE sequencing to profile cryptic-promoter-derived transcripts and their contribution to the immunopeptidome;Genome Research;2023-12

3. Tissue-specific transcriptome and metabolome analyses reveal candidate genes for lignan biosynthesis in the medicinal plant Schisandra sphenanthera;BMC Genomics;2023-10-11

4. Muscle growth and plasticity in teleost fish: the significance of evolutionarily diverse sarcomeric proteins;Reviews in Fish Biology and Fisheries;2023-08-17

5. Beyond the exome: What’s next in diagnostic testing for Mendelian conditions;The American Journal of Human Genetics;2023-08