Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences-Reference-Cited by-同舟云学术

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

Published:2016-02-29 Issue: Volume:4 Page:1521
ISSN:2046-1402
Container-title:F1000Research
language:en
Short-container-title:F1000Res

Author:

Soneson Charlotte^ORCID,Love Michael I.^ORCID,Robinson Mark D.^ORCID

Abstract

High-throughput sequencing of cDNA (RNA-seq) is used extensively to characterize the transcriptome of cells. Many transcriptomic studies aim at comparing either abundance levels or the transcriptome composition between given conditions, and as a first step, the sequencing reads must be used as the basis for abundance quantification of transcriptomic features of interest, such as genes or transcripts. Various quantification approaches have been proposed, ranging from simple counting of reads that overlap given genomic regions to more complex estimation of underlying transcript abundances. In this paper, we show that gene-level abundance estimates and statistical inference offer advantages over transcript-level analyses, in terms of performance and interpretability. We also illustrate that the presence of differential isoform usage can lead to inflated false discovery rates in differential gene expression analyses on simple count matrices but that this can be addressed by incorporating offsets derived from transcript-level abundance estimates. We also show that the problem is relatively minor in several real data sets. Finally, we provide an R package (tximport) to help users integrate transcript-level abundance estimates from common quantification pipelines into count-based statistical inference engines.

Publisher

F1000 Research Ltd

Subject

General Pharmacology, Toxicology and Pharmaceutics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine

Link

https://f1000research.com/articles/4-1521/v2/pdf

Reference35 articles.

1. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features.;Y Liao;Bioinformatics.,2014

2. HTSeq--a Python framework to work with high-throughput sequencing data.;S Anders;Bioinformatics.,2015

3. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.;C Trapnell;Nat Protoc.,2012

4. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.;B Li;BMC Bioinformatics.,2011

5. Identifying differentially expressed transcripts from RNA-seq data with biological variation.;P Glaus;Bioinformatics.,2012

Cited by 2395 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unravelling the intricate language of fish guts: Impact of plant-based vs. plant-insect-poultry-based diets on intestinal pathways in European seabass;Aquaculture;2025-01

2. The PAH1-encoded phosphatidate phosphatase of Yarrowia lipolytica differentially affects gene expression and lipid biosynthesis;Biochimica et Biophysica Acta (BBA) - Molecular and Cell Biology of Lipids;2024-12

3. New potential diagnostic markers for verrucous hyperplasia and verrucous carcinoma based on RNA-sequencing data;Molecular and Cellular Probes;2024-10

4. Metatranscriptomic responses of High-Arctic tundra soil microbiomes to carbon input;Soil Biology and Biochemistry;2024-10

5. lncRNA-gene network analysis reveals the effects of early maternal nutrition on mineral homeostasis and energy metabolism in the fetal liver transcriptome of beef heifers;The Journal of Nutritional Biochemistry;2024-10