Alignment and mapping methodology influence transcript abundance estimation-Reference-Cited by-同舟云学术

Alignment and mapping methodology influence transcript abundance estimation

Published:2020-09-07 Issue:1 Volume:21 Page:
ISSN:1474-760X
Container-title:Genome Biology
language:en
Short-container-title:Genome Biol

Author:

Srivastava Avi,Malik Laraib,Sarkar Hirak,Zakeri Mohsen,Almodaresi Fatemeh,Soneson Charlotte,Love Michael I.,Kingsford Carl,Patro Rob

Abstract

Abstract Background The accuracy of transcript quantification using RNA-seq data depends on many factors, such as the choice of alignment or mapping method and the quantification model being adopted. While the choice of quantification model has been shown to be important, considerably less attention has been given to comparing the effect of various read alignment approaches on quantification accuracy. Results We investigate the influence of mapping and alignment on the accuracy of transcript quantification in both simulated and experimental data, as well as the effect on subsequent differential expression analysis. We observe that, even when the quantification model itself is held fixed, the effect of choosing a different alignment methodology, or aligning reads using different parameters, on quantification estimates can sometimes be large and can affect downstream differential expression analyses as well. These effects can go unnoticed when assessment is focused too heavily on simulated data, where the alignment task is often simpler than in experimentally acquired samples. We also introduce a new alignment methodology, called selective alignment, to overcome the shortcomings of lightweight approaches without incurring the computational cost of traditional alignment. Conclusion We observe that, on experimental datasets, the performance of lightweight mapping and alignment-based approaches varies significantly, and highlight some of the underlying factors. We show this variation both in terms of quantification and downstream differential expression analysis. In all comparisons, we also show the improved performance of our proposed selective alignment method and suggest best practices for performing RNA-seq quantification.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s13059-020-02151-8.pdf

Reference58 articles.

1. Lister R, O’Malley RC, Tonti-Filippini J, Gregory BD, Berry CC, Harvey Millar A, Ecker JR. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell. 2008; 133(3):523–36.

2. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M. The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008; 320(5881):1344–9.

3. Mortazavi A, Williams B, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008; 5(7):621.

4. Patro R, Mount SM, Kingsford C. Sailfish enables alignment-free isoform quantification from RNA-seq reads using lightweight algorithms. Nat Biotechnol. 2014; 32(5):462.

5. Bray NL, Pimentel H. Páll Melsted, and Lior Pachter. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016; 34(5):525.

Cited by 112 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Patient subtyping analysis of baseline multi-omic data reveals distinct pre-immune states associated with antibody response to seasonal influenza vaccination;Clinical Immunology;2024-09

2. Exploring the effects of assembly strategies on differential gene expression – A case study in a non-model crustacean species, the wild black tiger prawn (Penaeus monodon);2024-08-26

3. Transcriptomic profiling of gill biopsies to define predictive markers for seawater survival in farmed Atlantic salmon;2024-08-21

4. A comprehensiveSchizosaccharomyces pombeatlas of physical transcription factor interactions with proteins and chromatin;2024-08-20

5. Refining dual RNA-seq mapping: sequential and combined approaches in host-parasite plant dynamics;2024-07-29