Abstract
Background: Conventional differential gene expression analysis pipelines for non-model organisms require computationally expensive transcriptome assembly. We recently proposed an alternative strategy of directly aligning RNA-seq reads to a protein database, and demonstrated drastic improvements in speed, memory usage, and accuracy in identifying differentially expressed genes.

Result: Here we report a further speed-up obtained by replacing DNA-protein alignment with quasi-mapping, making our pipeline >1000× faster than the assembly-based approach while remaining more accurate. We also compare quasi-mapping to other mapping techniques, and show that it is faster, but at the cost of sensitivity.

Conclusion: We provide a quick-and-dirty differential gene expression analysis pipeline for non-model organisms without a reference transcriptome, which directly quasi-maps RNA-seq reads to a reference protein database, avoiding computationally expensive transcriptome assembly.
Publisher
Cold Spring Harbor Laboratory
References (32 articles)
1. Challenges and strategies in transcriptome assembly and differential gene expression quantification: a comprehensive in-silico assessment of RNA-seq experiments. Molecular Ecology, 2012
2. Hsieh, P.-H., Oyang, Y.-J., Chen, C.-Y.: Effect of de novo transcriptome assembly on transcript quantification. Scientific Reports 9(1) (2019)
3. Shrestha, A.M.S., Guiao, J.E.B., Santiago, K.C.L.: Assembly-free rapid differential gene expression analysis in non-model organisms using DNA-protein alignment. BMC Genomics 23(1) (2022)
4. Ultrafast functional profiling of RNA-seq data for nonmodel organisms
5. Salmon provides fast and bias-aware quantification of transcript expression. Nature Methods, 2017