SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms
Author:
Pardo-Palacios Francisco J.ORCID, Arzalluz-Luque AngelesORCID, Kondratova Liudmyla, Salguero PedroORCID, Mestre-Tomás JorgeORCID, Amorín Rocío, Estevan-Morió Eva, Liu TianyuanORCID, Nanni AdalenaORCID, McIntyre LaurenORCID, Tseng ElizabethORCID, Conesa AnaORCID
Abstract
AbstractThe emergence of long-read RNA sequencing (lrRNA-seq) has provided an unprecedented opportunity to analyze transcriptomes at isoform resolution. However, the technology is not free from biases, and transcript models inferred from these data require quality control and curation. In this study, we introduce SQANTI3, a tool specifically designed to perform quality analysis on transcriptomes constructed using lrRNA-seq data. SQANTI3 provides an extensive naming framework to describe transcript model diversity in comparison to the reference transcriptome. Additionally, the tool incorporates a wide range of metrics to characterize various structural properties of transcript models, such as transcription start and end sites, splice junctions, and other structural features. These metrics can be utilized to filter out potential artifacts. Moreover, SQANTI3 includes a Rescue module that prevents the loss of known genes and transcripts exhibiting evidence of expression but displaying low-quality features. Lastly, SQANTI3 incorporates IsoAnnotLite, which enables functional annotation at the isoform level and facilitates functional iso-transcriptomics analyses. We demonstrate the versatility of SQANTI3 in analyzing different data types, isoform reconstruction pipelines, and sequencing platforms, and how it provides novel biological insights into isoform biology. The SQANTI3 software is available athttps://github.com/ConesaLab/SQANTI3.
Publisher
Cold Spring Harbor Laboratory
Reference48 articles.
1. Method of the year 2022: long-read sequencing 20(1), 1–1. https://doi.org/10.1038/s41592-022-01759-x. Number: 1 Publisher: Nature Publishing Group. Accessed 2023-02-06 2. Ding, C. , Yan, X. , Xu, M. , Zhou, R. , Zhao, Y. , Zhang, D. , Huang, Z. , Pan, Z. , Xiao, P. , Li, H. , Chen, L. , Wang, Y. : Short-read and long-read full-length transcriptome of mouse neural stem cells across neurodevelopmental stages 9(1), 69. https://doi.org/10.1038/s41597-022-01165-0. Number: 1 Publisher: Nature Publishing Group. Accessed 2022-11-08 3. Tilgner, H. , Grubert, F. , Sharon, D. , Snyder, M.P. : Defining a personal, allele-specific, and single-molecule long-read transcriptome 111(27), 9869–9874. https://doi.org/10.1073/pnas.1400447111. Publisher: Proceedings of the National Academy of Sciences. Accessed 2022-11-08 4. Singh, M. , Al-Eryani, G. , Carswell, S. , Ferguson, J.M. , Blackburn, J. , Barton, K. , Roden, D. , Luciani, F. , Giang Phan, T. , Junankar, S. , Jackson, K. , Goodnow, C.C. , Smith, M.A. , Swarbrick, A. : High-throughput targeted long-read single cell sequencing reveals the clonal and transcriptional landscape of lymphocytes 10(1), 3120. https://doi.org/10.1038/s41467-019-11049-4. Number: 1 Publisher: Nature Publishing Group. Accessed 2022-11-08 5. Wang, B. , Tseng, E. , Regulski, M. , Clark, T.A. , Hon, T. , Jiao, Y. , Lu, Z. , Olson, A. , Stein, J.C. , Ware, D. : Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing 7(1), 11708. https://doi.org/10.1038/ncomms11708. Number: 1 Publisher: Nature Publishing Group. Accessed 2022-11-08
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|