SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms

Author:

Pardo-Palacios Francisco J., Arzalluz-Luque Angeles, Kondratova Liudmyla, Salguero Pedro, Mestre-Tomás Jorge, Amorín Rocío, Estevan-Morió Eva, Liu Tianyuan, Nanni Adalena, McIntyre Lauren, Tseng Elizabeth, Conesa Ana

Abstract

The emergence of long-read RNA sequencing (lrRNA-seq) has provided an unprecedented opportunity to analyze transcriptomes at isoform resolution. However, the technology is not free from biases, and transcript models inferred from these data require quality control and curation. In this study, we introduce SQANTI3, a tool specifically designed to perform quality analysis on transcriptomes constructed using lrRNA-seq data. SQANTI3 provides an extensive naming framework to describe transcript model diversity in comparison to the reference transcriptome. Additionally, the tool incorporates a wide range of metrics to characterize various structural properties of transcript models, such as transcription start and end sites, splice junctions, and other structural features. These metrics can be utilized to filter out potential artifacts. Moreover, SQANTI3 includes a Rescue module that prevents the loss of known genes and transcripts exhibiting evidence of expression but displaying low-quality features. Lastly, SQANTI3 incorporates IsoAnnotLite, which enables functional annotation at the isoform level and facilitates functional iso-transcriptomics analyses. We demonstrate the versatility of SQANTI3 in analyzing different data types, isoform reconstruction pipelines, and sequencing platforms, and how it provides novel biological insights into isoform biology. The SQANTI3 software is available at https://github.com/ConesaLab/SQANTI3.

Publisher

Cold Spring Harbor Laboratory

