Splam: a deep-learning-based splice site predictor that improves spliced alignments

Published: 2023-07-29

Author:

Chao Kuan-Hao, Mao Alan, Salzberg Steven L, Pertea Mihaela

Abstract

The process of splicing messenger RNA to remove introns plays a central role in creating genes and gene variants. Here we describe Splam, a novel method for predicting splice junctions in DNA based on deep residual convolutional neural networks. Unlike some previous models, Splam looks at a relatively limited window of 400 base pairs flanking each splice site, motivated by the observation that the biological process of splicing relies primarily on signals within this window. Additionally, Splam introduces the idea of training the network on donor and acceptor pairs together, based on the principle that the splicing machinery recognizes both ends of each intron at once. We compare Splam’s accuracy to recent state-of-the-art splice site prediction methods, particularly SpliceAI, another method that uses deep neural networks. Our results show that Splam is consistently more accurate than SpliceAI, with an overall accuracy of 96% at predicting human splice junctions. Splam generalizes even to non-human species, including distant ones like the flowering plant Arabidopsis thaliana. Finally, we demonstrate the use of Splam on a novel application: processing the spliced alignments of RNA-seq data to identify and eliminate errors. We show that when used in this manner, Splam yields substantial improvements in the accuracy of downstream transcriptome analysis of both poly(A) and ribo-depleted RNA-seq libraries. Overall, Splam offers a faster and more accurate approach to detecting splice junctions, while also providing a reliable and efficient solution for cleaning up erroneous spliced alignments.
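The windowing idea in the abstract (a 400 bp window at each splice site, with the donor and acceptor presented together) can be illustrated with a short sketch. This is not Splam's code. It assumes the 400 bp window is split as 200 bases on each side of the site, that positions past the sequence ends are padded with N, and that the concatenated pair is one-hot encoded for a convolutional network. None of these details are stated in the abstract, and the names `FLANK`, `site_window`, `pair_input`, and `one_hot` are hypothetical.

```python
# Illustrative sketch only; not Splam's actual implementation.
FLANK = 200  # assumption: 200 nt on each side of a site, giving a 400 nt window

ONE_HOT = {
    "A": (1, 0, 0, 0),
    "C": (0, 1, 0, 0),
    "G": (0, 0, 1, 0),
    "T": (0, 0, 0, 1),
}


def site_window(seq, pos, flank=FLANK):
    """Return the 2*flank bases around boundary `pos` (0 <= pos <= len(seq)).

    Positions that fall outside the sequence are padded with 'N'.
    """
    left = max(0, pos - flank)
    right = min(len(seq), pos + flank)
    pad_left = "N" * (flank - (pos - left))
    pad_right = "N" * (flank - (right - pos))
    return pad_left + seq[left:right] + pad_right


def pair_input(seq, intron_start, intron_end, flank=FLANK):
    """Concatenate the donor and acceptor windows of one intron.

    The intron is given as a 0-based, half-open interval
    [intron_start, intron_end) on the forward strand.
    """
    donor = site_window(seq, intron_start, flank)
    acceptor = site_window(seq, intron_end, flank)
    return donor + acceptor


def one_hot(seq):
    """Encode bases as 4-tuples; anything other than A/C/G/T becomes all zeros."""
    return [ONE_HOT.get(base, (0, 0, 0, 0)) for base in seq.upper()]


# Toy example: a 100 nt intron (GT...AG) between two 300 nt exons.
toy = "A" * 300 + "GT" + "C" * 96 + "AG" + "T" * 300
encoded = one_hot(pair_input(toy, 300, 400))
```

In this toy case the donor dinucleotide sits immediately after the midpoint of the first window and the acceptor dinucleotide immediately before the midpoint of the second, so a model scoring the pair sees both ends of the intron in one input.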

Publisher

Cold Spring Harbor Laboratory

Cited by 3 articles.

1. A basic framework governing splice-site choice in eukaryotes (2024-03-27)

2. Upstream open reading frames may contain hundreds of novel human exons (2024-03-23)

3. Predicting cell-type-specific exon inclusion in the human brain reveals more complex splicing mechanisms in neurons than glia (2024-03-18)