AttCRISPR: a spacetime interpretable model for prediction of sgRNA on-target activity-Reference-Cited by-同舟云学术

AttCRISPR: a spacetime interpretable model for prediction of sgRNA on-target activity

Published:2021-12 Issue:1 Volume:22 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Xiao Li-Ming,Wan Yun-Qi,Jiang Zhen-Ran

Abstract

Abstract Background More and more Cas9 variants with higher specificity are developed to avoid the off-target effect, which brings a significant volume of experimental data. Conventional machine learning performs poorly on these datasets, while the methods based on deep learning often lack interpretability, which makes researchers have to trade-off accuracy and interpretability. It is necessary to develop a method that can not only match deep learning-based methods in performance but also with good interpretability that can be comparable to conventional machine learning methods. Results To overcome these problems, we propose an intrinsically interpretable method called AttCRISPR based on deep learning to predict the on-target activity. The advantage of AttCRISPR lies in using the ensemble learning strategy to stack available encoding-based methods and embedding-based methods with strong interpretability. Comparison with the state-of-the-art methods using WT-SpCas9, eSpCas9(1.1), SpCas9-HF1 datasets, AttCRISPR can achieve an average Spearman value of 0.872, 0.867, 0.867, respectively on several public datasets, which is superior to these methods. Furthermore, benefits from two attention modules—one spatial and one temporal, AttCRISPR has good interpretability. Through these modules, we can understand the decisions made by AttCRISPR at both global and local levels without other post hoc explanations techniques. Conclusion With the trained models, we reveal the preference for each position-dependent nucleotide on the sgRNA (short guide RNA) sequence in each dataset at a global level. And at a local level, we prove that the interpretability of AttCRISPR can be used to guide the researchers to design sgRNA with higher activity.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/s12859-021-04509-6.pdf

Reference28 articles.

1. Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337(6096):816–21.

2. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, Zhang F. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339(6121):819–23.

3. Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, Norville JE, Church GM. RNA-guided human genome engineering via Cas9. Science. 2013;339(6121):823–6.

4. Rubeis G, Steger F. Risks and benefits of human germline genome editing: an ethical analysis. Asian Bioeth Rev. 2018;10(2):133–41.

5. Kang X, He W, Huang Y, Yu Q, Chen Y, Gao X, Sun X, Fan Y. Introducing precise genetic modifications into human 3PN embryos by CRISPR/Cas-mediated genome editing. J Assist Reprod Genet. 2016;33(5):581–8.

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comprehensive evaluation and prediction of editing outcomes for near-PAMless adenine and cytosine base editors;2024-04-11

2. gRNA Design: How Its Evolution Impacted on CRISPR/Cas9 Systems Refinement;Biomolecules;2023-11-24

3. The promise of explainable deep learning for omics data analysis: Adding new discovery tools to AI;New Biotechnology;2023-11

4. A fusion framework of deep learning and machine learning for predicting sgRNA cleavage efficiency;Computers in Biology and Medicine;2023-10

5. Benchmarking deep learning methods for predicting CRISPR/Cas9 sgRNA on- and off-target activities;Briefings in Bioinformatics;2023-09-22