Generating, modeling, and evaluating a large-scale set of CRISPR/Cas9 off-target sites with bulges

Author:

Yaish OfirORCID,Orenstein YaronORCID

Abstract

CRISPR/Cas9 system is widely used in a broad range of gene-editing applications. While the CRISPR editing technique is quite accurate in the target region, there may be many unplanned offtarget sites (OTS). Consequently, a plethora of highthroughput experimental assays have been developed to measure OTS in a genome-wide manner. Based on these experimental data, computational methods have been developed to predict OTS given a guide RNA and a reference genome. However, these methods are highly inaccurate when considering OTS with bulges due to limited data compared to OTS without bulges. Recently, CHANGE-seq, a newin vitroexperimental technique to detect OTS, was used to produce a dataset of unprecedented scale and quality (more than 200,000 OTS over 110 guide RNAs). In addition, the same study includedin cellulaGUIDE-seq experiments for 58 of the guide RNAs. But, while the CHANGE-seq data included more than 20,000 OTS with bulges, the GUIDE-seq data did not include any OTS with bulges. Here, we fill this gap by generating the most comprehensive GUIDE-seq dataset with bulges, and training and evaluating state-of-the-art machine-learning models that consider OTS with bulges. We first reprocessed the publicly available experimental raw data of the CHANGE-seq study to generate 20 new GUIDE-seq datasets, and more than 450 OTS with bulges among the original and new GUIDE-seq experiments. We then trained various machinelearning models, evaluated their performance on multiple datasets, and demonstrated their state-of-the-art performance bothin vitroandin cellula. Last, we visualized the key features learned by our models on OTS with bulges. Our data and models will be instrumental to any future off-target study considering bulges.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3