Author:
Jin Lei,Liyanage Risi,Duan Dongsheng,Chen Shi-Jie
Abstract
AbstractThe CRISPR/Cas nucleases system is widely considered the most important tool in genome engineering. However, current methods for predicting on/off-target effects and designing guide RNA (gRNA) rely on purely data-driven approaches or focus solely on the system’s thermal equilibrium properties. Nonetheless, experimental evidence suggests that the process is kinetically controlled rather than being in equilibrium. In this study, we utilized a vast amount of available data and combined random forest, a supervised ensemble learning algorithm, and free energy landscape analysis to investigate the kinetic pathways of R-loop formation in the CRISPR/Cas9 system and the intricate molecular interactions between DNA and the Cas9 RuvC and HNH domains. The study revealed (a) a novel three-state kinetic mechanism, (b) the unfolding of the activation state of the R-loop being the most crucial kinetic determinant and the key predictor for on- and off-target cleavage efficiencies, and (c) the nucleotides from positions +13 to +16 being the kinetically critical nucleotides. The results provide a biophysical rationale for the design of a kinetic strategy for enhancing CRISPR/Cas9 gene editing accuracy and efficiency.
Publisher
Cold Spring Harbor Laboratory