A machine learning approach for predicting CRISPR-Cas9 cleavage efficiencies and patterns underlying its mechanism of action

被引:130
作者
Abadi, Shiran [1 ]
Yan, Winston X. [2 ,3 ,4 ]
Amar, David [5 ,6 ]
Mayrose, Itay [1 ]
机构
[1] Tel Aviv Univ, Dept Mol Biol & Ecol Plants, Tel Aviv, Israel
[2] Broad Inst MIT & Harvard, Cambridge, MA USA
[3] Harvard Med Sch, Grad Program Biophys, Boston, MA USA
[4] Harvard Med Sch, Harvard MIT Div Hlth Sci & Technol, Boston, MA USA
[5] Tel Aviv Univ, Blavatnik Sch Comp Sci, Tel Aviv, Israel
[6] Stanford Univ, Dept Med, Div Cardiovasc Med, Stanford, CA 94305 USA
关键词
OFF-TARGET SITES; GUIDE-RNA; CRISPR/CAS9; SYSTEMS; HUMAN-CELLS; CAS9; DNA; NUCLEASES; TOOL; SPECIFICITIES; ENDONUCLEASE;
D O I
10.1371/journal.pcbi.1005807
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The adaptation of the CRISPR-Cas9 system as a genome editing technique has generated much excitement in recent years owing to its ability to manipulate targeted genes and genomic regions that are complementary to a programmed single guide RNA (sgRNA). However, the efficacy of a specific sgRNA is not uniquely defined by exact sequence homology to the target site, thus unintended off-targets might additionally be cleaved. Current methods for sgRNA design are mainly concerned with predicting off-targets for a given sgRNA using basic sequence features and employ elementary rules for ranking possible sgRNAs. Here, we introduce CRISTA (CRISPR Target Assessment), a novel algorithm within the machine learning framework that determines the propensity of a genomic site to be cleaved by a given sgRNA. We show that the predictions made with CRISTA are more accurate than other available methodologies. We further demonstrate that the occurrence of bulges is not a rare phenomenon and should be accounted for in the prediction process. Beyond predicting cleavage efficiencies, the learning process provides inferences regarding patterns that underlie the mechanism of action of the CRISPR-Cas9 system. We discover that attributes that describe the spatial structure and rigidity of the entire genomic site as well as those surrounding the PAM region are a major component of the prediction capabilities.
引用
收藏
页数:24
相关论文
共 74 条
[21]   Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases [J].
Frock, Richard L. ;
Hu, Jiazhi ;
Meyers, Robin M. ;
Ho, Yu-Jui ;
Kii, Erina ;
Alt, Frederick W. .
NATURE BIOTECHNOLOGY, 2015, 33 (02) :179-186
[22]   High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells [J].
Fu, Yanfang ;
Foden, Jennifer A. ;
Khayter, Cyd ;
Maeder, Morgan L. ;
Reyon, Deepak ;
Joung, J. Keith ;
Sander, Jeffry D. .
NATURE BIOTECHNOLOGY, 2013, 31 (09) :822-+
[23]   Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR [J].
Haeussler, Maximilian ;
Schoenig, Kai ;
Eckert, Helene ;
Eschstruth, Alexis ;
Mianne, Joffrey ;
Renaud, Jean-Baptiste ;
Schneider-Maunoury, Sylvie ;
Shkumatava, Alena ;
Teboul, Lydia ;
Kent, Jim ;
Joly, Jean-Stephane ;
Concordet, Jean-Paul .
GENOME BIOLOGY, 2016, 17
[24]   Control of DNA minor groove width and Fis protein binding by the purine 2-amino group [J].
Hancock, Stephen P. ;
Ghane, Tahereh ;
Cascio, Duilio ;
Rohs, Remo ;
Di Felice, Rosa ;
Johnson, Reid C. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (13) :6750-6760
[25]   E-CRISP: fast CRISPR target site identification [J].
Heigwer, Florian ;
Kerr, Grainne ;
Boutros, Michael .
NATURE METHODS, 2014, 11 (02) :122-124
[26]   DNA targeting specificity of RNA-guided Cas9 nucleases [J].
Hsu, Patrick D. ;
Scott, David A. ;
Weinstein, Joshua A. ;
Ran, F. Ann ;
Konermann, Silvana ;
Agarwala, Vineeta ;
Li, Yinqing ;
Fine, Eli J. ;
Wu, Xuebing ;
Shalem, Ophir ;
Cradick, Thomas J. ;
Marraffini, Luciano A. ;
Bao, Gang ;
Zhang, Feng .
NATURE BIOTECHNOLOGY, 2013, 31 (09) :827-+
[27]   Efficient genome editing in zebrafish using a CRISPR-Cas system [J].
Hwang, Woong Y. ;
Fu, Yanfang ;
Reyon, Deepak ;
Maeder, Morgan L. ;
Tsai, Shengdar Q. ;
Sander, Jeffry D. ;
Peterson, Randall T. ;
Yeh, J-R Joanna ;
Joung, J. Keith .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :227-229
[28]   RNA-guided editing of bacterial genomes using CRISPR-Cas systems [J].
Jiang, Wenyan ;
Bikard, David ;
Cox, David ;
Zhang, Feng ;
Marraffini, Luciano A. .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :233-239
[29]   Structures of Cas9 Endonucleases Reveal RNA-Mediated Conformational Activation [J].
Jinek, Martin ;
Jiang, Fuguo ;
Taylor, David W. ;
Sternberg, Samuel H. ;
Kaya, Emine ;
Ma, Enbo ;
Anders, Carolin ;
Hauer, Michael ;
Zhou, Kaihong ;
Lin, Steven ;
Kaplan, Matias ;
Iavarone, Anthony T. ;
Charpentier, Emmanuelle ;
Nogales, Eva ;
Doudna, Jennifer A. .
SCIENCE, 2014, 343 (6176) :1215-+
[30]   RNA-programmed genome editing in human cells [J].
Jinek, Martin ;
East, Alexandra ;
Cheng, Aaron ;
Lin, Steven ;
Ma, Enbo ;
Doudna, Jennifer .
ELIFE, 2013, 2