• ISSN 1674-8301
  • CN 32-1810/R
Volume 35 Issue 2
Turn off MathJax
Article Contents

Alexander McKay, Gaetan Burgio. Harnessing CRISPR-Cas system diversity for gene editing technologies[J]. The Journal of Biomedical Research, 2021, 35(2): 91-106. doi: 10.7555/JBR.35.20200184
Citation: Alexander McKay, Gaetan Burgio. Harnessing CRISPR-Cas system diversity for gene editing technologies[J]. The Journal of Biomedical Research, 2021, 35(2): 91-106. doi: 10.7555/JBR.35.20200184

Harnessing CRISPR-Cas system diversity for gene editing technologies

doi: 10.7555/JBR.35.20200184
More Information
  • Corresponding author: Gaetan Burgio, Department of Immunology and Infectious Diseases, John Curtin School of Medical Research, Australian National University, 131 Garran Road, Canberra, ACT 2601, Australia. Tel/Fax: +61-2-6125-9428/+61-2-6247-4823, E-mail: gaetan.burgio@anu.edu.au
  • Received: 12 November 2020
  • Revised: 05 February 2021
  • Accepted: 19 February 2021
  • Published: 26 March 2021
  • Issue Date: March 2021
  • The discovery and utilization of RNA-guided surveillance complexes, such as CRISPR-Cas9, for sequence-specific DNA or RNA cleavage, has revolutionised the process of gene modification or knockdown. To optimise the use of this technology, an exploratory race has ensued to discover or develop new RNA-guided endonucleases with the most flexible sequence targeting requirements, coupled with high cleavage efficacy and specificity. Here we review the constraints of existing gene editing and assess the merits of exploiting the diversity of CRISPR-Cas effectors as a methodology for surmounting these limitations.
  • 加载中
  • [1] Jinek M, Chylinski K, Fonfara I, et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity[J]. Science, 2012, 337(6096): 816–821. doi:  10.1126/science.1225829
    [2] Wang T, Wei JJ, Sabatini DM, et al. Genetic screens in human cells using the CRISPR-Cas9 system[J]. Science, 2014, 343(6166): 80–84. doi:  10.1126/science.1246981
    [3] Chen SD, Sanjana N, Zheng KJ, et al. Genome-wide CRISPR screen in a mouse model of tumor growth and metastasis[J]. Cell, 2015, 160(6): 1246–1260. doi:  10.1016/j.cell.2015.02.038
    [4] Wang HX, Song ZY, Lao YH, et al. Nonviral gene editing via CRISPR/Cas9 delivery by membrane-disruptive and endosomolytic helical polypeptide[J]. Proc Natl Acad Sci U S A, 2018, 115(19): 4903–4908. doi:  10.1073/pnas.1712963115
    [5] Teixeira M, Py BF, Bosc C, et al. Electroporation of mice zygotes with dual guide RNA/Cas9 complexes for simple and efficient cloning-free genome editing[J]. Sci Rep, 2018, 8(1): 474. doi:  10.1038/s41598-017-18826-5
    [6] Del'Guidice T, Lepetit-Stoffaes JP, Bordeleau LJ, et al. Membrane permeabilizing amphiphilic peptide delivers recombinant transcription factor and CRISPR-Cas9/Cpf1 ribonucleoproteins in hard-to-modify cells[J]. PLoS One, 2018, 13(4): e0195558. doi:  10.1371/journal.pone.0195558
    [7] Takeuchi R, Choi M, Stoddard BL. Redesign of extensive protein-DNA interfaces of meganucleases using iterative cycles of in vitro compartmentalization[J]. Proc Natl Acad Sci U S A, 2014, 111(11): 4061–4066. doi:  10.1073/pnas.1321030111
    [8] Makarova KS, Grishin NV, Shabalina SA, et al. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action[J]. Biol Direct, 2006, 1(1): 7. doi:  10.1186/1745-6150-1-7
    [9] Barrangou R, Fremaux C, Deveau H, et al. CRISPR provides acquired resistance against viruses in prokaryotes[J]. Science, 2007, 315(5819): 1709–1712. doi:  10.1126/science.1138140
    [10] Makarova KS, Wolf YI, Iranzo J, et al. Evolutionary classification of CRISPR–Cas systems: a burst of class 2 and derived variants[J]. Nat Rev Microbiol, 2020, 18(2): 67–83. doi:  10.1038/s41579-019-0299-x
    [11] Hajizadeh Dastjerdi A, Newman A, Burgio G. The expanding class 2 CRISPR toolbox: diversity, applicability, and targeting drawbacks[J]. BioDrugs, 2019, 33(5): 503–513. doi:  10.1007/s40259-019-00369-y
    [12] Makarova KS, Wolf YI, Alkhnbashi OS, et al. An updated evolutionary classification of CRISPR-cas systems[J]. Nat Rev Microbiol, 2015, 13(11): 722–736. doi:  10.1038/nrmicro3569
    [13] Shmakov S, Smargon A, Scott D, et al. Diversity and evolution of class 2 CRISPR-Cas systems[J]. Nat Rev Microbiol, 2017, 15(3): 169–182. doi:  10.1038/nrmicro.2016.184
    [14] Yan WX, Chong SR, Zhang HB, et al. Cas13d is a compact RNA-targeting Type VI CRISPR effector positively modulated by a WYL-domain-containing accessory protein[J]. Mol Cell, 2018, 70(2): 327–339.e5. doi:  10.1016/j.molcel.2018.02.028
    [15] Shmakov SA, Faure G, Makarova KS, et al. Systematic prediction of functionally linked genes in bacterial and archaeal genomes[J]. Nat Protoc, 2019, 14(10): 3013–3031. doi:  10.1038/s41596-019-0211-1
    [16] Horvath P, Romero DA, Coûté-Monvoisin A, et al. Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus[J]. J Bacteriol, 2008, 190(4): 1401–1412. doi:  10.1128/JB.01415-07
    [17] Pourcel C, Salvignol G, Vergnaud G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies[J]. Microbiology, 2005, 151(3): 653–663. doi:  10.1099/mic.0.27437-0
    [18] Brouns SJJ, Jore MM, Lundgren M, et al. Small CRISPR RNAs guide antiviral defense in prokaryotes[J]. Science, 2008, 321(5891): 960–964. doi:  10.1126/science.1159689
    [19] Liu L, Li XY, Ma J, et al. The molecular architecture for RNA-Guided RNA cleavage by Cas13a[J]. Cell, 2017, 170(4): 714–726.e10. doi:  10.1016/j.cell.2017.06.050
    [20] Marraffini LA, Sontheimer EJ. CRISPR interference limits horizontal gene transfer in staphylococci by targeting DNA[J]. Science, 2008, 322(5909): 1843–1845. doi:  10.1126/science.1165771
    [21] Newman A, Starrs L, Burgio G. Cas9 cuts and consequences; detecting, predicting, and mitigating CRISPR/Cas9 on‐ and off‐target damage[J]. BioEssays, 2020, 42(9): 2000047. doi:  10.1002/bies.202000047
    [22] Liang F, Han MG, Romanienko PJ, et al. Homology-directed repair is a major double-strand break repair pathway in mammalian cells[J]. Proc Natl Acad Sci U S A, 1998, 95(9): 5172–5177. doi:  10.1073/pnas.95.9.5172
    [23] White MF, Allers T. DNA repair in the archaea—an emerging picture[J]. FEMS Microbiol Rev, 2018, 42(4): 514–526. doi:  10.1093/femsre/fuy020
    [24] Ayora S, Carrasco B, Cárdenas PP, et al. Double-strand break repair in bacteria: a view from Bacillus subtilis[J]. FEMS Microbiol Rev, 2011, 35(6): 1055–1081. doi:  10.1111/j.1574-6976.2011.00272.x
    [25] Wiktor J, van der Does M, Büller L, et al. Direct observation of end resection by RecBCD during double-stranded DNA break repair in vivo[J]. Nucleic Acids Res, 2018, 46(4): 1821–1833. doi:  10.1093/nar/gkx1290
    [26] van der Heijden T, Modesti M, Hage S, et al. Homologous recombination in real time: DNA strand exchange by RecA[J]. Mol Cell, 2008, 30(4): 530–538. doi:  10.1016/j.molcel.2008.03.010
    [27] Cong L, Ran FA, Cox D, et al. Multiplex genome engineering using CRISPR/Cas systems[J]. Science, 2013, 339(6121): 819–823. doi:  10.1126/science.1231143
    [28] Mali P, Yang L, Esvelt KM, et al. RNA-guided human genome engineering via Cas9[J]. Science, 2013, 339(6121): 823–826. doi:  10.1126/science.1232033
    [29] Davis AJ, Chen DJ. DNA double strand break repair via non-homologous end-joining[J]. Transl Cancer Res, 2013, 2(3): 130–143. doi:  10.3978/j.issn.2218-676X.2013.04.02
    [30] Shuman S, Glickman MS. Bacterial DNA repair by non-homologous end joining[J]. Nat Rev Microbiol, 2007, 5(11): 852–861. doi:  10.1038/nrmicro1768
    [31] Chang HHY, Watanabe G, Gerodimos CA, et al. Different DNA end configurations dictate which NHEJ components are most important for joining efficiency[J]. J Biol Chem, 2016, 291(47): 24377–24389. doi:  10.1074/jbc.M116.752329
    [32] Truong LN, Li YJ, Shi LZ, et al. Microhomology-mediated End Joining and Homologous Recombination share the initial end resection step to repair DNA double-strand breaks in mammalian cells[J]. Proc Natl Acad Sci U S A, 2013, 110(19): 7720–7725. doi:  10.1073/pnas.1213431110
    [33] Burgio G, Teboul L. Anticipating and identifying collateral damage in genome editing[J]. Trends Genet, 2020, 36(12): 905–914. doi:  10.1016/j.tig.2020.09.011
    [34] Shmakov S, Abudayyeh OO, Makarova KS, et al. Discovery and functional characterization of diverse class 2 CRISPR-cas systems[J]. Mol Cell, 2015, 60(3): 385–397. doi:  10.1016/j.molcel.2015.10.008
    [35] Burstein D, Harrington LB, Strutt SC, et al. New CRISPR–Cas systems from uncultivated microbes[J]. Nature, 2016, 542(7640): 237–241. doi:  10.1038/nature21059
    [36] Yan WX, Hunnewell P, Alfonse L, et al. Functionally diverse type V CRISPR-Cas systems[J]. Science, 2019, 363(6422): 88–91. doi:  10.1126/science.aav7271
    [37] Smargon AA, Cox DBT, Pyzocha NK, et al. Cas13b is a type VI-B CRISPR-associated RNA-guided RNase differentially regulated by accessory proteins Csx27 and Csx28[J]. Mol Cell, 2017, 65(4): 618–630.e7. doi:  10.1016/j.molcel.2016.12.023
    [38] Konermann S, Lotfy P, Brideau NJ, et al. Transcriptome engineering with RNA-targeting type VI-D CRISPR effectors[J]. Cell, 2018, 173(3): 665–676.e14. doi:  10.1016/j.cell.2018.02.033
    [39] Harrington LB, Burstein D, Chen JS, et al. Programmed DNA destruction by miniature CRISPR-Cas14 enzymes[J]. Science, 2018, 362(6416): 839–842. doi:  10.1126/science.aav4294
    [40] Shmakov SA, Makarova KS, Wolf YI, et al. Systematic prediction of genes functionally linked to CRISPR-Cas systems by gene neighborhood analysis[J]. Proc Natl Acad Sci U S A, 2018, 115(23): E5307–E5316. doi:  10.1073/pnas.1803440115
    [41] Levy A, Goren MG, Yosef I, et al. CRISPR adaptation biases explain preference for acquisition of foreign DNA[J]. Nature, 2015, 520(7548): 505–510. doi:  10.1038/nature14302
    [42] Radovčić M, Killelea T, Savitskaya E, et al. CRISPR–Cas adaptation in Escherichia coli requires RecBCD helicase but not nuclease activity, is independent of homologous recombination, and is antagonized by 5′ ssDNA exonucleases[J]. Nucleic Acids Res, 2018, 46(19): 10173–10183. doi:  10.1093/nar/gky799
    [43] Deltcheva E, Chylinski K, Sharma CM, et al. CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III[J]. Nature, 2011, 471(7340): 602–607. doi:  10.1038/nature09886
    [44] Al-Shayeb B, Sachdeva R, Chen LX, et al. Clades of huge phages from across Earth's ecosystems[J]. Nature, 2020, 578(7795): 425–431. doi:  10.1038/s41586-020-2007-4
    [45] Chatterjee P, Jakimo N, Jacobson JM. Minimal PAM specificity of a highly similar SpCas9 ortholog[J]. Sci Adv, 2018, 4(10): eaau0766. doi:  10.1126/sciadv.aau0766
    [46] Ran FA, Cong L, Yan WX, et al. In vivo genome editing using Staphylococcus aureus Cas9[J]. Nature, 2015, 520(7546): 186–191. doi:  10.1038/nature14299
    [47] Sampson TR, Saroj SD, Llewellyn AC, et al. A CRISPR/Cas system mediates bacterial innate immune evasion and virulence[J]. Nature, 2013, 497(7448): 254–257. doi:  10.1038/nature12048
    [48] Dugar G, Leenay RT, Eisenbart SK, et al. CRISPR RNA-dependent binding and cleavage of endogenous RNAs by the Campylobacter jejuni Cas9[J]. Mol Cell, 2018, 69(5): 893–905.e7. doi:  10.1016/j.molcel.2018.01.032
    [49] Yamada M, Watanabe Y, Gootenberg JS, et al. Crystal structure of the minimal Cas9 from Campylobacter jejuni reveals the molecular diversity in the CRISPR-Cas9 systems[J]. Mol Cell, 2017, 65(6): 1109–1121.e3. doi:  10.1016/j.molcel.2017.02.007
    [50] Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets[J]. Nat Biotechnol, 2017, 35(11): 1026–1028. doi:  10.1038/nbt.3988
    [51] Altschul SF, Gish W, Miller W, et al. Basic local alignment search tool[J]. J Mol Biol, 1990, 215(3): 403–410. doi:  10.1016/S0022-2836(05)80360-2
    [52] Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND[J]. Nat Methods, 2014, 12(1): 59–60. doi:  10.15496/publikation-1176
    [53] Zhang B, Ye YM, Ye WW, et al. Two HEPN domains dictate CRISPR RNA maturation and target cleavage in Cas13d[J]. Nat Commun, 2019, 10(1): 2544. doi:  10.1038/s41467-019-10507-3
    [54] Jore MM, Lundgren M, van Duijn E, et al. Structural basis for CRISPR RNA-guided DNA recognition by Cascade[J]. Nat Struct Mol Biol, 2011, 18(5): 529–536. doi:  10.1038/nsmb.2019
    [55] Sinkunas T, Gasiunas G, Fremaux C, et al. Cas3 is a single-stranded DNA nuclease and ATP-dependent helicase in the CRISPR/Cas immune system[J]. EMBO J, 2011, 30(7): 1335–1342. doi:  10.1038/emboj.2011.41
    [56] Morisaka H, Yoshimi K, Okuzaki Y, et al. CRISPR-Cas3 induces broad and unidirectional genome editing in human cells[J]. Nat Commun, 2019, 10(1): 5302. doi:  10.1038/s41467-019-13226-x
    [57] Dolan AE, Hou ZG, Xiao Yb, et al. Introducing a spectrum of long-range genomic deletions in human embryonic stem cells using type I CRISPR-cas[J]. Mol Cell, 2019, 74(5): 936–950.e5. doi:  10.1016/j.molcel.2019.03.014
    [58] Peters JE, Makarova KS, Shmakov S, et al. Recruitment of CRISPR-Cas systems by Tn7-like transposons[J]. Proc Natl Acad Sci U S A, 2017, 114(35): E7358–E7366. doi:  10.1073/pnas.1709035114
    [59] Klompe SE, Vo PLH, Halpin-Healy TS, et al. Transposon-encoded CRISPR–Cas systems direct RNA-guided DNA integration[J]. Nature, 2019, 571(7764): 219–225. doi:  10.1038/s41586-019-1323-z
    [60] Strecker J, Ladha A, Gardner Z, et al. RNA-guided DNA insertion with CRISPR-associated transposases[J]. Science, 2019, 365(6448): 48–53. doi:  10.1126/science.aax9181
    [61] Mogila I, Kazlauskiene M, Valinskyte S, et al. Genetic dissection of the type III-A CRISPR-Cas system csm complex reveals roles of individual subunits[J]. Cell Rep, 2019, 26(10): 2753–2765.e4. doi:  10.1016/j.celrep.2019.02.029
    [62] Li YJ, Pan SF, Zhang Y, et al. Harnessing Type I and Type III CRISPR-Cas systems for genome editing[J]. Nucleic Acids Res, 2016, 44(4): e34. doi:  10.1093/nar/gkv1044
    [63] Rahman K, Jamal M, Chen X, et al. Reprogramming the endogenous type III-A CRISPR-Cas system for genome editing, RNA interference and CRISPRi screening in Mycobacterium tuberculosis[EB/OL]. [2020-03-09]. https://www.biorxiv.org/content/10.1101/2020.03.09.983494v1.full.pdf+html.
    [64] Niewoehner O, Garcia-Doval C, Rostøl JT, et al. Type III CRISPR–Cas systems produce cyclic oligoadenylate second messengers[J]. Nature, 2017, 548(7669): 543–548. doi:  10.1038/nature23467
    [65] Lau RK, Ye QZ, Birkholz EA, et al. Structure and mechanism of a cyclic trinucleotide-activated bacterial endonuclease mediating bacteriophage immunity[J]. Mol Cell, 2020, 77(4): 723–733.e6. doi:  10.1016/j.molcel.2019.12.010
    [66] Kazlauskiene M, Kostiuk G, Venclovas Č, et al. A cyclic oligonucleotide signaling pathway in type III CRISPR-Cas systems[J]. Science, 2017, 357(6351): 605–609. doi:  10.1126/science.aao0100
    [67] Kazlauskiene M, Tamulaitis G, Kostiuk G, et al. Spatiotemporal control of Type III-A CRISPR-Cas immunity: coupling DNA degradation with the target RNA recognition[J]. Mol Cell, 2016, 62(2): 295–306. doi:  10.1016/j.molcel.2016.03.024
    [68] Han WY, Li YJ, Deng L, et al. A type III-B CRISPR-Cas effector complex mediating massive target DNA destruction[J]. Nucleic Acids Res, 2017, 45(4): 1983–1993. doi:  10.1093/nar/gkw1274
    [69] Elmore JR, Sheppard NF, Ramia N, et al. Bipartite recognition of target RNAs activates DNA cleavage by the Type III-B CRISPR–Cas system[J]. Genes Dev, 2016, 30(4): 447–459. doi:  10.1101/gad.272153.115
    [70] Smalakyte D, Kazlauskiene M, Havelund JF, et al. Type III-A CRISPR-associated protein Csm6 degrades cyclic hexa-adenylate activator using both CARF and HEPN domains[J]. Nucleic Acids Res, 2020, 48(16): 9204–9217. doi:  10.1093/nar/gkaa634
    [71] Estrella MA, Kuo FT, Bailey S. RNA-activated DNA cleavage by the Type III-B CRISPR–Cas effector complex[J]. Genes Dev, 2016, 30(4): 460–470. doi:  10.1101/gad.273722.115
    [72] Liu TY, Liu JJ, Aditham AJ, et al. Target preference of Type III-A CRISPR-Cas complexes at the transcription bubble[J]. Nat Commun, 2019, 10(1): 3001. doi:  10.1038/s41467-019-10780-2
    [73] Chou-Zheng L, Hatoum-Aslan A. A type III-A CRISPR-Cas system employs degradosome nucleases to ensure robust immunity[J]. eLife, 2019, 8: e45393. doi:  10.7554/eLife.45393
    [74] Crowley VM, Catching A, Taylor HN, et al. A type IV-A CRISPR-Cas system in pseudomonas aeruginosa mediates RNA-guided plasmid interference in vivo[J]. CRISPR J, 2019, 2(6): 434–440. doi:  10.1089/crispr.2019.0048
    [75] Pinilla-Redondo R, Mayo-Muñoz D, Russel J, Garrett RA, et al. Type IV CRISPR–Cas systems are highly diverse and involved in competition between plasmids[J]. Nucleic Acids Res, 2020, 48(4): 2000–2012. doi:  10.1093/nar/gkz1197
    [76] Jiang WZ, Zhou HB, Bi HH, et al. Demonstration of CRISPR/Cas9/sgRNA-mediated targeted gene modification in Arabidopsis, tobacco, sorghum and rice[J]. Nucleic Acids Res, 2013, 41(20): e188. doi:  10.1093/nar/gkt780
    [77] Ma XL, Zhu QL, Chen YL, et al. CRISPR/Cas9 Platforms for genome editing in plants: developments and applications[J]. Mol Plant, 2016, 9(7): 961–974. doi:  10.1016/j.molp.2016.04.009
    [78] Raper AT, Stephenson AA, Suo ZC. Functional insights revealed by the kinetic mechanism of CRISPR/Cas9[J]. J Am Chem Soc, 2018, 140(8): 2971–2984. doi:  10.1021/jacs.7b13047
    [79] Chen JS, Ma EB, Harrington LB, et al. CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity[J]. Science, 2018, 360(6387): 436–439. doi:  10.1126/science.aar6245
    [80] Gasiunas G, Young JK, Karvelis T, et al. A catalogue of biochemically diverse CRISPR-Cas9 orthologs[J]. Nat Commun, 2020, 11(1): 5512. doi:  10.1038/s41467-020-19344-1
    [81] Kim E, Koo T, Park SW, et al. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni[J]. Nat Commun, 2017, 8(1): 14500. doi:  10.1038/ncomms14500
    [82] Adli M. The CRISPR tool kit for genome editing and beyond[J]. Nat Commun, 2018, 9(1): 1911. doi:  10.1038/s41467-018-04252-2
    [83] Hou ZG, Zhang Y, Propson NE, et al. Efficient genome engineering in human pluripotent stem cells using Cas9 from Neisseria meningitidis[J]. Proc Natl Acad Sci U S A, 2013, 110(39): 15644–15649. doi:  10.1073/pnas.1313587110
    [84] Gasiunas G, Barrangou R, Horvath P, et al. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria[J]. Proc Natl Acad Sci U S A, 2012, 109(39): E2579–E2586. doi:  10.1073/pnas.1208507109
    [85] Globyte V, Lee SH, Bae T, et al. CRISPR/Cas9 searches for a protospacer adjacent motif by lateral diffusion[J]. EMBO J, 2019, 38(4): e99466. doi:  10.15252/embj.201899466
    [86] Amrani N, Gao XD, Liu PP, et al. NmeCas9 is an intrinsically high-fidelity genome-editing platform[J]. Genome Biol, 2018, 19(1): 214. doi:  10.1186/s13059-018-1591-1
    [87] Tang YY, Fu Y. Class 2 CRISPR/Cas: an expanding biotechnology toolbox for and beyond genome editing[J]. Cell Biosci, 2018, 8(1): 59. doi:  10.1186/s13578-018-0255-x
    [88] Li SY, Cheng QX, Liu JK, et al. CRISPR-Cas12a has both cis- and trans-cleavage activities on single-stranded DNA[J]. Cell Res, 2018, 28(4): 491–493. doi:  10.1038/s41422-018-0022-x
    [89] Zhang LJ, Sun RR, Yang MY, et al. Conformational dynamics and cleavage sites of Cas12a are modulated by complementarity between crRNA and DNA[J]. iScience, 2019, 19: 492–503. doi:  10.1016/j.isci.2019.08.005
    [90] Swarts DC, Jinek M. Mechanistic insights into the cis- and trans-acting DNase activities of cas12a[J]. Mol Cell, 2019, 73(3): 589–600.e4. doi:  10.1016/j.molcel.2018.11.021
    [91] Swarts DC, van der Oost J, Jinek M. Structural basis for guide RNA processing and seed-dependent DNA Targeting by CRISPR-Cas12a[J]. Mol Cell, 2017, 66(2): 221–233.e4. doi:  10.1016/j.molcel.2017.03.016
    [92] Cofsky JC, Karandur D, Huang CJ, et al. CRISPR-Cas12a exploits R-loop asymmetry to form double-strand breaks[J]. eLife, 2020, 9: e55143. doi:  10.7554/eLife.55143
    [93] Karvelis T, Bigelyte G, Young JK, et al. PAM recognition by miniature CRISPR–Cas12f nucleases triggers programmable double-stranded DNA target cleavage[J]. Nucleic Acids Res, 2020, 48(9): 5016–5023. doi:  10.1093/nar/gkaa208
    [94] Pausch P, Al-Shayeb B, Bisom-Rapp E, et al. CRISPR-CasΦ from huge phages is a hypercompact genome editor[J]. Science, 2020, 369(6501): 333–337. doi:  10.1126/science.abb1400
    [95] Zetsche B, Gootenberg JS, Abudayyeh OO, et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-cas system[J]. Cell, 2015, 163(3): 759–771. doi:  10.1016/j.cell.2015.09.038
    [96] Rubin BE, Diamond S, Alexander BFC, et al. Targeted genome editing of bacteria within microbial communities[EB/OL]. [2020-07-21]. https://www.biorxiv.org/content/10.1101/2020.07.17.209189v2.
    [97] Anzalone AV, Randolph PB, Davis JR, et al. Search-and-replace genome editing without double-strand breaks or donor DNA[J]. Nature, 2019, 576(7785): 149–157. doi:  10.1038/s41586-019-1711-4
    [98] Ran FA, Hsu PD, Wright J, et al. Genome engineering using the CRISPR-Cas9 system[J]. Nat Protoc, 2013, 8(11): 2281–2308. doi:  10.1038/nprot.2013.143
    [99] Teng F, Li J, Cui TT, et al. Enhanced mammalian genome editing by new Cas12a orthologs with optimized crRNA scaffolds[J]. Genome Biol, 2019, 20(1): 15. doi:  10.1186/s13059-019-1620-8
    [100] Kappel S, Matthess Y, Kaufmann M, Strebhardt K. Silencing of mammalian genes by tetracycline-inducible shRNA expression[J]. Nat Protoc, 2007, 2(12): 3257–3269. doi:  10.1038/nprot.2007.458
    [101] Abudayyeh OO, Gootenberg JS, Konermann S, et al. C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector[J]. Science, 2016, 353(6299): aaf5573. doi:  10.1126/science.aaf5573
    [102] Koonin EV, Makarova KS, Zhang F. Diversity, classification and evolution of CRISPR-Cas systems[J]. Curr Opin Microbiol, 2017, 37: 67–78. doi:  10.1016/j.mib.2017.05.008
    [103] Abudayyeh OO, Gootenberg JS, Essletzbichler P, et al. RNA targeting with CRISPR–Cas13[J]. Nature, 2017, 550(7675): 280–284. doi:  10.1038/nature24049
  • 加载中
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Figures(7)  / Tables(1)

Article Metrics

Article views(377) PDF downloads(62) Cited by()

Proportional views

Harnessing CRISPR-Cas system diversity for gene editing technologies

doi: 10.7555/JBR.35.20200184
    Corresponding author: Gaetan Burgio, Department of Immunology and Infectious Diseases, John Curtin School of Medical Research, Australian National University, 131 Garran Road, Canberra, ACT 2601, Australia. Tel/Fax: +61-2-6125-9428/+61-2-6247-4823, E-mail: gaetan.burgio@anu.edu.au

Abstract: The discovery and utilization of RNA-guided surveillance complexes, such as CRISPR-Cas9, for sequence-specific DNA or RNA cleavage, has revolutionised the process of gene modification or knockdown. To optimise the use of this technology, an exploratory race has ensued to discover or develop new RNA-guided endonucleases with the most flexible sequence targeting requirements, coupled with high cleavage efficacy and specificity. Here we review the constraints of existing gene editing and assess the merits of exploiting the diversity of CRISPR-Cas effectors as a methodology for surmounting these limitations.

Alexander McKay, Gaetan Burgio. Harnessing CRISPR-Cas system diversity for gene editing technologies[J]. The Journal of Biomedical Research, 2021, 35(2): 91-106. doi: 10.7555/JBR.35.20200184
Citation: Alexander McKay, Gaetan Burgio. Harnessing CRISPR-Cas system diversity for gene editing technologies[J]. The Journal of Biomedical Research, 2021, 35(2): 91-106. doi: 10.7555/JBR.35.20200184
    • Gene editing is a cornerstone technology for the production of genetically modified organisms with diverse applications spanning the research, medical and pharmaceutical fields. Current approaches to site-specifically edit genomes rely on hijacking the host cell DNA repair pathways responsible for fixing double strand breaks (DSBs) and modify or completely inhibit the function of a target gene[12]. In concert with a delivery strategy, such as transfection or electroporation[36], a programmable endonuclease is introduced into the host cell, and generates a single or a double-stranded break (DSB) in the DNA[1,7]. A revolution in gene editing in recent years has precipitated as a result of the discovery that a ribonucleoprotein complex called CRISPR-Cas9 (clustered regularly interspaced short palindromic repeats-CRISPR associated protein) can be programmed to specifically target loci in the genome of an organism[1].

      To further improve upon the properties of the CRISPR-Cas9 gene editing system, and understand the ecological role of the technology, a significant focus has been exploring the evolutionary diversity of CRISPR-Cas systems. The CRISPR-Cas9 editing complexes were originally discovered from ubiquitous immune systems in prokaryotes, which endow bacteria with a unique memory against past bacteriophage infections that they utilise to mount an acquired immune response[89]. As a consequence of this selective pressure, an enormous diversity of different CRISPR-Cas systems exists[10]. Harnessing different interference proteins from this plethora of systems has expanded editing capabilities by providing a set of complementary proteins with different editing efficiencies across diverse organisms as well as different intrinsic specificities and requirements for targeting[11].

      Employing big-data style computational pipelines to mine terabyte-scale genome sequencing data for new CRISPR-Cas systems has procured many new such systems including effector proteins used to facilitate interference. These effectors may possess novel or improvement-of-function RNA-guided catalytic activities, and form the basis of improved or alternative gene editing strategies[1215]. This may overcome inherent issues with current gene editing tools, such as the editing efficiency at different target sites across the genome, or the introduction of large insertions and deletions as a by-product of host-cell DNA repair[89,16]. This review will focus on CRISPR-Cas system diversity to create improved or novel gene editing approaches and extend the gene editing toolbox.

    • CRISPR-Cas systems are defined as a set of genes, known as Cas associated sequences (Cas), which co-occur with a set of tandem repeat sequences, known as CRISPR[89]. During phage invasion in bacteria, a small fragment of DNA (protospacer), produced from phage replication is integrated into CRISPR arrays (Fig. 1)[9,17]. These arrays are structured as alternating segments of fixed-length DNA fragments, mostly acquired from previous infections (spacers) from the phage, and constant pseudo-palindromic repeat sequences (direct repeats)[9,17]. Subsequent transcription of the CRISPR array produces a series of concatenated crRNAs (pre-crRNA), which are cleaved into individual CRISPR RNA (crRNA) fragments[18]. For interference, each crRNA consists of a pseudo-palindromic sequence, which folds into a hairpin structure, used for recognition by the effector protein, and a guide RNA (gRNA) sequence complementary to the target nucleic acid sequence of interest (Fig. 1)[18]. If a phage containing sequences present in a CRISPR array, reinvades the host bacterium, CRISPR-Cas-crRNA complexes use the gRNA segment of the crRNA, in conjunction with a 2 to 5 bp sequence of DNA called a protospacer adjacent motif (PAM) encoded directly adjacent to the target site, to sequence-specifically bind and cleave phage DNA or RNA (Fig. 1)[9,1820]. In some instances, the gRNA and hairpin domains are split into two separate RNAs. The gRNA contains a crRNA and a hairpin domain with a trans-activating CRISPR RNA (tracrRNA). These two molecules form a duplex at one end of both molecules and function as a single crRNA as observed in other systems. Complementary binding then activates the cleavage activities of the effector, degrading the phage DNA and conferring immunity. This mechanism has been successfully harnessed to use the CRISPR system as a programmable nuclease to edit eukaryotic cells. CRISPR-based editing strategies rely on coercing the host-cell DNA repair pathways to integrate or substitute new genetic material[21]. Unlike other forms of DNA damage, DSBs are unique in that most organisms across the three kingdoms of life possess a homology-directed repair (HDR) pathway, which utilises a segment of DNA homologous to the severed region of the damaged DNA molecule as a template for repair[2224]. The pathway consists of four basic steps: resection, strand invasion, re-synthesis and Holliday junction resolution[2526] (Fig. 2).

      Figure 1.  Steps in CRISPR-Cas mediated acquired immunity.

      Figure 2.  Overview of the main double-stranded break repair mechanisms.

      Most modern gene editing platforms leverage this pathway to modify DNA by transfecting the cells of interest with exogenous DNA homologous to the base sequence of the target site. This DNA also encodes the desired mutation to incorporate[21,2728]. Competing with this pathway, however, is usually a second pathway known as non-homologous end joining (NHEJ), which involves the direct re-joining of the severed DNA ends without the use of a template[2930]. The accuracy of NHEJ strongly depends on the integrity of the severed DNA ends[31]. If the ends are intact, then repair is usually accurate, however if one of the ends is damaged, or the ends are incompatible, then repair usually results in the incorporation of small insertions or deletions (INDELs) at the position where the strands were severed[31]. This pathway is often used to produce loss-of-function mutations for target genes. Additionally, it is possible for a mutagenic secondary NHEJ pathway, known as microhomology-mediated end joining (MMEJ) to be activated during the initial resection step of HDR[32]. This pathway attempts to re-anneal and re-join large resected regions of single stranded DNA, which always results in DNA deletions as an artefact of the repair process[32]. Which repair pathway facilitates DSB repair for a given host cell thus determines not only the type of modification induced at the target site, also the incidence of artefacts produced from the editing process that can be mutagenic to the host cell (see Burgio et al[33] for more details). In concert with technical issues related to the RNA-guided problems themselves, such as the induction of off-target breaks in other regions of the host genome, as well as limits on the efficacy and target site programmability of various effectors, this has driven a search for editing systems with novel activities that could potentially bypass the dependency of the editing platforms on DSB induction. One of the most likely sources of the building-blocks for such platforms lies in a deeper exploration of CRISPR-Cas system diversity, and unravelling the functions of as yet undiscovered or known or not yet experimentally characterised systems.

    • All CRISPR-Cas systems are differentiated based on the structure and function of the main effector proteins or complexes, and the presence or absence of specific CRISPR-associated proteins. Class 1 CRISPR-Cas systems (Fig. 3A) utilise large multi-subunit surveillance complexes, for RNA-guided DNA binding. These systems then recruit additional proteins to mediate site-specific DNA degradation[10]. In contrast, class 2 CRISPR-Cas systems all employ a single RNA-guided monomeric effector protein, which effects site-specific strand cleavage of the target nucleic acid strand (Fig. 3B)[10].

      Figure 3.  Outline of the current parsimonious classification schemes for CRISPR-Cas systems.

      Each of the respective CRISPR-Cas system classes are further classified based on the evolutionary lineage of the effector protein and sub-classified based on the presence or absence of genetically linked accessory proteins (Fig. 3)[10,12]. All CRISPR-Cas systems of the same type possess monomeric effector proteins, or effector protein complexes which descend from the same common ancestral variant[12]. For instance, all Cas9 proteins, which are the defining effector in type Ⅱ CRISPR-Cas systems, are believed to descend from a single ancestral variant. There are three subtypes associated with the type Ⅱ systems all distinct to each other by either the presence of accessory proteins such as Cas4 or Csn2 important for spacer acquisition or interference. Type Ⅱ-A systems include the most commonly used protein for CRISPR gene editing, Streptococcus Pyogenes Cas9[12].

      Different types are also assigned a numeric designation based on common catalytic activities, and a potentially related (although not divergent) evolutionary origin followed by a letter designating a unique evolutionary lineage (Fig. 3)[12]. In the case of Class 1 systems with multiple subunits, a new effector type is defined if one of the subunits is novel or unrelated to the others (Fig. 3A).

      Type Ⅰ-E systems, such as that found in the K12 strain of Escherichia Coli, contain a surveillance complex called Cascade, which is comprised of five different proteins: Cas8e, Cas11, Cas7, Cas5 and Cas6[12]. By contrast, type Ⅰ-A systems comprise a surveillance complex only composed of four separate subunits, omitting Cas6 from the complex, but including an additional Cas3 protein ortholog used to mediate interference[12].

    • With the exception of Cas12a, which was discovered in Francisella novicia via direct examination of proteins encoded proximal to the CRISPR-arrays, all other Class 2 CRISPR-Cas effector proteins constituting novel systems have been discovered via the utilisation of computational pipelines which employ big-data filtration techniques to predict and isolate effector proteins based on their co-occurrence with conserved motifs in CRISPR-Cas systems, most notably CRISPR arrays (Fig. 4)[10,14,3438]. The precursor step toward performing this process entails compiling a multi-terabyte sized block of assembled prokaryotic sequencing data from metagenomic and reference sequence databases. The genome data block is then scanned to identify a "seed" or "bait" motif of interest[15]. This can be the tandem repeat pattern which is conserved in CRISPR arrays, or conserved CRISPR-associated proteins such as Cas1/Cas2[14,3536,34,39]. To avoid missing the detection of systems which employ a minimal structure, such as a single CRISPR-array in the absence of any co-occurring proteins, the union of multiple seeds can be taken, to link potential CRISPR-associated proteins to potentially any other CRISPR-associated proteins[1415,40].

      Figure 4.  Overview of the computational pipeline used to uncover CRISPR associated proteins.

      After seed identification, a 5–20 kb frame of DNA sequence upstream and downstream must be extracted and subject to open reading frame prediction to identify possible candidate CRISPR-associated proteins[15]. These proteins then need to be subject to filtering and clustering to produce putative families of CRISPR-associated proteins[15]. Finally, a "CRISPR-icity" score is then calculated by finding the number of occurrences of each putative CRISPR-associated protein proximal to CRISPR-arrays and dividing this by the total number of occurrences throughout the entire block of genome sequencing data[15]. This proves that the putative CRISPR-associated proteins are linked to their proximally encoded CRISPR arrays. Subsequent analysis of the domains, family sequence diversity and properties of the individual co-occurring CRISPR-associated protein sequences has enabled the deduction of many of these CRISPR-associated proteins to be undiscovered effector proteins, which were then verified experimentally using in vitro cleavage and PAM determination assays[15].

    • There are two main types of limitations, intrinsic and extrinsic with the computational pipelines that have been used to predict new CRISPR-Cas systems to date. The intrinsic limitations of computational pipelines are defined in this instance as the inherent flaws with the pipeline methodology itself, while the extrinsic limitations are those defined as outside the scope of the goals of previously constructed pipelines used for CRISPR-Cas system discovery.

      Foremost among the intrinsic limitations is a trade-off between the sensitivity, speed and computational resources required to compute a CRISPR-"icity" score, or equivalent, to validate that putative CRISPR-associated proteins are encoded next to CRISPR-arrays[15].

      Using CRISPRs or conserved CRISPR-Cas proteins as seeds comes with the logical corollary that proteins which play important roles in CRISPR-immunity, but are not genetically co-encoded with CRISPR-arrays or conserved proteins, are not detectable[4143]. Within the members of each cluster of predicted CRISPR-associated proteins, it is assumed that similar homology between related family members implies a conserved function between all members[14,3538,39,44]. Fewer efforts to date have been made to discriminate the functionality of CRISPR-associated proteins encoded within the same families as opposed to novel families of CRISPR-Cas proteins. Prior studies of the CRISPR-Cas9 effector proteins have already unveiled immense functional diversity from within a single lineage of effector proteins, which implies that similar diversity within other CRISPR-Cas systems exists, and is not characterised by the pipelines, which identify these proteins from within genome sequencing data[4549].

      The pipeline itself also possesses several intrinsic trade-offs between the allowed sensitivity for clustering and co-occurrence score calculation, the false positive rate at which novel putative CRISPR-associated proteins are detected, and the computational requirements to perform the clustering and co-occurrence calculation steps. Because most CRISPR-associated proteins are larger than the maximum size of the short tagged reads which serve as the raw output from metagenome sequencing, only assembled metagenome data can usually be used to detect CRISPR-associated proteins using a bait-based approach[14,3536,38,44]. Within the pipeline itself, a trade-off exists between the false-discovery rate for CRISPR-associated proteins and the window size[15]. A trade-off also exists between clustering or sequence search algorithm sensitivity and the amount of computational resources required to perform these calculations[50]. For both classes of algorithms, the underlying reason is that a more sensitive search almost always requires performing a larger number of comparisons between the decomposed query sequence (words) and the sequence database[5052]. As a result, most pipelines have employed algorithms, such as mmseqs2, which optimise this speed/sensitivity trade-off[15,50,53].

    • There have been several attempts to perform both gene knockout and HDR based gene editing using type Ⅰ CRISPR-Cas effectors[56]. This is possible because the single-stranded DNA degradation activity of Cas3, when applied to host-cell genomes, results in a single-stranded resected product anyway from 1 to > 50 kb in size, but most commonly in the range of around 5 to 10 kb[56], which has the potential to trigger the HDR repair pathways to incorporate foreign DNA into the host cell genome within the resected region (Fig. 5A). The key dilemma is that the single stranded progressive degradation catalytic activity of Cas3 when utilised for gene knockout results in lower efficacy in the range of 5% to 60% compared with Cas9 (10%–80%) when the DNA target site and mutation sites overlap[5657]. Notably, however, Cas3 showed slightly improved HDR knock-in efficacy (0.8%) compared with Cas9 (0.45%) for one target site when the mutation site was significantly downstream of the target site[56].

      Figure 5.  Gene editing strategies developed using standalone type I systems.

      An alternate, novel gene editing approach may come from the discovery of type Ⅰ-F systems, which naturally facilitate the programmable integration of Tn7 transposon DNA into a specific locus within a host cell (Fig. 5B). It has recently been discovered that a novel subclass of type Ⅰ and Ⅴ effectors are capable of site-specifically directing the integration of Tn7 transposons[58] with an efficiency in the range of 15% to 60% in bacteria depending on the exact target site specified[5960]. Unfortunately, there have been no attempts to date to determine whether the use of such system is feasible in mammalian cells. Nevertheless, this represents an important advance a potentially viable alternative approach for integrating DNA into the chromosomes of bacteria.

    • Both Type Ⅲ and Ⅳ systems employ a surveillance complex with significant structural and mechanistic similarities to type Ⅰ. In type Ⅲ systems, rather than binding directly to DNA, the complexes natively bind single stranded RNA in a sequence specific manner (Fig. 3A). Csm3 subunits are recruited to the complex upon the formation of a dsRNA duplex and introduce multiple breaks in a target RNA strand[61]. In some systems, the Cas10 subunit of the surveillance complex is able to concurrently degrade the complementary DNA strand. There have been several investigations where Type Ⅲ systems naturally present in bacteria were co-opted for gene editing via the transformation of the target organisms with plasmid encoded CRISPR-arrays containing spacers complementary to a read-out gene of interest. However, to date, no Type Ⅲ has been expressed from vectors in the same manner as type Ⅰ systems for editing purposes, which makes it difficult to assess the potential of these systems as transferable gene editing platforms in other organisms[6263].

      Another potential difficulty associated with the utilisation of type Ⅲ systems is that compared with the CRISPR-Cas systems of other types, the activity of the surveillance complex appears to be more integrated with the host cell defence response[6465]. Upon target recognition, the surveillance complex synthesises cyclic oligoadenylate molecules whilst simultaneously facilitating processive single stranded DNA degradation of the transcribed gene[64,6672]. These molecules then act as signalling substrates to stimulate the binding of interference proteins to facilitate RNA strand degradation[64,70]. However, they also activate non-specific nucleases such as NucC, which is found to be co-encoded with certain type Ⅲ systems[65,73]. Upon being activated, this protein non-specifically degrades the host genome[65]. This is a form of abortive infection to prevent the phage infection spreading to other cells in the same colony[65]. This may be an issue when utilising these proteins for gene editing, if the transformation of type Ⅲ CRISPR-Cas systems into a bacterium of interest induces cell death as a side effect. This means that, although potential editing applications exist for type Ⅲ systems, a significant amount of further research would be required to employ these type Ⅲ effectors in bacterial and eukaryotic cells as general purpose editing tools.

      Like type Ⅰ systems, Type Ⅳ systems consist of an RNA-guided multi-subunit surveillance complex which sequence specifically binds DNA[74]. However, unlike type Ⅰ and Ⅲ, type Ⅳ systems lack Cas1-Cas2 acquisition proteins and are mainly encoded on plasmids[75]. The spacers from their corresponding CRISPR arrays also appear to map to other plasmids, rather than host cell, or phage genomes wherein they have been observed to mediate interference[7475]. Intriguingly, this appears to be via a different mechanism to single-stranded DNA cleavage, instead requiring DinG, a protein with distant homology to helicase proteins used in recombinational repair[74]. The key potential advantage of utilising type Ⅳ systems for editing, is that the interference modules consist of fewer subunits than Types Ⅰ/Ⅲ which makes them easier to compact into delivery constructs[10,7475]. However, given that the exact mechanism interference in these has not yet been characterised[7475], it remains unclear what, if any applications for gene editing these systems possess.

    • It is widely accepted that the lion's share of gene editing advances enabled by CRISPR based gene editing have come from the discovery and utilisation of class 2 CRISPR-Cas effectors. Diverse lineages of class 2 effector proteins possess several generalisable properties, which grant them a comparative advantage in gene editing and knockdown applications compared with class 1 systems. Class 2 effector proteins are monomeric and sequence specifically degrade DNA at a single loci within the target genome or transcriptome upon binding, usually introducing either a single or double strand break in the target nucleic acid strand[10]. This avoids the possible side effects arising from degrading a large DNA region as is the case in type Ⅰ CRISPR-Cas effector mediated gene editing[5657]. As a consequence of these properties, the design and operation of computational pipelines, to extract and characterise the full extent of class 2 effectors diversity has been a major priority in the last 5 years. This has unveiled 3 basic types of class 2 effectors (Fig. 3B), subclassified into a plenitude of subtypes with the potential to complement or surpass the traditional first generation SpCas9-CRISPR mediated gene editing platform.

    • It is indisputable that the breakthrough generated from the utilisation of Streptococcus pyogenes Cas9 (SpCas9) to induce programmable RNA-guided Double Stranded cleavage has since become the epicentre of the CRISPR-Cas gene-editing world. Cas9 distinguishes itself from other effector types by its high abundance (present in approximately 10% of bacteria[12]), distinct mechanism of double stranded cleavage (Fig. 6A), high efficacy in diverse organisms and relatively low restrictions on programmability due to small PAM requirements[7677]. While other type Ⅴ effectors utilise a single RuvC domain to cleave both DNA strands, Cas9 uses its RuvC and HNH domains to cleave the complementary and non target strands almost simultaneously[78]. This cleavage is specific to the target site, with no indiscriminate single stranded DNA (ssDNA) cleavage occurring as a side reaction, which is often observed with the effectors of Type Ⅴ systems such as Cas12a[79].

      Figure 6.  Activities of Class 2 CRISPR-Cas effectors utilized in gene editing systems.

      Due to being the first ortholog discovered, there has been a much greater exploration of Cas9 ortholog diversity and re-engineering of successful orthologs into higher activity variants than for effectors from other types[80] (Table 1)[27,4647,81,101103]. The driving motivation for using different Cas9 orthologs for editing lies in their different PAM requirements, protein size to fit into a delivery vector and editing efficacy at different target sites in different organisms. Both Staphylococcus aureus Cas9 (SaCas9, 1053 residues) and Campylobacter jejuni Cas9 (CjeCas9, 984 residues) are smaller than SpCas9, the standard effector used for most editing applications[46,81]. This results in a smaller construct size when genes encoding either of these effector proteins are cloned onto an insertion vector for gene delivery Certain orthologs may also provide an efficacy and specificity improvement when used at certain target sites, due fewer possible off-target sites due to the effector's PAM being more specific to the target site of interest[8283]. Although overall, there has been relatively little success in finding a naturally occurring Cas9 ortholog, which surpasses SpCas9 in terms of functionality for general purpose use, when used as a collective toolbox for a specific target site, the utilisation of these alternative Cas9 orthologs can significantly increase the specificity and efficacy of the editing reaction, as well as the number of possible sites to induce cleavage in a gene of interest[84].

      AbbreviationSpeciesSize (aa)PAMgRNA size (bp)crRNA (5′→3′)tracrRNA (5′→3′)Reference
      NmCas9 Neisseria meningitidis
      (strain 8013)
      St1Cas9 Streptococcus thermophilus
      (strain LMD-9)
      St3Cas9 Streptococcus thermophilus
      (strain DGC7710)

      Table 1.  Properties of most commonly used Cas9 orthologs in gene editing

    • Although all Cas12 effectors possess and utilise a single RuvC domain for nucleic acid strand cleavage, the substrate requirements and mechanism of cleavage differ substantially between different effector types(Fig. 6B-F). To date there are 11 different known Cas12 effector types, alphabetised A to K (Fig. 3B). The mechanism underlying this cleavage has predominantly been studied in Cas12a orthologs, but provides some transferable insight into the process for other Cas12 effectors as well[8588]. For Cas12a, RNA-DNA heteroduplex formation induces conformational change in the NUC lobe, making the RuvC catalytic residues accessible[88]. Cleavage of the non-target strand must precede cleavage of the target strand[89]. This results in a staggered cleavage pattern with approximately 5 nt 5′ overhangs (Fig. 6B)[89]. This releases the PAM-distal target DNA fragment. However, the ribonucleoprotein complex is still catalytically competent while bound to the PAM-proximal DNA[8990]. This often results in the activation of a secondary activity wherein indiscriminate cleavage, or 'trans' cleavage of ssDNA (and in some orthologs single stranded RNA (ssRNA), and nicking of dsDNA) by the effector protein occurs[79,89].

    • One recent advance has been the discovery and characterisation of Cas12j (phi) effectors. These proteins, encoded exclusively on the genomes of large phages and more compact (700 to 800 aa in size) than other Cas12 effectors[92]. While these effectors possess dsDNA cleavage activity, there was a significant difference in efficacy between Cas12j mediated cleavage of the target strand, and Cas12a mediated cleavage of the target strand[38,39,9192]. This shortcoming means that while the characterisation of Cas12j represents an important step towards more compact, high efficacy editing proteins, it is however unlikely to supersede existing editors such as Cas12a or Cas9.

      One of the most exciting potential advances arising from the exploration of CRISPR-Cas effector diversity has been the discovery of tiny, 400 to 700 amino acids long effector proteins. These proteins are small enough to be delivered in a recombinant adenoviral vector (rAAV)[39]. Unfortunately, all Cas14 effectors discovered to date cleave ssDNA with relatively high efficacy, but are unable to cleave dsDNA with comparable efficacy which limits their potential application without protein engineering optimization (Fig. 6E)[39,91]. Nevertheless, there is considerable optimism that this limitation can be surmounted either via direct protein engineering of Cas14 to produce a gain of function variant with higher cleavage activity, or via further exploration and characterization of Cas14 orthologs.

    • A subclade of Cas12 effectors exist that lack functional RuvC catalytic residues and occur in the same operon as Tn7-like transposases. These function as RNA-guided DNA binding proteins and form a complex with Tn7 transposases to direct the site of transposon integration (Fig. 6D). Compared with type Ⅰ-Tn7 transposon integration systems, Cas12k guided systems offer two important distinct advantages in the form of higher insertion efficacy, depending on the loci chosen for targeting, and simpler and smaller construct size, due to the monomeric nature of Cas12k effectors compared with type Ⅰ-F and Ⅰ-B Cascade surveillance complexes[93]. The efficacy of integration by Cas12k proteins (in the range of 15% to 65% in E. Coli[60]) is superior to the measured efficacy in yeast and eukaryotes of prime editing, an alternate means of integrating DNA in host genomes[94]. This efficacy is also competitive with the editing efficacy of Cas9 or Cas12a without the side-effects associated with the utilisation of DSB repair pathways for editing, although further research is needed to demonstrate feasibility in eukaryotic cell lines[9496].

    • A different avenue for bypassing the limitations of DSB based gene knockout protocols is to induce a gene knockdown at the target site. This involves utilizing RNA-guided site-specific riboendonucleases to target and cleave mRNA transcribed by the gene of interest[14,37,38,9798]. This silences the expression of the target gene. Several Cas12 orthologs have been discovered which possess RNA-guided RNAse activity (Fig. 6C and E), and an entire clade of CRISPR-Cas effectors, designated Cas13, have been discovered which exclusively target and cleave single strand RNA in a manner analogous to RNA-guided DNA targeting CRISPR-Cas effectors[14,379899].

    • Exploring the diversity of CRISPR-Cas systems has unveiled a plethora of possible candidate RNA-guided ribonucleoprotein modules. However, despite the hundreds of thousands of CRISPR-Cas systems, which have been detected in metagenome sequencing data, there has been a much smaller diversity of catalytic activities that could be utilised for alternate gene editing strategies. Some of these systems, employed as emerging technologies have clear potential to become competitive with CRISPR-Cas9 in terms of editing efficacy and target site specificity, while simultaneously lacking the in-built constraints imposed by a reliance on double strand breaks to facilitate genetic recombination. Overall, when added to the existing CRISPR-toolbox, these new RNA-guided interference or integrase systems represent a significant advance when used in specialised cases where the limitations CRISPR-Cas9 are a clear technical obstacle. In the near future we will be able to customise and personalise the gene editing approach by choosing an ideal nuclease for a specific application. In conjunction with other strategies such as either rational, directed evolution or the use of base-editing or prime-editing technology, the exploitation of CRISPR-Cas system diversity remains a promising avenue for developing a general purpose gene editing platform equivalent or superior to the current CRISPR-Cas9 paradigm.

    • We would like to thank Anthony Newman and Jovita De Silva for their efforts in proofreading this work and suggesting improvements. We would like to thank the two anonymous reviewers for their suggestions and comments that have significantly improved the manuscript. Gaetan Burgio is supported by the National Collaborative Research Infrastructure (NCRIS) via Phenomics Australia, the National Health and Medical Research Council of Australia (Grant No. APP1143008), the Australian Research Council (Grant No. DP180101494) and the National Natural Science Foundation of China (Grant No. 81772214).

Reference (103)



    DownLoad:  Full-Size Img  PowerPoint