Overview of CRISPR/Cas9
The Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) Type II system is a bacterial immune system that has been modified for genome engineering (see CRISPR history). Prior to CRISPR/Cas9, genome engineering approaches, like zinc finger nucleases (ZFNs) or transcription-activator-like effector nucleases (TALENs), relied upon the use of customizable DNA-binding protein nucleases that required scientists to design and generate a new nuclease-pair for every genomic target. Largely due to its simplicity and adaptability, CRISPR has rapidly become one of the most popular approaches for genome engineering.
CRISPR consists of two components: a “guide” RNA (gRNA) and a non-specific CRISPR-associated endonuclease (Cas9). The gRNA is a short synthetic RNA composed of a “scaffold” sequence necessary for Cas9-binding and a user-defined ∼20 nucleotide “spacer” or “targeting” sequence which defines the genomic target to be modified. Thus, one can change the genomic target of Cas9 by simply changing the targeting sequence present in the gRNA. CRISPR was originally employed to “knock-out” target genes in various cell types and organisms, but modifications to the Cas9 enzyme have extended the application of CRISPR to selectively activate or repress target genes, purify specific regions of DNA, and even image DNA in live cells using fluorescence microscopy. Furthermore, the ease of generating gRNAs makes CRISPR one of the most scalable genome editing technologies and has been recently utilized for genome-wide screens.
This guide will provide a basic understanding of CRISPR/Cas9 biology, introduce the various applications of CRISPR, and help you get started using CRISPR/Cas9 in your own research.
Generating a Knock-out Using CRISPR/Cas9
CRISPR/Cas9 can be used to generate knock-out cells or animals by co-expressing a gRNA specific to the gene to be targeted and the endonuclease Cas9. The genomic target can be any ∼20 nucleotide DNA sequence, provided it meets two conditions:
- The sequence is unique compared to the rest of the genome.
- The target is present immediately upstream of a Protospacer Adjacent Motif (PAM).
The PAM sequence is absolutely necessary for target binding and the exact sequence is dependent upon the species of Cas9 (5′ NGG 3′ for Streptococcus pyogenes Cas9). We will focus on Cas9 from S. pyogenes as it is currently the most widely used in genome engineering (see additional species of Cas9 and corresponding PAM sequences here). Once expressed, the Cas9 protein and the gRNA form a riboprotein complex through interactions between the gRNA “scaffold” domain and surface-exposed positively-charged grooves on Cas9. Cas9 undergoes a conformational change upon gRNA binding that shifts the molecule from an inactive, non-DNA binding conformation, into an active DNA-binding conformation. Importantly, the “spacer” sequence of the gRNA remains free to interact with target DNA. The Cas9-gRNA complex will bind any genomic sequence with a PAM, but the extent to which the gRNA spacer matches the target DNA determines whether Cas9 will cut. Once the Cas9-gRNA complex binds a putative DNA target, a “seed” sequence at the 3′ end of the gRNA targeting sequence begins to anneal to the target DNA. If the seed and target DNA sequences match, the gRNA will continue to anneal to the target DNA in a 3′ to 5′ direction.
Cas9 will only cleave the target if sufficient homology exists between the gRNA spacer and target sequences. The “zipper-like” annealing mechanics may explain why mismatches between the target sequence in the 3′ seed sequence completely abolish target cleavage, whereas mismatches toward the 5′ end are permissive for target cleavage. The Cas9 nuclease has two functional endonuclease domains: RuvC and HNH. Cas9 undergoes a second conformational change upon target binding that positions the nuclease domains to cleave opposite strands of the target DNA. The end result of Cas9-mediated DNA cleavage is a double strand break (DSB) within the target DNA (∼3-4 nucleotides upstream of the PAM sequence).
The resulting DSB is then repaired by one of two general repair pathways:
- The efficient but error-prone Non-Homologous End Joining (NHEJ) pathway
- The less efficient but high-fidelity Homology Directed Repair (HDR) pathway
The NHEJ repair pathway is the most active repair mechanism, capable of rapidly repairing DSBs, but frequently results in small nucleotide insertions or deletions (InDels) at the DSB site. The randomness of NHEJ-mediated DSB repair has important practical implications, because a population of cells expressing Cas9 and a gRNA will result in a diverse array of mutations (for more information, jump to Plan Your Experiment). In most cases, NHEJ gives rise to small InDels in the target DNA which result in in-frame amino acid deletions, insertions, or frameshift mutations leading to premature stop codons within the open reading frame (ORF) of the targeted gene. Ideally, the end result is a loss-of-function mutation within the targeted gene; however, the “strength” of the knock-out phenotype for a given mutant cell is ultimately determined by the amount of residual gene function.
Browse Plasmids: Double Strand Break (Cut)
Enhancing Specificity with Cas9 Nickase
CRISPR/Cas9 is highly specific when gRNAs are designed correctly, but specificity is still a major concern, particularly as CRISPR is being developed for clinical use. The specificity of the CRISPR system is determined in large part by how specific the gRNA targeting sequence is for the genomic target compared to the rest of the genome. Ideally, a gRNA targeting sequence will have perfect homology to the target DNA with no homology elsewhere in the genome. Realistically, a given gRNA targeting sequence will have additional sites throughout the genome where partial homology exists. These sites are called “off-targets” and need to be considered when designing a gRNA for your experiment (more information on gRNA design can be found in the below Plan Your Experiment section).
In addition to optimizing gRNA design, specificity of the CRISPR system can also be increased through modifications to Cas9 itself. As discussed previously, Cas9 generates double strand breaks (DSBs) through the combined activity of two nuclease domains, RuvC and HNH. The exact amino acid residues within each nuclease domain that are critical for endonuclease activity are known (D10A for HNH and H840A for RuvC in S. pyogenes Cas9) and modified versions of the Cas9 enzyme containing only one active catalytic domain (called “Cas9 nickase”) have been generated. Cas9 nickases still bind DNA based on gRNA specificity, but nickases are only capable of cutting one of the DNA strands, resulting in a “nick”, or single strand break, instead of a DSB. DNA nicks are rapidly repaired by HDR (homology directed repair) using the intact complementary DNA strand as the template (jump to our HDR section for more details). Thus, two nickases targeting opposite strands are required to generate a DSB within the target DNA (often referred to as a “double nick” or “dual nickase” CRISPR system). This requirement dramatically increases target specificity, since it is unlikely that two off-target nicks will be generated within close enough proximity to cause a DSB. Therefore, if specificity and reduced off-target effects are crucial, consider using the dual nickase approach to create a double nick-induced DSB. The nickase system can also be combined with HDR-mediated gene editing for highly specific gene edits.
Browse Plasmids: Single Strand Break (Nick)
Making Precise Modifications Using Homology Directed Repair (HDR)
While NHEJ-mediated DSB repair is imperfect and often results in disruption of the open reading frame of the gene, Homology Directed Repair (HDR) can be used to generate specific nucleotide changes (also known as gene “edits”) ranging from a single nucleotide change to large insertions (e.g. addition of a fluorophore or tag).
In order to utilize HDR for gene editing, a DNA “repair template” containing the desired sequence must be delivered into the cell type of interest with the gRNA(s) and Cas9 or Cas9 nickase. The repair template must contain the desired edit as well as additional homologous sequence immediately upstream and downstream of the target (termed left & right homology arms). The length and binding position of each homology arm is dependent on the size of the change being introduced. The repair template can be a single stranded oligonucleotide, double-stranded oligonucleotide, or double-stranded DNA plasmid depending on the specific application. It is worth noting that the repair template must lack the PAM sequence that is present in the genomic DNA, otherwise the repair template becomes a suitable target for Cas9 cleavage. For example, the PAM could be mutated such that it is no longer present, but the coding region of the gene is not affected (i.e. a silent mutation).
The efficiency of HDR is generally low (<10% of modified alleles) even in cells that express Cas9, gRNA and an exogenous repair template. For this reason, many laboratories are attempting to artificially enhance HDR by synchronizing the cells within the cell cycle stage when HDR is most active, or by chemically or genetically inhibiting genes involved in NHEJ. The low efficiency of HDR has several important practical implications. First, since the efficiency of Cas9 cleavage is relatively high and the efficiency of HDR is relatively low, a portion of the Cas9-induced DSBs will be repaired via NHEJ. In other words, the resulting population of cells will contain some combination of wild-type alleles, NHEJ-repaired alleles, and/or the desired HDR-edited allele. Therefore, it is important to confirm the presence of the desired edit experimentally, and if necessary, isolate clones containing the desired edit (see our validation section in Plan Your Experiment).
Activation or Repression of Target Genes Using CRISPR/Cas9
The CRISPR/Cas system is a remarkably flexible tool for genome manipulation. A unique feature of Cas9 is its ability to bind target DNA independently of its ability to cleave target DNA. Specifically, both RuvC- and HNH- nuclease domains can be rendered inactive by point mutations (D10A and H840A in SpCas9), resulting in a nuclease dead Cas9 (dCas9) molecule that cannot cleave target DNA. The dCas9 molecule retains the ability to bind to target DNA based on the gRNA targeting sequence. The first experiments using dCas9 in bacteria demonstrated that targeting dCas9 to transcriptional start sites was sufficient to “repress” or “knock-down” transcription by blocking transcription initiation. Furthermore, dCas9 can be tagged with transcriptional repressors or activators, and targeting these dCas9 fusion proteins to the promoter region results in robust transcription repression or activation of downstream target genes. The simplest dCas9-based activators and repressors consist of dCas9 fused directly to a single transcriptional activator, A (e.g. VP64) or transcriptional repressors, R (e.g. KRAB; see A in Figure to the right). Additionally, more elaborate activation strategies have been developed which result in greater activation of target genes in mammalian cells. These include: co-expression of epitope-tagged dCas9 and antibody-activator effector proteins (e.g. SunTag system, B), dCas9 fused to several different activation domains in series (e.g. dCas9-VPR, C) or co-expression of dCas9-VP64 with a “modified scaffold” gRNA and additional RNA-binding “helper activators” (e.g. SAM activators, D). Importantly, unlike the genome modifications induced by Cas9 or Cas9 nickase, dCas9-mediated gene activation or repression is reversible, since it does not permanently modify the genomic DNA.
Multiplex Genome Engineering with CRISPR/Cas9
Expressing several gRNAs off of the same plasmid ensures that every cell that takes up a plasmid expresses all of the desired gRNAs and increases the likelihood that all desired genomic edits will be carried out by Cas9. Such “multiplex” CRISPR applications include:
- The use of dual nickases to generate a knock-out or edit a gene
- Using Cas9 to generate large genomic deletions
- Modifying multiple different genes at once
Current multiplex CRISPR systems enable researchers to target anywhere from 2 to 7 genetic loci by cloning multiple gRNAs into a single plasmid. These multiplex gRNA vectors can conceivably be combined with any of the aforementioned Cas9-derivatives to not only knock-out target genes, but activate or repress target genes as well.
Browse Plasmids: Multiplex gRNA vectors
Genome-wide Screens Using CRISPR/Cas9
The ability to semi-automatically design and synthesize gRNAs to mutate, activate, or repress almost any genomic locus makes the CRISPR/Cas9 the ideal genome engineering system for large-scale forward genetic screening. Forward genetic screens are particularly useful for studying diseases or phenotypes for which the underlying genetic cause is not known. In general, the goal of a genetic screen is to generate a large population of cells with mutations in a wide variety of genes and use these mutant cells to identify the genetic perturbations that result in a desired phenotype. Before CRISPR/Cas9, genetic screens relied heavily on shRNA-based screens, which are prone to off-target effects and may result in false negatives due to incomplete knock-down of target genes. The CRISPR system, in contrast, is capable of making highly specific, permanent genetic modifications in target genes. The CRISPR system has already been used to screen for novel genes that regulate known phenotypes, including resistance to chemotherapy drugs, resistance to toxins, cell viability, and tumor metastasis. Currently, the most popular method for conducting genome-wide screens using CRISPR/Cas9 involves the use of “pooled” lentiviral CRISPR libraries.
What are pooled lentiviral CRISPR libraries?
Pooled lentiviral CRISPR libraries (heretofore referred to as CRISPR libraries) are a heterogenous population of lentiviral transfer vectors, each containing an individual gRNA targeting a single gene in a given genome.
Guide RNAs are designed in silico and synthesized (see A in figure to the right), then cloned in a pooled format into lentiviral transfer vectors B. CRISPR libraries have been designed for most of the common CRISPR applications including genetic knock-out and activation or repression for both human and mouse genes. Although each library is different, there are several features that are common across most CRISPR libraries. Each library typically contains ∼3-6 gRNAs per gene to ensure modification of every target gene. Libraries can target anywhere from a single class of genes up to every gene in the genome. Thus, CRISPR libraries contain thousands of unique gRNAs targeting a wide variety of genes. Guide RNA design for CRISPR libraries follows the same general principles as designing a gRNA for a specific target. Target sequences must be unique compared to the rest of the genome and be located just upstream of a PAM sequence. Obviously, the exact region of the gene to be targeted may vary depending on the specific application (5′ constitutively expressed exons for knock-out libraries, or the promoter region for activation and repression libraries). For some libraries, Cas9 (or Cas9 derivative) is included on the gRNA-containing plasmid; for others, they must be delivered to the cells separately.
How does one use a CRISPR library?
All of the CRISPR libraries available through Addgene follow the same general experimental protocol. In most cases, the CRISPR library will be shipped at a concentration that is too low to be used in experiments. Thus, the first step in using your library is to “amplify” the library (C in above figure) such that the total amount of DNA is increased but the “representation” (i.e. the relative percentage of each gRNA with the library) is maintained. Once the library has been amplified and the representation checked using next-generation sequencing (NGS), the next step is to generate lentivirus containing the entire CRISPR library D. Mutant cells are then generated by transducing Cas9-expressing cells (or wild-type cells for libraries containing Cas9 and the gRNA) with the lentiviral library E. In screens where you are measuring the loss of gRNAs from a final population (i.e. negative selection survival screens) you need to use NGS to identify the gRNAs present in the initial mutant cell population prior to selection. Alternatively, for positive screens such as drug-screens, you can treat your mutant cells with drug, or control and directly compare the gRNA distribution at the end of the screen F. It is important to remember that analysis of relevant genes (“hits”) at the end of your screen requires the use of NGS.
What can screens tell you?
As noted at the beginning of this section, forward genetic screens are most useful for situations in which the physiology or cell biology behind a particular phenotype or disease is well understood, but the underlying genetic causes are unknown. Therefore, genome-wide screens using CRISPR libraries are a great way to gather unbiased information regarding which genes, if any, play a causal role in a given phenotype. With any experiment, it is important to be sure that the hits you identify are actually important for your phenotype. This is typically carried out by testing the gRNAs identified in your screen individually to ensure that the genetic modification reproduces the phenotype you screened for in the first place.
Find more information on what you should consider before starting your pooled library experiment in “Practical Considerations for Using Pooled Lentiviral CRISPR Libraries” (McDade et al., 2016).
Browse Libraries: CRISPR Pooled Libraries
Additional Uses of Cas9
Image Genomic Regions Using Fluorophore-tagged dCas9
Using a dCas9 fused to a fluorescent marker (such as GFP), researchers have turned dCas9 into a customizable DNA label that can be detected in live cells. By creating unique gRNAs that bind in close proximity along a stretch of genomic DNA, a technique referred to as “tiling”, researchers have imaged specific regions of the genome. The tiling technique does require multiple gRNAs to bind near one another in order to produce a detectable signal.
Browse Plasmids: Label
Purify Genomic Regions Using dCas9
Building on the well-established concept of ChIP (Chromatin Immunoprecipitation), researchers have created enChIP (engineered DNA-binding molecule-mediated ChIP) that allows for the purification of any genomic sequence specified by a particular gRNA. A catalytically inactive dCas9 fused to an epitope tag(s) can be used to purify genomic DNA bound by the gRNA. Learn more about ChIP here.
Browse Plasmids: Purify
Alternatives to Cas9 for CRISPR Genome Engineering
While S. pyogenes Cas9 (SpCas9) is certainly the most commonly used CRISPR endonuclease for genome engineering, it may not be the best endonuclease for every application. For example, the PAM sequence for SpCas9 (5′ NGG 3′) is abundant throughout the human genome, but a NGG sequence may not be positioned correctly to target your desired genes for modification. This is of particular concern when trying to edit a gene using Homology Directed Repair (HDR), which requires PAM sequences in very close proximity to the region to be edited. In the Joung lab, Kleinsteiver et al. has generated synthetic SpCas9-derived variants with non-NGG PAM sequences. The inclusion of these variants into the CRISPR arsenal effectively doubles the targeting range of CRISPR in the human genome.
Additional Cas9 orthologs from various species have been identified and these “non-SpCas9s” bind a variety of PAM sequences. These non-SpCas9s may have other characteristics that make them more useful than SpCas9 for specific applications. For example, the relatively large size of SpCas9 (∼4kb coding sequence) means that plasmids carrying the SpCas9 cDNA cannot be efficiently packaged into adeno-associated virus (AAV). Conversely, the coding sequence for Staphylococcus aureus Cas9 (SaCas9) is ∼1 kilobase shorter than SpCas9, allowing it to be efficiently packaged into AAV. Similar to SpCas9, the SaCas9 endonuclease is capable of modifying target genes in mammalian cells in vitro and in mice in vivo.
Another limitation of SpCas9 has to do with the low efficiency of making specific genetic edits via HDR. In the Zhang lab, Zetsche et al. describes two RNA-guided endonucleases from the Cpf1 family that display cleavage activity in mammalian cells. Unlike Cas9 nucleases, the result of Cpf1-mediated DNA cleavage is a double-strand break with a short 3′ overhang. Cpf1’s staggered cleavage pattern opens up the possibility of directional gene transfer, analogous to traditional restriction enzyme cloning, which may increase the efficiency of gene editing. Like the Cas9 variants and orthologs described above, Cpf1 also expands the number of sites that can be targeted by CRISPR to AT-rich regions or AT-rich genomes that lack the NGG PAM sites favored by SpCas9.
Plan Your CRISPR Experiment
CRISPR/Cas9 is a powerful system that enables researchers to manipulate the genome of target cells like never before. This section will provide a general framework to get you started using CRISPR/Cas9 in your research. Although we will focus on using CRISPR in mammalian cells, many of these principles apply to using CRISPR in other organisms. First, consider the genetic manipulation that is necessary to model your specific disease or process of interest. Do you want to:
- Generate complete and permanent loss of gene expression or function (i.e. knock-out)?
- Express a mutated version of the gene (e.g. point mutant)?
- Increase or decrease expression of a target gene?
Once you have a clear understanding of what you are trying to do, you are ready to start navigating the different reagents that are available for your particular experiment.
Select Your Desired Genetic Manipulation
Different genetic manipulations require different CRISPR components. Selecting a specific genetic manipulation can be a good way to narrow down which reagents are appropriate for a given experiment. Make sure to check whether reagents are available to carry out your specific genetic manipulation in your specific model organism. There may not be a perfect plasmid for your specific application, and in such a case, it may be necessary to customize an existing reagent to suit your needs.
|Genetic Manipulation||Application||Cas9||gRNA||Additional Considerations|
|Knock-out||Permanently disrupt gene function in a particular cell type or organism without regard for specific mutation||Cas9 (or Cas9 nickase)||Single (or dual) gRNA targeting 5′ exon or essential protein domains||Dual-nickase approach increases specificity but is less efficient|
|Edit||Generate a specific user-defined sequence change in a particular gene, such as generating a point mutation or inserting a tag||Cas9 (or Cas9 nickase)||Single (or dual) gRNA targeting the region where the edit should be made||Requires DNA template for HDR; Efficiency is reduced compared to knock-out|
|Repress or Interfere (CRISPRi)||Reduce expression of a particular gene(s) without permanently modifying the genome||dCas9 or dCas9-repressor (such as dCas9-KRAB)||gRNA(s) targeting promoter elements of target gene||dCas9-KRAB is more effective than dCas9 alone for mammalian cell lines|
|Activate (CRISPRa)||Increase expression of an endogenous gene(s) without permanently modifying the genome||dCas9-activator (such as dCas9-VP64)||gRNA(s) targeting promoter elements of target gene||Many different activators exist, all derived from S. pyogenes dCas9|
Select Expression System
To use the CRISPR system, you will need both gRNA and Cas9 expressed in your target cells. The expression system you need will depend upon your specific application. For example, certain cell types are easier to transfect than others. For easy-to-transfect cell types (e.g. HEK293 cells), transfection with standard transfection reagents may be sufficient to express both your gRNA and Cas9. For more difficult cells (e.g. primary cells), viral delivery of CRISPR reagents may be more appropriate.
The table below summarizes the major expression systems and variables for using CRISPR in mammalian cells. Some of the variables include:
- Species of Cas9 and gRNA
- Species of promoter and expression pattern of promoter for Cas9 and gRNA
- Presence of a selectable marker (drug or fluorophore)
- Delivery method
|Expression System||Components of System||Application|
|Mammalian expression vector||
||Transient or stable expression of Cas9 and/or gRNA in a mammalian cell line that can be transfected at high efficiency|
|Cas9 mRNA and gRNA||Plasmids containing gRNA and Cas9 are used in in vitro transcription reactions to generate mature Cas9 mRNA and gRNA, then delivered to target cells (i.e. microinjection or electroporation)||
|Cas9-gRNA riboprotein complexes||Purified Cas9 protein and in vitro transcribed gRNA are combined to form a Cas9-gRNA complex and delivered to cells using cationic lipids||
Additional Addgene Resources:
- Depositor protocols for plasmids at Addgene
- JOVE Video Reprint: Protocol for genomic deletions in mammalian cell lines
Select Your Target Sequence and Design Your gRNA
Once you have selected your CRISPR components and method of delivery, you are ready to select a target sequence and design your gRNA. A general overview of how to design a gRNA is presented below.
- Know your cell line and genome sequence
The cell line you choose determines a variety of factors related to your experiment. The genomic sequence used to design gRNAs will depend upon the gene in question and the species from which your cells were derived. When possible, you should sequence the region you are planning to modify prior to designing your gRNA, as sequence variation between your gRNA targeting sequence and target DNA may result in reduced cleavage. The number of alleles for each gene may vary depending on the specific cell line or organism, which may affect the observed efficiency of knock-out or knock-in using CRISPR/Cas9.
- Select gene and genetic element to be manipulated
In order to manipulate a given gene using CRISPR/Cas9, you will have to identify the genomic sequence for the gene you are trying to target. However, the exact region of the gene you target will depend on your specific application. For example:
- To activate or repress a target gene using dCas9-activators or dCas9-repressors, gRNAs should be targeted to the promoter driving expression of your gene of interest.
- For genetic knock-outs, gRNAs commonly target 5′ constitutively expressed exons, which reduces the chances that the targeted region is removed from the mRNA due to alternative splicing. Exons near the N-terminus are targeted since frameshift mutations here will increase the likelihood that a nonfunctional protein product is produced.
- Alternatively, gRNAs can be designed to target exons that code for known essential protein domains. The benefit of this approach is that non-frameshift mutations such as insertions or deletions (which commonly result from DSBs) are more likely to alter protein function when they occur in protein domains that are essential for protein function.
- For gene editing experiments using HDR, it is essential that the target sequence be close to the location of the desired edit. In this case, it is necessary to identify the exact location where the edit should occur and select a target sequence nearby.
- Select gRNAs based on predicted “on-target” and “off-target” activity
A PAM sequence is absolutely necessary for Cas9 to bind target DNA. As such, one must identify all PAM sequences within the genetic region to be targeted (PAM is 5′ NGG 3′ for SpCas9). If there are no PAM sequences within your desired sequence, you may want to consider using a Cas9 from a different species or a S. pyogenes Cas9 variant that binds a PAM sequence that is present in the genomic region you want to target (see additional Cas9 variants and PAM sequences). Once all possible PAM sequences and putative target sites have been identified, it is time to choose which site is likely to result in the most efficient on-target cleavage.
Clearly, the gRNA targeting sequence needs to match the target sequence, but it is also critical to ensure that the gRNA targeting sequence does NOT match additional sites within the genome. In a perfect world, your gRNA targeting sequence would have perfect homology to your target with no homology elsewhere in the genome. Realistically, a given gRNA targeting sequence will have additional sites throughout the genome where partial homology exists. These sites are called “off-targets” and should be considered when designing a gRNA for your experiments. In general, off-target sites are not cleaved as efficiently when mismatches occur near the PAM sequence, so gRNAs with no homology or those with mismatches close to the PAM sequence will have the highest specificity.
In addition to “off-target activity”, it is also important to consider factors that maximize cleavage of the desired target sequence (“on-target activity”). It is now understood that two gRNA targeting sequences, each having 100% homology to the target DNA may not result in equivalent cleavage efficiency. In fact, cleavage efficiency may increase or decrease depending upon the specific nucleotides within the selected target sequence. For example, gRNA targeting sequences containing a G nucleotide at position 20 (1 bp upstream of the PAM) may be more efficacious than gRNAs containing a C nucleotide at the same position in spite of being a perfect match for the target sequence. Therefore, close examination of predicted on-target and off-target activity of each potential gRNA targeting sequence is necessary to design the best gRNA for your experiment.
Several gRNA design programs have been developed that are capable of locating potential PAM and target sequences and ranking the associated gRNAs based on their predicted on-target and off-target activity (see gRNA design software). Additionally, many plasmids containing “validated” gRNAs are now available through Addgene. These plasmids contain gRNAs that have been used successfully in genome engineering experiments and have been published in peer-reviewed journals. Using validated gRNAs can save your lab valuable time and resources when carrying out experiments using CRISPR/Cas9.
Browse Plasmids: Validated gRNAs
- Synthesize and clone desired gRNAs
Once you have selected your target sequences it is time to design your gRNA oligos and clone these oligos into your desired vector. In many cases, targeting oligos are synthesized, annealed, and inserted into plasmids containing the gRNA scaffold using standard restriction-ligation cloning. However, the exact cloning strategy will depend on the gRNA vector you have chosen, so it is best to review the protocol associated with the specific plasmid in question (see CRISPR protocols from Addgene depositors).
- Deliver Cas9 and gRNA
Your method of delivery will be determined by the specific expression system that you have chosen. It may be necessary to optimize a protocol for delivery of gRNA and Cas9 to your target cells, as transfection efficiency will vary based on the method of delivery and the cell type tested. Read our blog post for more information about delivery in mammalian systems.
- Validate genetic modification
Once you have successfully delivered the gRNA and Cas9 to your target cells, it is time to validate your genome edit. Delivery of Cas9 and gRNA to wild-type cells will result in several possible genotypes within the resulting “mutant” cell population. A percentage of cells may be wild-type due to either (1) lack of gRNA and/or Cas9 expression, or (2) lack of efficient target cleavage in cells expressing both Cas9 and gRNA. Cells that have undergone modification (mutant cells) can be homozygous (modification of both alleles) or heterozygous (modification of a single allele). Furthermore, in mutant cells containing two mutated alleles, each mutated allele may be different owing to the error-prone nature of NHEJ. Even for gene editing experiments using HDR, not every mutated allele will contain the desired edit as a large percentage of DSBs are still repaired by NHEJ. Therefore, the end result of most experiments is a heterogeneous population of cells containing a wide variety of mutations or edits within target genes.
How do you determine that your desired edit has occurred? The exact method necessary to validate your edit will depend upon your specific application, and in some cases, new approaches must be devised. However, there are several common ways to verify that your cells contain your desired edit, including but not limited to:
- Mismatch-cleavage assay (for NHEJ repaired DSBs): Provides a semi-quantitative readout of the percentage of alleles that have been mutated within a mixed cell population. The region of interest is PCR amplified, PCR products are denatured-renatured, treated with a nuclease that cleaves DNA heteroduplexes, and run on an agarose gel to identify DNA fragments.
- PCR and restriction digest (for HDR repaired DSBs): For small nucleotide edits that introduce a novel restriction site. The region of interest is PCR amplified, digested with the appropriate restriction enzyme and run on an agarose gel to identify DNA fragments.
- PCR amplification and gel electrophoresis (for HDR or NHEJ): For large deletions or insertions, the region of interest can be PCR amplified using primers that (A) flank the region of interest (deletions or small insertions) or (B) span the genome-insert boundary (insertions only). The PCR product is then run on an agarose gel to determine whether the edit was successful.
- PCR amplification, subcloning and Sanger sequencing (for HDR or NHEJ): Provides semi-quantitative assessment of targeting frequency and exact sequence of targeted alleles. Involves PCR amplification of targeted region from DNA, subcloning into a plasmid, and screening individual clones.
- PCR amplification and next-generation sequencing (for HDR or NHEJ): Provides quantitative assessment of the genome edits in your target sequence and can also be used to examine off-target effects.
More information on each of these techniques can be found on our blog post, CRISPR 101: Validating your Genome Edit.
Addgene Website & Blog References
- CRISPR plasmids at Addgene
- CRISPR software and depositor protocols
- JOVE video reprint: Protocol for genomic deletions in mammalian cell lines
- Viral vectors at Addgene
- CRISPR software guide
- Genome-wide screening guide
- Validation of your genome edits
- HDR (Homology directed repair) guide
Cas9 Species/Variants and PAM Sequences
|Species/Variant of Cas9||PAM Sequence|
|Streptococcus pyogenes (SP); SpCas9||NGG|
|SpCas9 D1135E variant||NGG (reduced NAG binding)|
|SpCas9 VRER variant||NGCG|
|SpCas9 EQR variant||NGAG|
|SpCas9 VQR variant||NGAN or NGNG|
|Staphylococcus aureus (SA); SaCas9||NNGRRT or NNGRR(N)|
|Neisseria meningitidis (NM)||NNNNGATT|
|Streptococcus thermophilus (ST)||NNAGAAW|
|Treponema denticola (TD)||NAAAAC|
|Additional Cas9s from various species||PAM sequence may not be characterized|
The majority of the CRISPR plasmids in Addgene’s collection are from S. pyogenes unless otherwise noted.
|Cas||CRISPR Associated Protein, the Cas9 nuclease is the active enzyme for the Type II CRISPR system|
|CRISPR||Clustered Regularly Interspaced Short Palendromic Repeat, a region in bacterial genomes used in pathogen defense|
|CRISPRa||CRISPR Activation, using a dCas9 or dCas9-activator with a gRNA to increase transcription of a target gene|
|CRISPRi||CRISPR Interference, using a dCas9 or dCas9-repressor with a gRNA to repress/decrease transcription of a target gene|
|Cut||A DNA double strand break, the wild type function of Cas9|
|dCas9||Nuclease dead Cas9, an enzymatically inactive form of Cas9; Can bind, but cannot cleave DNA|
|DSB||Double Strand Break, a break in both strands of DNA, through the use of Cas9 or two Cas9-nickases targeting opposite strands|
|Dual Nickase/Double Nick||A method to decrease off-target effects by using a single Cas9 nickase and 2 different gRNAs, which bind in close proximity on opposite strands of the DNA, to create a DSB|
|enChIP||Engineered DNA-binding molecule-mediated ChIP, using a tagged-dCas9+gRNA to purify specific genomic regions to identify molecules associated with the genomic regions|
|Genetic modification or manipulation||Any genetic perturbation, including genetic knock-out, gene activation, or gene repression|
|gRNA||Guide RNA, a synthetic fusion of the endogenous bacterial crRNA and tracrRNA; Provides both targeting specificity and scaffolding/binding ability for Cas9 nuclease; Does not exist in nature; Also referred to as “single guide RNA” or “sgRNA”|
|gRNA scaffold sequence||The sequence within the gRNA that is responsible for Cas9 binding; Does not include the 20bp spacer/targeting sequence that is used to guide Cas9 to target DNA|
|gRNA targeting sequence||The 20 nucleotides that precede the PAM sequence in the genomic DNA; What gets put into a gRNA expression plasmid; Does NOT include the PAM sequence or the gRNA scaffold sequence|
|HDR||Homology Directed Repair, a DNA repair mechanism that uses a template to repair nicks or DSBs|
|InDel||Insertion/Deletion, a type of mutation that can result in the disruption of a gene by shifting the ORF and/or creating premature stop codons|
|NHEJ||Non-Homologous End-Joining; A DNA repair mechanism that often introduces InDels|
|Nick||A break in only one strand of a double stranded DNA; Normally repaired by HDR|
|Nickase||Cas9 that has one of the two nuclease domains inactivated; Can be either the RuvC or HNH domain; Capable of cleaving only one strand of target DNA|
|Off-target effects or off-target activity||Cas9 cleavage at undesired locations due to gRNA targeting sequence with sufficient homology to recruit Cas9 to unintended genomic locations; Can be minimized by double-nick|
|On-target activity||Cas9 cleavage at a desired location due to gRNA targeting sequence with sufficient homology to recruit Cas9 to desired genomic locations|
|ORF||Open Reading Frame; The codons that make up a gene|
|PAM||Protospacer Adjacent Motif; Necessary for Cas9 to bind target DNA; Must immediately follow the target sequence|
|PCR||Polymerase Chain Reaction; Used to amplify a specific sequence of DNA|
|sgRNA||Single guide RNA; A synthetic RNA composed of a targeting sequence and scaffold sequence derived from endogenous bacterial crRNA and tracrRNA; Used to target Cas9 to a specific genomic locus in genome engineering experiments; Also referred to as a “gRNA”|
|Target sequence||Genomic target of the gRNA targeting sequence; The 20 nucleotides that are incorporated into the gRNA plus the PAM sequence; Not to be confused with the gRNA targeting sequence|
Practical Considerations for Using Pooled Lentiviral CRISPR Libraries. McDade JR, Waxmonsky NC, Swanson LE, Fan M. Curr Protoc Mol Biol. 2016 Jul 1;115:31.5.1-31.5.13. doi: 10.1002/cpmb.8. PubMed.
Cpf1 Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System. Zetsche B, Gootenberg JS, Abudayyeh OO, Slaymaker IM, Makarova KS, Essletzbichler P, Volz SE, Joung J, van der Oost J, Regev A, Koonin EV, Zhang F. Cell. 2015 Oct 22;163(3):759-71. doi: 10.1016/j.cell.2015.09.038. PubMed.
Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Kleinstiver BP, Prew MS, Tsai SQ, Topkar VV, Nguyen NT, Zheng Z, Gonzales AP, Li Z, Peterson RT, Yeh JR, Aryee MJ, Joung JK. Nature. 2015 Jul 23;523(7561):481-5. doi: 10.1038/nature14592. PubMed.
CRISPR-Cas9-Mediated Genetic Screening in Mice with Haploid Embryonic Stem Cells Carrying a Guide RNA Library. Zhong C, Yin Q, Xie Z, Bai M, Dong R, Tang W, Xing YH, Zhang H, Yang S, Chen LL, Bartolomei MS, Ferguson-Smith A, Li D, Yang L, Wu Y, Li J. Cell Stem Cell. 2015 Aug 6;17(2):221-32. doi: 10.1016/j.stem.2015.06.005. PubMed.
Multiplexable, locus-specific targeting of long RNAs with CRISPR-Display. Shechner DM, Hacisuleyman E, Younger ST, Rinn JL. Nat Methods. 2015 Jul;12(7):664-70. doi: 10.1038/nmeth.3433. PubMed.
Discovery of cancer drug targets by CRISPR-Cas9 screening of protein domains. Shi J, Wang E, Milazzo JP, Wang Z, Kinney JB, Vakoc CR. Nat Biotechnol. 2015 Jun;33(6):661-7. doi: 10.1038/nbt.3235. PubMed.
Electroporation enables the efficient mRNA delivery into the mouse zygotes and facilitates CRISPR/Cas9-based genome editing. Hashimoto M, Takemoto T. Sci Rep. 2015 Jun 11;5:11315. doi: 10.1038/srep11315. PubMed.
High-throughput functional genomics using CRISPR-Cas9. Shalem O, Sanjana NE, Zhang F. Nat Rev Genetics. 2015 May;16(5):299-311. doi: 10.1038/nrg3899. PubMed.
Increasing the efficiency of homology-directed repair for CRISPR-Cas9-induced precise gene editing in mammalian cells. Chu VT, Weber T, Wefers B, Wurst W, Sander S, Rajewsky K, Kühn R. Nat Biotechnol. 2015 May;33(5):543-8. doi: 10.1038/nbt.3198. PubMed.
Highly efficient Cas9-mediated transcriptional programming. Chavez A, Scheiman J, Vora S, Pruitt BW, Tuttle M, P R Iyer E, Lin S, Kiani S, Guzman CD, Wiegand DJ, Ter-Ovanesyan D, Braff JL, Davidsohn N, Housden BE, Perrimon N, Weiss R, Aach J, Collins JJ, Church GM. Nat Methods. 2015 Apr;12(4):326-8. doi: 10.1038/nmeth.3312. PubMed.
In vivo genome editing using Staphylococcus aureus Cas9. Ran FA, Cong L, Yan WX, Scott DA, Gootenberg JS, Kriz AJ, Zetsche B, Shalem O, Wu X, Makarova KS, Koonin EV, Sharp PA, Zhang F. Nature. 2015 Apr 9;520(7546):186-91. doi: 10.1038/nature14299. PubMed.
Genome engineering using CRISPR-Cas9 system. Cong L, Zhang F. Methods Mol Biol. 2015;1239:197-217. doi: 10.1007/978-1-4939-1862-1_10. PubMed.
Genome-scale transcriptional activation by an engineered CRISPR-Cas9 complex. Konermann S, Brigham MD, Trevino AE, Joung J, Abudayyeh OO, Barcena C, Hsu PD, Habib N, Gootenberg JS, Nishimasu H, Nureki O, Zhang F. Nature. 2015 Jan 29;517(7536):583-8. doi: 10.1038/nature14136. PubMed.
Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds. Zalatan JG, Lee ME, Almeida R, Gilbert LA, Whitehead EH, La Russa M, Tsai JC, Weissman JS, Dueber JE, Qi LS, Lim WA. Cell. 2015 Jan 15;160(1-2):339-50. doi: 10.1016/j.cell.2014.11.052. PubMed.
Cationic lipid-mediated delivery of proteins enables efficient protein-based genome editing in vitro and in vivo. Zuris JA, Thompson DB, Shu Y, Guilinger JP, Bessen JL, Hu JH, Maeder ML, Joung JK, Chen ZY, Liu DR. Nat Biotechnol. 2015 Jan;33(1):73-80. doi: 10.1038/nbt.3081. PubMed.
Enhanced homology-directed human genome engineering by controlled timing of CRISPR/Cas9 delivery. Lin S, Staahl BT, Alla RK, Doudna JA. Elife. 2014 Dec 15;3:e04766. doi: 10.7554/eLife.04766. PubMed.
In vivo engineering of oncogenic chromosomal rearrangements with the CRISPR/Cas9 system. Maddalo D, Manchado E, Concepcion CP, Bonetti C, Vidigal JA, Han YC, Ogrodowski P, Crippa A, Rekhtman N, de Stanchina E, Lowe SW, Ventura A. Nature. 2014 Dec 18;516(7531):423-7. doi: 10.1038/nature13902. PubMed.
Rational design of highly active sgRNAs for CRISPR-Cas9-mediated gene inactivation. Doench JG, Hartenian E, Graham DB, Tothova Z, Hegde M, Smith I, Sullender M, Ebert BL, Xavier RJ, Root DE. Nat Biotechnol. 2014 Dec;32(12):1262-7. doi: 10.1038/nbt.3026. PubMed.
Genome editing using Cas9 nickases. Trevino AE, Zhang F. Methods Enzymol. 2014;546:161-74. doi: 10.1016/B978-0-12-801185-0.00008-8. PubMed.
Multiplex CRISPR/Cas9-based genome engineering from a single lentiviral vector. Kabadi AM, Ousterout DG, Hilton IB, Gersbach CA. Nucleic Acids Res. 2014 Oct 29;42(19):e147. doi: 10.1093/nar/gku749. PubMed.
A protein-tagging system for signal amplification in gene expression and fluorescence imaging. Tanenbaum ME, Gilbert LA, Qi LS, Weissman JS, Vale RD. Cell. 2014 Oct 23;159(3):635-46. doi: 10.1016/j.cell.2014.09.039. PubMed.
Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Anders C, Niewoehner O, Duerst A, Jinek M. Nature. 2014 Sep 25;513(7519):569-73. doi: 10.1038/nature13579. PubMed.
Multiplex genome engineering in human cells using all-in-one CRISPR/Cas9 vector system. Sakuma T, Nishikawa A, Kume S, Chayama K, Yamamoto T. Sci Rep. 2014 Jun 23;4:5400. doi: 10.1038/srep05400. PubMed.
Crystal structure of Cas9 in complex with guide RNA and target DNA. Nishimasu H, Ran FA, Hsu PD, Konermann S, Shehata SI, Dohmae N, Ishitani R, Zhang F, Nureki O. Cell. 2014 Feb 27;156(5):935-49. doi: 10.1016/j.cell.2014.02.001. PubMed.
Genetic screens in human cells using the CRISPR-Cas9 system. Wang T, Wei JJ, Sabatini DM, Lander ES. Science. 2014 Jan 3;343(6166):80-4. doi: 10.1126/science.1246981. PubMed.
Genome-scale CRISPR-Cas9 knockout screening in human cells. Shalem O, Sanjana NE, Hartenian E, Shi X, Scott DA, Mikkelsen TS, Heckl D, Ebert BL, Root DE, Doench JG, Zhang F. Science. 2014 Jan 3;343(6166):84-7. doi: 10.1126/science.1247005. PubMed.
Genome engineering using the CRISPR-Cas9 system. Ran FA, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F. Nat Protoc. 2013 Nov;8(11):2281-308. doi: 10.1038/nprot.2013.143. PubMed.
Multiplexed activation of endogenous genes by CRISPR-on, an RNA-guided transcriptional activator system. Cheng AW, Wang H, Yang H, Shi L, Katz Y, Theunissen TW, Rangarajan S, Shivalila CS, Dadon DB, Jaenisch R. Cell Res. 2013 Oct;23(10):1163-71. doi: 10.1038/cr.2013.122. PubMed.
CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering. Mali P, Aach J, Stranges PB, Esvelt KM, Moosburner M, Kosuri S, Yang L, Church GM. Nat Biotechnol. 2013 Sep;31(9):833-8. doi: 10.1038/nbt.2675. PubMed.
DNA targeting specificity of RNA-guided Cas9 nucleases. Hsu PD, Scott DA, Weinstein JA, Ran FA, Konermann S, Agarwala V, Li Y, Fine EJ, Wu X, Shalem O, Cradick TJ, Marraffini LA, Bao G, Zhang F. Nat Biotechnol. 2013 Sep;31(9):827-32. doi: 10.1038/nbt.2647. PubMed.
High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Fu Y, Foden JA, Khayter C, Maeder ML, Reyon D, Joung JK, Sander JD. Nat Biotechnol. 2013 Sep;31(9):822-6. doi: 10.1038/nbt.2623. PubMed.
CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Gilbert LA, Larson MH, Morsut L, Liu Z, Brar GA, Torres SE, Stern-Ginossar N, Brandman O, Whitehead EH, Doudna JA, Lim WA, Weissman JS, Qi LS. Cell. 2013 Jul 18;154(2):442-51. doi: 10.1016/j.cell.2013.06.044. PubMed.
One-step generation of mice carrying mutations in multiple genes by CRISPR/Cas-mediated genome engineering. Wang H, Yang H, Shivalila CS, Dawlaty MM, Cheng AW, Zhang F, Jaenisch R. Cell. 2013 May 9;153(4):910-8. doi: 10.1016/j.cell.2013.04.025. PubMed.
Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, Lim WA. Cell. 2013 Feb 28;152(5):1173-83. doi: 10.1016/j.cell.2013.02.022. PubMed.
Multiplex genome engineering using CRISPR/Cas systems. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, Zhang F. Science. 2013 Feb 15;339(6121):819-23. doi: 10.1126/science.1231143. PubMed.
RNA-guided human genome engineering via Cas9. Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, Norville JE, Church GM. Science. 2013 Feb 15;339(6121):823-6. doi: 10.1126/science.1232033. PubMed.
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50. PubMed.