$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
Large-scale sequencing projects such as ENCODE1, Roadmap Epigenomics2, and FANTOM3 have identified millions of putative enhancers within the human genome across hundreds of cell types. It is estimated that each promoter associates with an average of 4.9 enhancers and each enhancer contacts an average of 2.4 genes3, suggesting that gene expression is often the result of the integration of multiple distributed regulatory interactions. A significant remaining challenge is to define not only how individual enhancers contribute to gene expression, but how they combine to affect expression. Genetic approaches are commonly used to identify relationships between enhancers in model organisms from Drosophila4 to mice5. However, these experiments are time-consuming and low-throughput for the study of multiple enhancers at multiple genes.
One approach for studying enhancer function on a large scale involves massively parallel reporter assays. These assays allow for the simultaneous screening of thousands of DNA sequences for their ability to drive the expression of a reporter gene6. While these assays have shown that DNA sequence can alone be sufficient to convey gene regulation information7, they come with the caveats of being performed outside of the native chromatin context and with a heterologous promoter. In addition, the size of DNA sequence being analyzed in massively parallel reporter assays is usually less than 200 basepairs, which may exclude relevant surrounding sequence. Importantly, as reporter assays only measure the activity of one sequence at a time, they do not take into account the complex relationships that can exist between enhancers. Thus, while massively parallel reporter assays can be informative about the intrinsic activity of a DNA sequence, they do not necessarily inform us of the function of that DNA sequence in the context of the genome.
Recently developed CRISPR/Cas9 tools8 have facilitated the study of gene regulation as they allow for the deletion of enhancers at the endogenous locus. However, deleting multiple enhancers simultaneously may lead to genomic instability, and it is time consuming to generate successive enhancer deletions in a single cell line. In addition, new genomic sequence is created at the site of the deletion following repair, and this sequence may gain regulatory function. An alternative version of Cas9 has been developed specifically for modulating gene expression, relying on fusions of activating9,10 or repressing11,12 domains to the nuclease-deficient form of Cas9 (dCas9). These fusion proteins are ideal for studying multiple loci simultaneously as they do not physically alter the DNA sequence, and instead modulate epigenetics in order to interrogate a regulatory region. The most widely used repressive fusion is KRAB, which recruits the KAP1 co-repressor complex, promoting the deposition of the repression-associated histone H3 lysine 9 trimethylation (H3K9me3)13. dCas9-KRAB, also known as CRISPR interference14, has been used to target and screen individual enhancers for their contributions to gene expression15,16; however, it has not been optimized for targeting multiple regions simultaneously. One version of multiplex CRISPR interference for enhancers, Mosaic-seq17, uses single cell RNA-seq as a readout, but this technology is expensive and only suitable for the study of highly expressed genes due to the low sensitivity of single cell RNA-seq.
We sought to develop a CRISPR interference-based method for dissecting combinatorial enhancer function within the context of a transcriptional response to estrogen. About half of estrogen-responsive genes contain 2 or more enhancers bound by estrogen receptor alpha (ER) nearby18, suggesting that multiple enhancers may be participating in the estrogen response, and understanding the regulatory logic would require targeting multiple enhancers simultaneously. As initial studies using CRISPR interference at promoters suggested that not all promoters are equally responsive to KRAB-mediated repression19, we reasoned that the addition of a distinct repressive domain to dCas9 may facilitate the deactivation of diverse enhancers. We chose the Sin3a Interacting Domain of Mad1 (SID)20 as it leads to the recruitment of histone deacetylases21, which remove acetyl groups on histones that are associated with transcriptional activity. Importantly, the SID domain was effective at reducing gene expression when fused to dCas922 and TALEs23, and Sin3a has been shown to be a potent repressive co-factor in a variety of enhancer sequence contexts24. We used SID4x-dCas9-KRAB (Enhancer-i) to target 10 different enhancers bound by the ER, and identify ER binding sites (ERBS) that are necessary for the estrogen transcriptional response at 4 genes18. We also targeted the combinations of enhancers to identify the sites that cooperate in the production of the estrogen transcriptional response. We found that up to 50 sites can potentially be targeted simultaneously with detectable gene expression changes. Using ChIP-seq and RNA-seq, we demonstrated that Enhancer-i is a highly specific technique for studying multiple enhancers simultaneously.
In this protocol, we describe the steps involved in performing Enhancer-i, a flexible technique that enables the functional study of multiple enhancers simultaneously in a tissue culture setting. Enhancer-i is highly correlated with genetic deletion but provides transient deactivation that is dependent on histone deacetylases (HDACs). By delivering guide RNAs via transient transfection as opposed to stable integration via viral vectors, this protocol avoids deposition and potential spreading of H3K9me3. This protocol details guide RNA design and cloning via Gibson assembly, the transfection of guide RNAs using lipofection, and the analysis of resulting gene expression changes by qPCR. We also include the methods for evaluating the specificity of Enhancer-i targeting at the level of the genome and transcriptome. While this technique was developed to study gene regulation by ER bound enhancers in human cancer cell lines, it is applicable to the dissection of any mammalian enhancer.