Epigenetics describes the heritable changes in gene function that occur independently of the DNA sequence. The molecular basis of epigenetic gene regulation is complex, but essentially involves modifications to the DNA itself or to the proteins with which DNA associates. The predominant epigenetic modification of DNA in mammalian genomes is methylation of cytosine nucleotides (5-MeC). DNA methylation provides instructions to the gene expression machinery as to where and when a gene should be expressed. The primary target sequence for DNA methylation in mammals is the 5'-CpG-3' dinucleotide (Figure 1). CpG dinucleotides are not uniformly distributed throughout the genome, but are concentrated in regions of repetitive genomic sequence and in CpG "islands" commonly associated with gene promoters (Figure 1). DNA methylation patterns are established early in development, modulated during tissue-specific differentiation and disrupted in many disease states, including cancer. To understand the biological role of DNA methylation and its role in human disease, precise, efficient and reproducible methods are required to detect and quantify individual 5-MeCs.
This protocol for bisulphite conversion is the "gold standard" for DNA methylation analysis and facilitates identification and quantification of DNA methylation at single nucleotide resolution. The chemistry of cytosine deamination by sodium bisulphite involves three steps (Figure 2): (1) Sulphonation: addition of bisulphite to the 5-6 double bond of cytosine; (2) Hydrolytic deamination: deamination of the resulting cytosine-bisulphite derivative to give a uracil-bisulphite derivative; (3) Alkali desulphonation: removal of the sulphonate group by an alkali treatment to give uracil. Bisulphite preferentially deaminates cytosine to uracil in single-stranded DNA, whereas 5-MeC is refractory to bisulphite-mediated deamination. Upon PCR amplification, uracil is amplified as thymine while 5-MeC residues remain as cytosines, allowing methylated CpGs to be distinguished from unmethylated CpGs by the presence of a cytosine "C" versus a thymine "T" residue during sequencing.
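The read-out rule described above (unmethylated C is read as T, 5-MeC is still read as C) can be illustrated in silico. The following is a minimal sketch, not part of the protocol: the function names, the example sequence and the methylated position are hypothetical, and it assumes complete conversion, a top-strand read of the same length as the reference, and no sequencing errors.

```python
def bisulfite_convert(seq, methylated_positions):
    """Return the sequence as read after bisulphite treatment and PCR.

    Unmethylated cytosines are deaminated to uracil and amplified as T;
    cytosines listed in methylated_positions (0-based) are left as C.
    """
    methylated = set(methylated_positions)
    converted = []
    for index, base in enumerate(seq.upper()):
        if base == "C" and index not in methylated:
            converted.append("T")
        else:
            converted.append(base)
    return "".join(converted)


def call_cpg_methylation(reference, converted_read):
    """Map each CpG cytosine index to True (methylated) or False (unmethylated).

    Positions where the read shows neither C nor T are skipped as uninformative.
    """
    reference = reference.upper()
    calls = {}
    for index in range(len(reference) - 1):
        if reference[index:index + 2] == "CG":
            base = converted_read[index]
            if base == "C":
                calls[index] = True
            elif base == "T":
                calls[index] = False
    return calls


# Hypothetical example: three CpG sites, only the first one methylated.
reference = "ACGTTCGACCGA"
read = bisulfite_convert(reference, methylated_positions={1})
calls = call_cpg_methylation(reference, read)
```

Cytosines outside a CpG context are converted as well, but the caller above reports only CpG positions, matching the primary target sequence described in the text.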
DNA modification by bisulphite conversion is a well-established protocol that can be exploited for many methods of DNA methylation analysis. Since the detection of 5-MeC by bisulphite conversion was first demonstrated by Frommer et al. [1] and Clark et al. [2], methods based around bisulphite conversion of genomic DNA account for the majority of new data on DNA methylation. Different methods of post-PCR analysis may be utilized, depending on the degree of specificity and resolution of methylation required. Cloning and sequencing is still the most readily available method that can give single nucleotide resolution for methylation across the DNA molecule.
22 Related JoVE Articles!
Selective Capture of 5-hydroxymethylcytosine from Genomic DNA
Institutions: Emory University School of Medicine, The University of Chicago.
5-methylcytosine (5-mC) constitutes ~2-8% of the total cytosines in human genomic DNA and impacts a broad range of biological functions, including gene expression, maintenance of genome integrity, parental imprinting, X-chromosome inactivation, regulation of development, aging, and cancer [1]. Recently, the presence of an oxidized 5-mC, 5-hydroxymethylcytosine (5-hmC), was discovered in mammalian cells, in particular in embryonic stem (ES) cells and neuronal cells [2-4]. 5-hmC is generated by oxidation of 5-mC catalyzed by TET family iron (II)/α-ketoglutarate-dependent dioxygenases [2, 3]. 5-hmC is proposed to be involved in the maintenance of embryonic stem (ES) cells, normal hematopoiesis and malignancies, and zygote development [2, 5-10]. To better understand the function of 5-hmC, a reliable and straightforward sequencing system is essential. Traditional bisulfite sequencing cannot distinguish 5-hmC from 5-mC [11]. To unravel the biology of 5-hmC, we have developed a highly efficient and selective chemical approach to label and capture 5-hmC, taking advantage of a bacteriophage enzyme that adds a glucose moiety to 5-hmC specifically [12].
Here we describe a straightforward two-step procedure for selective chemical labeling of 5-hmC. In the first labeling step, 5-hmC in genomic DNA is labeled with a 6-azide-glucose catalyzed by β-GT, a glucosyltransferase from T4 bacteriophage, in a way that transfers the 6-azide-glucose to 5-hmC from the modified cofactor, UDP-6-N3-Glc (6-N3UDPG). In the second step, biotinylation, a disulfide biotin linker is attached to the azide group by click chemistry. Both steps are highly specific and efficient, leading to complete labeling regardless of the abundance of 5-hmC in genomic regions and giving extremely low background. Following biotinylation of 5-hmC, the 5-hmC-containing DNA fragments are then selectively captured using streptavidin beads in a density-independent manner. The resulting 5-hmC-enriched DNA fragments could be used for downstream analyses, including next-generation sequencing.
Our selective labeling and capture protocol confers high sensitivity and is applicable to any source of genomic DNA with variable or diverse 5-hmC abundances. Although the main purpose of this protocol is its downstream application (i.e., next-generation sequencing to map out the 5-hmC distribution in the genome), it is also compatible with single-molecule, real-time (SMRT) DNA sequencing, which is capable of delivering single-base resolution sequencing of 5-hmC.
Genetics, Issue 68, Chemistry, Biophysics, 5-Hydroxymethylcytosine, chemical labeling, genomic DNA, high-throughput sequencing
Pyrosequencing for Microbial Identification and Characterization
Institutions: Johns Hopkins University, Qiagen Sciences, Inc.
Pyrosequencing is a versatile technique for microbial genome sequencing that can be used to identify bacterial species, discriminate bacterial strains and detect genetic mutations that confer resistance to anti-microbial agents. The advantages of pyrosequencing for microbiology applications include rapid and reliable high-throughput screening and accurate identification of microbes and microbial genome mutations. Pyrosequencing involves sequencing DNA by synthesizing the complementary strand a single base at a time while determining the specific nucleotide incorporated during the synthesis reaction. The reaction occurs on immobilized single-stranded template DNA: the four deoxyribonucleotides (dNTPs) are added sequentially, and unincorporated dNTPs are enzymatically degraded before the next dNTP is added to the synthesis reaction. Incorporation of a specific base is detected by the generation of a chemiluminescent signal. The order of dNTPs that produce chemiluminescent signals determines the DNA sequence of the template. The real-time sequencing capability of pyrosequencing technology enables rapid microbial identification in a single assay. In addition, the pyrosequencing instrument can analyze the full genetic diversity of anti-microbial drug resistance, including typing of SNPs, point mutations, insertions, and deletions, as well as quantification of multiple gene copies that may occur in some anti-microbial resistance patterns.
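The sequencing-by-synthesis logic described above (dNTPs dispensed one at a time, a signal only when the dispensed base is incorporated) can be sketched as a toy simulation. This is an idealized illustration, not instrument software: the function names and the example sequence are hypothetical, and it assumes that the signal is exactly proportional to the number of identical bases incorporated in one dispensation, with no noise.

```python
def pyrogram(synthesized_strand, dispensation_order):
    """Simulate the signal for each dNTP dispensation.

    synthesized_strand is the complementary strand that the polymerase builds;
    each dispensation yields (nucleotide, number_of_bases_incorporated).
    """
    position = 0
    signals = []
    for nucleotide in dispensation_order:
        incorporated = 0
        # A dispensed dNTP is incorporated for as long as it matches the next base.
        while (position < len(synthesized_strand)
               and synthesized_strand[position] == nucleotide):
            incorporated += 1
            position += 1
        signals.append((nucleotide, incorporated))
    return signals


def decode(signals):
    """Rebuild the synthesized sequence from the ordered signals."""
    return "".join(nucleotide * count for nucleotide, count in signals)


# Hypothetical example: cyclic dispensation A, C, G, T repeated three times.
signals = pyrogram("GGATC", "ACGT" * 3)
sequence = decode(signals)
```

Dispensations that give no signal contribute nothing to the decoded sequence, which is why the order of the signal-producing dNTPs alone determines the read.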
Microbiology, Issue 78, Genetics, Molecular Biology, Basic Protocols, Genomics, Eukaryota, Bacteria, Viruses, Bacterial Infections and Mycoses, Virus Diseases, Diagnosis, Therapeutics, Equipment and Supplies, Technology, Industry, and Agriculture, Life Sciences (General), Pyrosequencing, DNA, Microbe, PCR, primers, Next-Generation, high-throughput, sequencing
Unraveling the Unseen Players in the Ocean - A Field Guide to Water Chemistry and Marine Microbiology
Institutions: San Diego State University, University of California San Diego.
Here we introduce a series of thoroughly tested and well-standardized research protocols adapted for use in remote marine environments. The sampling protocols include the assessment of resources available to the microbial community (dissolved organic carbon, particulate organic matter, inorganic nutrients), and a comprehensive description of the viral and bacterial communities (via direct viral and microbial counts, enumeration of autofluorescent microbes, and construction of viral and microbial metagenomes). We use a combination of methods drawn from a range of scientific disciplines, comprising already established protocols and some of the most recently developed techniques. The metagenomic sequencing techniques used for viral and bacterial community characterization, in particular, have been established only in recent years and are thus still subject to constant improvement. This has led to a variety of sampling and sample processing procedures currently in use. The set of methods presented here provides an up-to-date approach to collecting and processing environmental samples. Parameters addressed with these protocols yield the minimum of information essential to characterize and understand the underlying mechanisms of viral and microbial community dynamics. The set gives easy-to-follow guidelines for conducting comprehensive surveys and discusses critical steps and potential caveats pertinent to each technique.
Environmental Sciences, Issue 93, dissolved organic carbon, particulate organic matter, nutrients, DAPI, SYBR, microbial metagenomics, viral metagenomics, marine environment
Polysome Fractionation and Analysis of Mammalian Translatomes on a Genome-wide Scale
Institutions: McGill University, Karolinska Institutet, McGill University.
mRNA translation plays a central role in the regulation of gene expression and represents the most energy consuming process in mammalian cells. Accordingly, dysregulation of mRNA translation is considered to play a major role in a variety of pathological states including cancer. Ribosomes also host chaperones, which facilitate folding of nascent polypeptides, thereby modulating function and stability of newly synthesized polypeptides. In addition, emerging data indicate that ribosomes serve as a platform for a repertoire of signaling molecules, which are implicated in a variety of post-translational modifications of newly synthesized polypeptides as they emerge from the ribosome, and/or components of translational machinery. Herein, a well-established method of ribosome fractionation using sucrose density gradient centrifugation is described. In conjunction with the in-house developed “anota” algorithm this method allows direct determination of differential translation of individual mRNAs on a genome-wide scale. Moreover, this versatile protocol can be used for a variety of biochemical studies aiming to dissect the function of ribosome-associated protein complexes, including those that play a central role in folding and degradation of newly synthesized polypeptides.
Biochemistry, Issue 87, Cells, Eukaryota, Nutritional and Metabolic Diseases, Neoplasms, Metabolic Phenomena, Cell Physiological Phenomena, mRNA translation, ribosomes, protein synthesis, genome-wide analysis, translatome, mTOR, eIF4E, 4E-BP1
An Affordable HIV-1 Drug Resistance Monitoring Method for Resource Limited Settings
Institutions: University of KwaZulu-Natal, Durban, South Africa, Jembi Health Systems, University of Amsterdam, Stanford Medical School.
HIV-1 drug resistance has the potential to seriously compromise the effectiveness and impact of antiretroviral therapy (ART). As ART programs in sub-Saharan Africa continue to expand, individuals on ART should be closely monitored for the emergence of drug resistance. Surveillance of transmitted drug resistance to track transmission of viral strains already resistant to ART is also critical. Unfortunately, drug resistance testing is still not readily accessible in resource limited settings, because genotyping is expensive and requires sophisticated laboratory and data management infrastructure. An open access genotypic drug resistance monitoring method to manage individuals and assess transmitted drug resistance is described. The method uses free open source software for the interpretation of drug resistance patterns and the generation of individual patient reports. The genotyping protocol has an amplification rate of greater than 95% for plasma samples with a viral load >1,000 HIV-1 RNA copies/ml. The sensitivity decreases significantly for viral loads <1,000 HIV-1 RNA copies/ml. The method described here was validated against a method of HIV-1 drug resistance testing approved by the United States Food and Drug Administration (FDA), the Viroseq genotyping method. Limitations of the method described here include the fact that it is not automated and that it also failed to amplify the circulating recombinant form CRF02_AG from a validation panel of samples, although it amplified subtypes A and B from the same panel.
Medicine, Issue 85, Biomedical Technology, HIV-1, HIV Infections, Viremia, Nucleic Acids, genetics, antiretroviral therapy, drug resistance, genotyping, affordable
Genotypic Inference of HIV-1 Tropism Using Population-based Sequencing of V3
Institutions: BC Centre for Excellence in HIV/AIDS.
Background: Prior to receiving a drug from the CCR5-antagonist class in HIV therapy, a patient must undergo an HIV tropism test to confirm that his or her viral population uses the CCR5 coreceptor for cellular entry, and not an alternative coreceptor. One approach to tropism testing is to examine the sequence of the V3 region of the HIV envelope, which interacts with the coreceptor.
Methods: Viral RNA is extracted from blood plasma. The V3 region is amplified in triplicate with nested reverse transcriptase-PCR. The amplifications are then sequenced and analyzed using the software, RE_Call. Sequences are then submitted to a bioinformatic algorithm such as geno2pheno to infer viral tropism from the V3 region. Sequences are inferred to be non-R5 if their geno2pheno false positive rate falls below 5.75%. If any one of the three sequences from a sample is inferred to be non-R5, the patient is unlikely to respond to a CCR5-antagonist.
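The decision rule in the Methods paragraph above can be written down compactly. The sketch below only encodes the two stated rules (a sequence is non-R5 when its false positive rate falls below 5.75%, and a sample is non-R5 when any of its triplicate sequences is non-R5). It does not call geno2pheno: the false positive rates are assumed to have been obtained separately, and the function names and example values are hypothetical.

```python
FPR_CUTOFF_PERCENT = 5.75  # cutoff stated in the text


def is_non_r5(fpr_percent, cutoff=FPR_CUTOFF_PERCENT):
    """A sequence is inferred to be non-R5 if its FPR falls below the cutoff."""
    return fpr_percent < cutoff


def sample_call(replicate_fprs, cutoff=FPR_CUTOFF_PERCENT):
    """Call a sample non-R5 if any replicate sequence is inferred to be non-R5."""
    if any(is_non_r5(fpr, cutoff) for fpr in replicate_fprs):
        return "non-R5"
    return "R5"


# Hypothetical triplicate FPR values (percent) for one plasma sample.
call = sample_call([40.2, 12.0, 3.1])
```

Because the text says the rate must fall below the cutoff, a value of exactly 5.75% is treated here as R5; that boundary choice is an interpretation of the wording.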
Immunology, Issue 46, HIV, tropism, coreceptor, V3, genotyping, sequencing, CCR5, CXCR4, maraviroc
Metabolic Labeling of Newly Transcribed RNA for High Resolution Gene Expression Profiling of RNA Synthesis, Processing and Decay in Cell Culture
Institutions: Max von Pettenkofer Institute, University of Cambridge, Ludwig-Maximilians-University Munich.
The development of whole-transcriptome microarrays and next-generation sequencing has revolutionized our understanding of the complexity of cellular gene expression. Along with a better understanding of the molecular mechanisms involved, precise measurements of the underlying kinetics have become increasingly important. Here, these powerful methodologies face major limitations due to intrinsic properties of the template samples they study, i.e., total cellular RNA. In many cases changes in total cellular RNA occur either too slowly or too quickly to represent the underlying molecular events and their kinetics with sufficient resolution. In addition, the contributions of alterations in RNA synthesis, processing, and decay are not readily differentiated.
We recently developed high-resolution gene expression profiling to overcome these limitations. Our approach is based on metabolic labeling of newly transcribed RNA with 4-thiouridine (thus also referred to as 4sU-tagging) followed by rigorous purification of newly transcribed RNA using thiol-specific biotinylation and streptavidin-coated magnetic beads. It is applicable to a broad range of organisms including vertebrates, Drosophila, and yeast. We successfully applied 4sU-tagging to study real-time kinetics of transcription factor activities, provide precise measurements of RNA half-lives, and obtain novel insights into the kinetics of RNA processing. Finally, computational modeling can be employed to generate an integrated, comprehensive analysis of the underlying molecular mechanisms.
Genetics, Issue 78, Cellular Biology, Molecular Biology, Microbiology, Biochemistry, Eukaryota, Investigative Techniques, Biological Phenomena, Gene expression profiling, RNA synthesis, RNA processing, RNA decay, 4-thiouridine, 4sU-tagging, microarray analysis, RNA-seq, RNA, DNA, PCR, sequencing
Amplifying and Quantifying HIV-1 RNA in HIV Infected Individuals with Viral Loads Below the Limit of Detection by Standard Clinical Assays
Institutions: NCI-Frederick, University of Pittsburgh, Tufts University.
Amplifying viral genes and quantifying HIV-1 RNA in HIV-1 infected individuals with viral loads below the limit of detection by standard assays (below 50-75 copies/ml) is necessary to gain insight into viral dynamics and virus-host interactions in patients who naturally control the infection and those who are on combination antiretroviral therapy (cART).
Here we describe how to amplify viral genomes by single genome sequencing (the SGS protocol) [13, 19] and how to accurately quantify HIV-1 RNA in patients with low viral loads (the single-copy assay (SCA) protocol) [12, 20].
The single-copy assay is a real-time PCR assay with sensitivity depending on the volume of plasma being assayed. If a single virus genome is detected in 7 ml of plasma, then the RNA copy number is reported to be 0.3 copies/ml. The assay has an internal control testing for the efficiency of RNA extraction, and controls for possible amplification from DNA or contamination. Patient samples are measured in triplicate.
The single-genome sequencing assay (SGS), now widely used and considered to be non-labor intensive [3, 7, 12, 14, 15], is a limiting dilution assay in which endpoint-diluted cDNA product is spread over a 96-well plate. According to a Poisson distribution, when less than 1/3 of the wells give product, there is an 80% chance that the PCR product in a given well results from amplification of a single cDNA molecule. SGS has the advantage over cloning of not being subject to resampling and not being biased by PCR-introduced recombination [19]. However, the amplification success of SCA and SGS depends on primer design. Both assays were developed for HIV-1 subtype B, but can be adapted for other subtypes and other regions of the genome by changing primers, probes, and standards.
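The Poisson statement in the SGS paragraph above can be checked with a short calculation. The sketch assumes that cDNA molecules are distributed across wells as a Poisson variable with mean λ and that every well containing at least one template gives product; the function names are hypothetical. Under those assumptions, the observed fraction of positive wells f gives λ = -ln(1 - f), and the chance that a positive well started from exactly one molecule is λ·e^(-λ) / f.

```python
import math


def mean_templates_per_well(fraction_positive):
    """Poisson mean implied by the fraction of wells that give PCR product."""
    if not 0.0 < fraction_positive < 1.0:
        raise ValueError("fraction_positive must be strictly between 0 and 1")
    # P(no template in a well) = exp(-mean) = 1 - fraction_positive
    return -math.log(1.0 - fraction_positive)


def prob_single_template(fraction_positive):
    """P(exactly one template | well is positive) under the Poisson model."""
    mean = mean_templates_per_well(fraction_positive)
    return mean * math.exp(-mean) / fraction_positive


# With one third of the wells positive, roughly 81% of positive wells are
# expected to derive from a single cDNA molecule.
p_single = prob_single_template(1.0 / 3.0)
```

Lower positive fractions push this probability higher (about 83% at 30% positive wells), which is why the plate is diluted until fewer than a third of the wells give product.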
Immunology, Issue 55, single genome sequencing, SGS, real-time PCR, single-copy assay, SCA, HIV-1, ultra-sensitive, RNA extraction
Identifying DNA Mutations in Purified Hematopoietic Stem/Progenitor Cells
Institutions: UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio.
In recent years, it has become apparent that genomic instability is tightly related to many developmental disorders, cancers, and aging. Given that stem cells are responsible for ensuring tissue homeostasis and repair throughout life, it is reasonable to hypothesize that the stem cell population is critical for preserving genomic integrity of tissues. Therefore, significant interest has arisen in assessing the impact of endogenous and environmental factors on genomic integrity in stem cells and their progeny, aiming to understand the etiology of stem-cell based diseases.
LacI transgenic mice carry a recoverable λ phage vector encoding the LacI reporter system, in which the LacI gene serves as the mutation reporter. The result of a mutated LacI gene is the production of β-galactosidase, which cleaves a chromogenic substrate, turning it blue. The LacI reporter system is carried in all cells, including stem/progenitor cells, and can easily be recovered and used to subsequently infect E. coli. After incubating infected E. coli on agarose that contains the correct substrate, plaques can be scored; blue plaques indicate a mutant LacI gene, while clear plaques harbor wild-type. The frequency of blue (among clear) plaques indicates the mutant frequency in the original cell population from which the DNA was extracted. Sequencing the mutant LacI gene will show the location of the mutations in the gene and the type of mutation.
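The plaque scoring described above reduces to a simple ratio. The sketch below is illustrative only: it assumes that the mutant frequency is taken as blue plaques divided by all plaques scored (blue plus clear), and the function name and the plaque counts are hypothetical.

```python
def mutant_frequency(blue_plaques, clear_plaques):
    """Fraction of scored plaques that carry a mutant reporter (blue)."""
    if blue_plaques < 0 or clear_plaques < 0:
        raise ValueError("plaque counts cannot be negative")
    total = blue_plaques + clear_plaques
    if total == 0:
        raise ValueError("no plaques were scored")
    return blue_plaques / total


# Hypothetical plate counts: 3 blue plaques among 100,000 plaques scored.
frequency = mutant_frequency(blue_plaques=3, clear_plaques=99_997)
per_100k = frequency * 1e5  # mutants per 100,000 plaques
```

Because spontaneous mutants are rare, the number of plaques scored sets the lowest frequency that can be resolved, so counts from several plates are typically pooled before the ratio is computed.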
The LacI transgenic mouse model is well-established as an in vivo mutagenesis assay. Moreover, the mice and the reagents for the assay are commercially available. Here we describe in detail how this model can be adapted to measure the frequency of spontaneously occurring DNA mutants in stem cell-enriched Lin- (LSK) cells and other subpopulations of the hematopoietic system.
Infection, Issue 84, In vivo mutagenesis, hematopoietic stem/progenitor cells, LacI mouse model, DNA mutations, E. coli
Competitive Genomic Screens of Barcoded Yeast Libraries
Institutions: University of Toronto, University of Toronto, University of Toronto, National Human Genome Research Institute, NIH, Stanford University , University of Toronto.
By virtue of advances in next generation sequencing technologies, we have access to new genome sequences almost daily. The tempo of these advances is accelerating, promising greater depth and breadth. In light of these extraordinary advances, the need for fast, parallel methods to define gene function becomes ever more important. Collections of genome-wide deletion mutants in yeasts and E. coli have served as workhorses for functional characterization of genes, but this approach is not scalable: current gene-deletion approaches require each of the thousands of genes that comprise a genome to be deleted and verified. Only after this work is complete can we pursue high-throughput phenotyping. Over the past decade, our laboratory has refined a portfolio of competitive, miniaturized, high-throughput genome-wide assays that can be performed in parallel. This parallelization is possible because of the inclusion of DNA 'tags', or 'barcodes', in each mutant; the barcode serves as a proxy for the mutation, and barcode abundance can be measured to assess mutant fitness. In this study, we seek to fill the gap between DNA sequence and barcoded mutant collections. To accomplish this we introduce a combined transposon disruption-barcoding approach that opens up parallel barcode assays to newly sequenced, but poorly characterized microbes. To illustrate this approach we present a new Candida albicans barcoded disruption collection and describe how both microarray-based and next generation sequencing-based platforms can be used to collect 10,000 - 1,000,000 gene-gene and drug-gene interactions in a single experiment.
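The idea that barcode abundance serves as a proxy for mutant fitness can be illustrated with a simple relative-abundance score. The sketch below is an assumption for illustration, not the scoring used by the authors: it compares each barcode's share of the pool in a treated sample with its share in a control sample as a log2 ratio, and the function name, pseudocount and barcode counts are hypothetical.

```python
import math


def barcode_log2_ratios(control_counts, treatment_counts, pseudocount=1.0):
    """Log2 ratio of each barcode's pool fraction, treatment over control.

    Negative values mean the barcoded mutant was depleted under treatment.
    A pseudocount avoids division by zero for barcodes that drop out.
    """
    barcodes = sorted(control_counts)
    n = len(barcodes)
    control_total = sum(control_counts[b] for b in barcodes) + pseudocount * n
    treatment_total = (sum(treatment_counts.get(b, 0) for b in barcodes)
                       + pseudocount * n)
    ratios = {}
    for barcode in barcodes:
        control_fraction = (control_counts[barcode] + pseudocount) / control_total
        treatment_fraction = ((treatment_counts.get(barcode, 0) + pseudocount)
                              / treatment_total)
        ratios[barcode] = math.log2(treatment_fraction / control_fraction)
    return ratios


# Hypothetical sequencing counts for two barcoded mutants grown in one pool.
control = {"barcode_A": 100, "barcode_B": 100}
treated = {"barcode_A": 150, "barcode_B": 50}
ratios = barcode_log2_ratios(control, treated)
```

Normalizing to pool fractions rather than raw counts keeps the score independent of sequencing depth, which matters when the two samples are read to different depths.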
Biochemistry, Issue 54, chemical biology, chemogenomics, chemical probes, barcode microarray, next generation sequencing
Methylated DNA Immunoprecipitation
Institutions: BC Cancer Research Centre, University of British Columbia - UBC, University of British Columbia - UBC, BC Cancer Agency, University of British Columbia - UBC.
The identification of DNA methylation patterns is a common procedure in the study of epigenetics, as methylation is known to have significant effects on gene expression and is involved in normal development as well as disease [1-4]. Thus, the ability to discriminate between methylated DNA and non-methylated DNA is essential for generating methylation profiles for such studies. Methylated DNA immunoprecipitation (MeDIP) is an efficient technique for the extraction of methylated DNA from a sample of interest [5-7]. A sample of as little as 200 ng of DNA is sufficient for the antibody, or immunoprecipitation (IP), reaction. DNA is sonicated into fragments ranging in size from 300-1000 bp and is divided into immunoprecipitated (IP) and input (IN) portions. IP DNA is subsequently heat denatured and then incubated with anti-5-mC, a monoclonal antibody that binds methylated DNA. Magnetic beads carrying a secondary antibody with affinity for the primary antibody are then added and incubated; these bead-linked antibodies bind the monoclonal antibody used in the first step. DNA bound to the antibody complex (methylated DNA) is separated from the rest of the DNA by using a magnet to pull the complexes out of solution. Several washes using IP buffer are then performed to remove the unbound, non-methylated DNA. The methylated DNA/antibody complexes are then treated with Proteinase K to digest the antibodies, leaving only the methylated DNA intact. The enriched DNA is purified by phenol:chloroform extraction to remove the protein matter and then precipitated and resuspended in water for later use. PCR techniques can be used to validate the efficiency of the MeDIP procedure by analyzing the amplification products of IP and IN DNA for regions known to lack and known to contain methylated sequences. The purified methylated DNA can then be used for locus-specific (PCR) or genome-wide (microarray and sequencing) methylation studies, and is particularly useful when applied in conjunction with other research tools such as gene expression profiling and array comparative genome hybridization (CGH) [8]. Further investigation into DNA methylation will lead to the discovery of new epigenetic targets, which in turn may be useful in developing new therapeutic or prognostic research tools for diseases such as cancer that are characterized by aberrantly methylated DNA [2, 4, 9-11].
Cell Biology, Issue 23, DNA methylation, immunoprecipitation, epigenomics, epigenetics, methylcytosine, MeDIP protocol, 5-methylcytosine antibody, anti-5-methylcytosine, microarray
Single Oocyte Bisulfite Mutagenesis
Institutions: Schulich School of Medicine and Dentistry, University of Western Ontario, Schulich School of Medicine and Dentistry, University of Western Ontario, Children's Health Research Institute.
Epigenetics encompasses all heritable and reversible modifications to chromatin that alter gene accessibility, and thus are the primary mechanisms for regulating gene transcription [1]. DNA methylation is an epigenetic modification that acts predominantly as a repressive mark. Through the covalent addition of a methyl group onto cytosines in CpG dinucleotides, it can recruit additional repressive proteins and histone modifications to initiate processes involved in condensing chromatin and silencing genes [2]. DNA methylation is essential for normal development as it plays a critical role in developmental programming, cell differentiation, repression of retroviral elements, X-chromosome inactivation and genomic imprinting.
One of the most powerful methods for DNA methylation analysis is bisulfite mutagenesis. Sodium bisulfite is a DNA mutagen that deaminates cytosines into uracils. Following PCR amplification and sequencing, these conversion events are detected as thymines. Methylated cytosines are protected from deamination and thus remain as cytosines, enabling identification of DNA methylation at the individual nucleotide level [3]. The bisulfite mutagenesis assay has advanced from the protocols originally reported [4-6] towards ones that are more sensitive and reproducible [7]. One key advancement was embedding smaller amounts of DNA in an agarose bead, thereby protecting DNA from the harsh bisulfite treatment [8]. This enabled methylation analysis to be performed on pools of oocytes and blastocyst-stage embryos [9]. The most sophisticated bisulfite mutagenesis protocol to date is for individual blastocyst-stage embryos [10]. However, since blastocysts have on average 64 cells (containing 120-720 pg of genomic DNA), this method is not efficacious for methylation studies on individual oocytes or cleavage-stage embryos.
Taking cues from agarose embedding of minute DNA amounts, including oocytes [11], here we present a method whereby oocytes are directly embedded in an agarose and lysis solution bead immediately following retrieval and removal of the zona pellucida from the oocyte. This enables us to bypass the two main challenges of single oocyte bisulfite mutagenesis: protecting a minute amount of DNA from degradation, and preventing its subsequent loss during the numerous protocol steps. Importantly, as data are obtained from single oocytes, the issue of PCR bias within pools is eliminated. Furthermore, inadvertent cumulus cell contamination is detectable by this method since any sample with more than one methylation pattern may be excluded from analysis [12]. This protocol provides an improved method for successful and reproducible analyses of DNA methylation at the single-cell level and is ideally suited for individual oocytes as well as cleavage-stage embryos.
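The exclusion rule mentioned above (a sample showing more than one methylation pattern may be excluded) can be expressed as a small filter. This is a sketch under stated assumptions: each sequenced clone is represented as a string with one character per CpG site ("M" for methylated, "U" for unmethylated), the clones are already trimmed to the same CpG sites, and the function names and example data are hypothetical.

```python
def count_patterns(clone_patterns):
    """Number of distinct CpG methylation patterns among a sample's clones."""
    return len(set(clone_patterns))


def passes_single_pattern_check(clone_patterns):
    """Keep a single-oocyte sample only if all clones share one pattern.

    More than one pattern suggests contaminating DNA (for example from
    cumulus cells); a sample with no clones is not kept either.
    """
    return count_patterns(clone_patterns) == 1


# Hypothetical clones: one string per clone, one character per CpG site.
samples = {
    "oocyte_1": ["MMMMM", "MMMMM", "MMMMM"],
    "oocyte_2": ["MMMMM", "UUUUU", "MMMMM"],
}
kept = [name for name, clones in samples.items()
        if passes_single_pattern_check(clones)]
```

In practice, incomplete conversion or sequencing errors could create spurious extra patterns, so a stricter or more tolerant comparison may be needed; the exact-match rule here is the simplest reading of the text.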
Genetics, Issue 64, Developmental Biology, Biochemistry, Bisulfite mutagenesis, DNA methylation, individual oocyte, individual embryo, mouse model, PCR, epigenetics
Determination of DNA Methylation of Imprinted Genes in Arabidopsis Endosperm
Institutions: Saint Louis University.
Arabidopsis is an excellent model organism for studying epigenetic mechanisms. One reason is that loss-of-function null mutants of DNA methyltransferases are viable, thus providing a system to study how loss of DNA methylation in a genome affects growth and development. Imprinting refers to differential expression of maternal and paternal alleles and plays an important role in reproductive development in both mammals and plants. DNA methylation is critical for determining whether the maternal or paternal allele of an imprinted gene is expressed or silenced. In flowering plants, there is a double fertilization event in reproduction: one sperm cell fertilizes the egg cell to form the embryo and a second sperm fuses with the central cell to give rise to the endosperm. Endosperm is the tissue where imprinting occurs in plants. MEDEA, a SET domain Polycomb group gene, and FWA, a transcription factor regulating flowering, are the first two genes shown to be imprinted in endosperm, and their expression is controlled by DNA methylation and demethylation in plants. In order to determine the imprinting status of a gene and its methylation pattern in endosperm, we need to be able to isolate endosperm first. Since the seed is tiny in Arabidopsis, it remains challenging to isolate Arabidopsis endosperm and examine its methylation. In this video protocol, we report how to conduct a genetic cross, to isolate endosperm tissue from seeds, and to determine the methylation status by bisulfite sequencing.
Plant Biology, Issue 47, DNA methylation, imprinting, bisulfite sequencing, endosperm, Arabidopsis
Optimized Analysis of DNA Methylation and Gene Expression from Small, Anatomically-defined Areas of the Brain
Institutions: Max Planck Institute of Psychiatry.
Exposure to diet, drugs and early life adversity during sensitive windows of life [1, 2] can lead to lasting changes in gene expression that contribute to the display of physiological and behavioural phenotypes. Such environmental programming is likely to increase the susceptibility to metabolic, cardiovascular and mental diseases [3, 4]. DNA methylation and histone modifications are considered key processes in the mediation of the gene-environment dialogue and also appear to underlie environmental programming [5]. In mammals, DNA methylation typically comprises the covalent addition of a methyl group at the 5-position of cytosine within the context of CpG dinucleotides.
CpG methylation occurs in a highly tissue- and cell-specific manner making it a challenge to study discrete, small regions of the brain where cellular heterogeneity is high and tissue quantity limited. Moreover, because gene expression and methylation are closely linked events, increased value can be gained by comparing both parameters in the same sample.
Here, a step-by-step protocol (Figure 1) for the investigation of epigenetic programming in the brain is presented using the 'maternal separation' paradigm of early life adversity for illustrative purposes. The protocol describes the preparation of micropunches from differentially-aged mouse brains from which DNA and RNA can be simultaneously isolated, thus allowing DNA methylation and gene expression analyses in the same sample.
Neuroscience, Issue 65, Genetics, Physiology, Epigenetics, DNA methylation, early-life stress, maternal separation, bisulfite sequencing
Efficient and Rapid Isolation of Early-stage Embryos from Arabidopsis thaliana Seeds
Institutions: University of Zürich.
In flowering plants, the embryo develops within a nourishing tissue - the endosperm - surrounded by the maternal seed integuments (or seed coat). As a consequence, the isolation of plant embryos at early stages (1 cell to globular stage) is technically challenging due to their relative inaccessibility. Efficient manual dissection at early stages is strongly impaired by the small size of young Arabidopsis seeds and the adhesiveness of the embryo to the surrounding tissues. Here, we describe a method that allows the efficient isolation of young Arabidopsis embryos, yielding up to 40 embryos in 1 hr to 4 hr, depending on the downstream application. Embryos are released into isolation buffer by slightly crushing 250-750 seeds with a plastic pestle in an Eppendorf tube. A glass microcapillary attached to either a standard laboratory pipette (via a rubber tube) or a hydraulically controlled microinjector is used to collect embryos from droplets placed on a multi-well slide on an inverted light microscope. The technical skills required are simple and easily transferable, and the basic setup does not require costly equipment. Collected embryos are suitable for a variety of downstream applications such as RT-PCR, RNA sequencing, DNA methylation analyses, fluorescence in situ hybridization (FISH), immunostaining, and reporter gene assays.
Plant Biology, Issue 76, Cellular Biology, Developmental Biology, Molecular Biology, Genetics, Embryology, Embryo isolation, Arabidopsis thaliana, RNA amplification, transcriptomics, DNA methylation profiling, FISH, reporter assays
Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
Institutions: Memorial Sloan-Kettering Cancer Center, Memorial Sloan-Kettering Cancer Center.
Efforts to detect and investigate key oncogenic mutations have proven valuable to facilitate the appropriate treatment for cancer patients. The establishment of high-throughput, massively parallel "next-generation" sequencing has aided the discovery of many such mutations. To enhance the clinical and translational utility of this technology, platforms must be high-throughput, cost-effective, and compatible with formalin-fixed paraffin embedded (FFPE) tissue samples that may yield small amounts of degraded or damaged DNA. Here, we describe the preparation of barcoded and multiplexed DNA libraries followed by hybridization-based capture of targeted exons for the detection of cancer-associated mutations in fresh frozen and FFPE tumors by massively parallel sequencing. This method enables the identification of sequence mutations, copy number alterations, and select structural rearrangements involving all targeted genes. Targeted exon sequencing offers the benefits of high throughput, low cost, and deep sequence coverage, thus conferring high sensitivity for detecting low frequency mutations.
Molecular Biology, Issue 80, Molecular Diagnostic Techniques, High-Throughput Nucleotide Sequencing, Genetics, Neoplasms, Diagnosis, Massively parallel sequencing, targeted exon sequencing, hybridization capture, cancer, FFPE, DNA mutations
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g. drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism1,2. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence3.
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recently published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin,"4 in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets to those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSC self-renewal and differentiation is important for the application of HSCs in research and clinical uses. However, it is not possible to obtain large quantities of HSCs due to their inability to proliferate in vitro. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro and in vivo.
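The comparison between Lin-CD34+ and Lin-CD34- cells described above can be illustrated with a minimal sketch. It assumes that expression values are already available per gene for each population and that a list of transcription factor genes is known; the gene names, the pseudocount, and the fold-change threshold are hypothetical choices made for illustration, not values taken from the protocol. A real analysis would also use replicates and a statistical test rather than fold change alone.

```python
import math


def log2_fold_change(expr_a, expr_b, pseudocount=1.0):
    """Log2 ratio of two expression values; the pseudocount avoids division by zero."""
    return math.log2((expr_a + pseudocount) / (expr_b + pseudocount))


def rank_transcription_factors(expr_pos, expr_neg, tf_genes, min_abs_lfc=1.0):
    """Rank transcription factors by absolute log2 fold change between two populations.

    expr_pos / expr_neg: dicts mapping gene name -> expression value
    (for example in Lin-CD34+ and Lin-CD34- cells).
    tf_genes: iterable of gene names annotated as transcription factors.
    """
    ranked = []
    for gene in tf_genes:
        if gene in expr_pos and gene in expr_neg:
            lfc = log2_fold_change(expr_pos[gene], expr_neg[gene])
            if abs(lfc) >= min_abs_lfc:
                ranked.append((gene, lfc))
    # Largest absolute change first.
    ranked.sort(key=lambda item: abs(item[1]), reverse=True)
    return ranked


# Hypothetical toy values, not data from the study.
expr_pos = {"TfA": 7.0, "TfB": 1.0, "TfC": 0.0}
expr_neg = {"TfA": 1.0, "TfB": 1.0, "TfC": 15.0}
ranked = rank_transcription_factors(expr_pos, expr_neg, ["TfA", "TfB", "TfC"])
```

Here `TfB` is dropped because its expression is the same in both populations, while `TfC` and `TfA` are kept as candidates for follow-up.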
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
Chromatin Interaction Analysis with Paired-End Tag Sequencing (ChIA-PET) for Mapping Chromatin Interactions and Understanding Transcription Regulation
Institutions: Agency for Science, Technology and Research, Singapore, A*STAR-Duke-NUS Neuroscience Research Partnership, Singapore, National University of Singapore, Singapore.
Genomes are organized into three-dimensional structures, adopting higher-order conformations inside the micron-sized nuclear spaces 7, 2, 12. Such architectures are not random and involve interactions between gene promoters and regulatory elements 13. The binding of transcription factors to specific regulatory sequences brings about a network of transcription regulation and coordination 1, 14.
Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) was developed to identify these higher-order chromatin structures 5,6. Cells are fixed and interacting loci are captured by covalent DNA-protein cross-links. To minimize non-specific noise and reduce complexity, as well as to increase the specificity of the chromatin interaction analysis, chromatin immunoprecipitation (ChIP) is used against specific protein factors to enrich chromatin fragments of interest before proximity ligation. Ligation involving half-linkers subsequently forms covalent links between pairs of DNA fragments tethered together within individual chromatin complexes. The flanking MmeI restriction enzyme sites in the half-linkers allow extraction of paired end tag-linker-tag constructs (PETs) upon MmeI digestion. As the half-linkers are biotinylated, these PET constructs are purified using streptavidin-magnetic beads. The purified PETs are ligated with next-generation sequencing adaptors and a catalog of interacting fragments is generated via next-generation sequencers such as the Illumina Genome Analyzer. Mapping and bioinformatics analysis is then performed to identify ChIP-enriched binding sites and ChIP-enriched chromatin interactions 8.
We have produced a video to demonstrate critical aspects of the ChIA-PET protocol, especially the preparation of ChIP as the quality of ChIP plays a major role in the outcome of a ChIA-PET library. As the protocols are very long, only the critical steps are shown in the video.
Genetics, Issue 62, ChIP, ChIA-PET, Chromatin Interactions, Genomics, Next-Generation Sequencing
gDNA Enrichment by a Transposase-based Technology for NGS Analysis of the Whole Sequence of BRCA1, BRCA2, and 9 Genes Involved in DNA Damage Repair
Institutions: Centre Georges-François Leclerc.
The widespread use of Next Generation Sequencing has opened up new avenues for cancer research and diagnosis. NGS will bring huge amounts of new data on cancer, and especially cancer genetics. Current knowledge and future discoveries will make it necessary to study a huge number of genes that could be involved in a genetic predisposition to cancer. In this regard, we developed a Nextera design to study 11 complete genes involved in DNA damage repair. This protocol was developed to safely study 11 genes (ATM, and TP53) from promoter to 3'-UTR in 24 patients simultaneously. This protocol, based on transposase technology and gDNA enrichment, gives a great advantage in terms of time for the genetic diagnosis thanks to sample multiplexing. This protocol can be safely used with blood gDNA.
Genetics, Issue 92, gDNA enrichment, Nextera, NGS, DNA damage, BRCA1, BRCA2
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1 and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant, relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age-related increase in paternal de novo mutations. This is the case for autism and schizophrenia3. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo mutations would more frequently come from males, particularly older males4. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complex diseases, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification of de novo mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
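The idea of studying parents together with affected subjects can be illustrated with a minimal sketch. It assumes that variant calls are already available for the child and both parents as sets of (chromosome, position, reference allele, alternate allele) tuples; this data layout and the toy variants are hypothetical and are not taken from the protocol. A variant found in the child but in neither parent is only a candidate: it still needs adequate sequencing coverage in both parents and independent validation before it can be called de novo.

```python
def candidate_de_novo(child, mother, father):
    """Return variants present in the child but absent from both parents.

    Each argument is a set of (chrom, pos, ref, alt) tuples.
    The result is sorted so that the output order is reproducible.
    """
    return sorted(child - mother - father)


# Hypothetical toy variant calls for one parent-child trio.
child = {
    ("chr1", 1000, "A", "G"),  # inherited from the mother
    ("chr2", 500, "C", "T"),   # inherited from the father
    ("chr7", 42, "G", "A"),    # seen in neither parent
}
mother = {("chr1", 1000, "A", "G")}
father = {("chr2", 500, "C", "T")}

candidates = candidate_de_novo(child, mother, father)
```

Only the `chr7` variant remains in `candidates`, because the other two calls are explained by inheritance from one parent.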
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
High Sensitivity 5-hydroxymethylcytosine Detection in Balb/C Brain Tissue
Institutions: New England Biolabs.
DNA hydroxymethylation is a long-known modification of DNA, but has recently become a focus in epigenetic research. Mammalian DNA is enzymatically modified at the 5th carbon position of cytosine (C) residues to 5-mC, predominantly in the context of CpG dinucleotides. 5-mC is amenable to enzymatic oxidation to 5-hmC by the Tet family of enzymes, which are believed to be involved in development and disease. Currently, the biological role of 5-hmC is not fully understood, but is generating a lot of interest due to its potential as a biomarker. This is due to several groundbreaking studies identifying 5-hydroxymethylcytosine in mouse embryonic stem (ES) and neuronal cells. Research techniques, including bisulfite sequencing methods, are unable to easily distinguish between 5-mC and 5-hmC. A few protocols exist that can measure global amounts of 5-hydroxymethylcytosine in the genome, including liquid chromatography coupled with mass spectrometry analysis or thin layer chromatography of single nucleosides digested from genomic DNA. Antibodies that target 5-hydroxymethylcytosine also exist, which can be used for dot blot analysis, immunofluorescence, or precipitation of hydroxymethylated DNA, but these antibodies do not have single base resolution. In addition, resolution depends on the size of the immunoprecipitated DNA and, for microarray experiments, on probe design. Since it is unknown exactly where 5-hydroxymethylcytosine exists in the genome or its role in epigenetic regulation, new techniques are required that can identify locus-specific hydroxymethylation. The EpiMark 5-hmC and 5-mC Analysis Kit provides a solution for distinguishing between these two modifications at specific loci. The EpiMark 5-hmC and 5-mC Analysis Kit is a simple and robust method for the identification and quantitation of 5-methylcytosine and 5-hydroxymethylcytosine within a specific DNA locus. This enzymatic approach utilizes the differential methylation sensitivity of the isoschizomers MspI and HpaII in a simple 3-step protocol. Genomic DNA of interest is treated with T4-BGT, adding a glucose moiety to 5-hydroxymethylcytosine. This reaction is sequence-independent, therefore all 5-hmC will be glucosylated; unmodified or 5-mC containing DNA will not be affected. This glucosylation is then followed by restriction endonuclease digestion. MspI and HpaII recognize the same sequence (CCGG) but are sensitive to different methylation states. HpaII cleaves only a completely unmodified site: any modification (5-mC, 5-hmC or 5-ghmC) at either cytosine blocks cleavage. MspI recognizes and cleaves 5-mC and 5-hmC, but not 5-ghmC. The third part of the protocol is interrogation of the locus by PCR. As little as 20 ng of input DNA can be used. Amplification of the experimental (glucosylated and digested) and control (mock glucosylated and digested) target DNA with primers flanking a CCGG site of interest (100-200 bp) is performed. If the CpG site contains 5-hydroxymethylcytosine, a band is detected after glucosylation and digestion, but not in the non-glucosylated control reaction. Real-time PCR will give an approximation of how much hydroxymethylcytosine is in this particular site. In this experiment, we will analyze the 5-hydroxymethylcytosine amount in a mouse Balb/C brain sample by end point PCR.
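The digestion logic described above can be written as a small decision table. The following is a minimal sketch, assuming a single CCGG site whose template molecules all carry the same modification; the function names are hypothetical and are not part of the kit. It encodes only the rules stated in the abstract: T4-BGT converts 5-hmC to 5-ghmC, HpaII cuts only an unmodified site, MspI cuts everything except 5-ghmC, and a PCR product across the site is expected only when the site is left uncut.

```python
def after_glucosylation(state, treated):
    """T4-BGT converts 5-hmC to glucosylated 5-hmC (5-ghmC); other states are unchanged."""
    if treated and state == "5-hmC":
        return "5-ghmC"
    return state


def is_cut(enzyme, state):
    """Return True if the enzyme cleaves a CCGG site carrying the given cytosine state."""
    if enzyme == "HpaII":
        return state == "C"        # any modification blocks HpaII
    if enzyme == "MspI":
        return state != "5-ghmC"   # only the glucosylated form blocks MspI
    if enzyme == "none":
        return False               # undigested control
    raise ValueError(f"unknown enzyme: {enzyme}")


def band_expected(state, treated, enzyme):
    """A PCR product spanning the site is expected only if the site stays uncut."""
    return not is_cut(enzyme, after_glucosylation(state, treated))


# Expected band pattern for each cytosine state, with and without T4-BGT treatment.
table = {
    (state, treated, enzyme): band_expected(state, treated, enzyme)
    for state in ("C", "5-mC", "5-hmC")
    for treated in (True, False)
    for enzyme in ("MspI", "HpaII", "none")
}
```

In this idealized model, the MspI lane of the glucosylated sample shows a band only for 5-hmC, which matches the readout described in the abstract. Real samples are mixtures of modification states, so band intensity rather than simple presence or absence carries the information.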
Neuroscience, Issue 48, EpiMark, Epigenetics, 5-hydroxymethylcytosine, 5-methylcytosine, methylation, hydroxymethylation