As DNA sequencing technology has markedly advanced in recent years2, it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought3. In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease4,5. Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants6-8. However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators.
To address this need, we have developed a pooled sequencing approach1,9 and a novel software package1 for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (https://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA.
21 Related JoVE Articles!
Generation of RNA/DNA Hybrids in Genomic DNA by Transformation using RNA-containing Oligonucleotides
Institutions: Georgia Institute of Technology.
Synthetic short nucleic acid polymers, oligonucleotides (oligos), are the most functional and widespread tools of molecular biology. Oligos can be produced to contain any desired DNA or RNA sequence and can be prepared to include a wide variety of base and sugar modifications. Moreover, oligos can be designed to mimic specific nucleic acid alterations and thus, can serve as important tools to investigate effects of DNA damage and mechanisms of repair. We found that Thermo Scientific Dharmacon RNA-containing oligos with a length between 50 and 80 nucleotides can be particularly suitable to study, in vivo
, functions and consequences of chromosomal RNA/DNA hybrids and of ribonucleotides embedded into DNA. RNA/DNA hybrids can readily form during DNA replication, repair and transcription, however, very little is known about the stability of RNA/DNA hybrids in cells and to which extent these hybrids can affect the genetic integrity of cells. RNA-containing oligos, therefore, represent a perfect vector to introduce ribonucleotides into chromosomal DNA and generate RNA/DNA hybrids of chosen length and base composition. Here we present the protocol for the incorporation of ribonucleotides into the genome of the eukaryotic model system yeast /Saccharomyces cerevisiae
/. Yet, our lab has utilized Thermo Scientific Dharmacon RNA-containing oligos to generate RNA/DNA hybrids at the chromosomal level in different cell systems, from bacteria to human cells.
Cellular Biology, Issue 45, RNA-containing oligonucleotides, ribonucleotides, RNA/DNA hybrids, yeast, transformation, gene targeting, genome instability, DNA repair
Identifying the Effects of BRCA1 Mutations on Homologous Recombination using Cells that Express Endogenous Wild-type BRCA1
Institutions: The Ohio State University, Tohoku University.
The functional analysis of missense mutations can be complicated by the presence in the cell of the endogenous protein. Structure-function analyses of the BRCA1 have been complicated by the lack of a robust assay for the full length BRCA1 protein and the difficulties inherent in working with cell lines that express hypomorphic BRCA1 protein1,2,3,4,5
. We developed a system whereby the endogenous BRCA1 protein in a cell was acutely depleted by RNAi targeting the 3'-UTR of the BRCA1 mRNA and replaced by co-transfecting a plasmid expressing a BRCA1 variant. One advantage of this procedure is that the acute silencing of BRCA1 and simultaneous replacement allow the cells to grow without secondary mutations or adaptations that might arise over time to compensate for the loss of BRCA1 function. This depletion and add-back procedure was done in a HeLa-derived cell line that was readily assayed for homologous recombination activity. The homologous recombination assay is based on a previously published method whereby a recombination substrate is integrated into the genome (Figure 1)6,7,8,9
. This recombination substrate has the rare-cutting I-SceI restriction enzyme site inside an inactive GFP allele, and downstream is a second inactive GFP allele. Transfection of the plasmid that expresses I-SceI results in a double-stranded break, which may be repaired by homologous recombination, and if homologous recombination does repair the break it creates an active GFP allele that is readily scored by flow cytometry for GFP protein expression. Depletion of endogenous BRCA1 resulted in an 8-10-fold reduction in homologous recombination activity, and add-back of wild-type plasmid fully restored homologous recombination function. When specific point mutants of full length BRCA1 were expressed from co-transfected plasmids, the effect of the specific missense mutant could be scored. As an example, the expression of the BRCA1(M18T) protein, a variant of unknown clinical significance10
, was expressed in these cells, it failed to restore BRCA1-dependent homologous recombination. By contrast, expression of another variant, also of unknown significance, BRCA1(I21V) fully restored BRCA1-dependent homologous recombination function. This strategy of testing the function of BRCA1 missense mutations has been applied to another biological system assaying for centrosome function (Kais et al, unpublished observations). Overall, this approach is suitable for the analysis of missense mutants in any gene that must be analyzed recessively.
Cell Biology, Issue 48, BRCA1, homologous recombination, breast cancer, RNA interference, DNA repair
Amplifying and Quantifying HIV-1 RNA in HIV Infected Individuals with Viral Loads Below the Limit of Detection by Standard Clinical Assays
Institutions: NCI-Frederick, University of Pittsburgh, Tuffts University.
Amplifying viral genes and quantifying HIV-1 RNA in HIV-1 infected individuals with viral loads below the limit of detection by standard assays (below 50-75 copies/ml) is necessary to gain insight to viral dynamics and virus host interactions in patients who naturally control the infection and those who are on combination antiretroviral therapy (cART).
Here we describe how to amplify viral genomes by single genome sequencing (the SGS protocol) 13, 19
and how to accurately quantify HIV-1 RNA in patients with low viral loads (the single-copy assay (SCA) protocol) 12, 20
The single-copy assay is a real-time PCR assay with sensitivity depending on the volume of plasma being assayed. If a single virus genome is detected in 7 ml of plasma, then the RNA copy number is reported to be 0.3 copies/ml. The assay has an internal control testing for the efficiency of RNA extraction, and controls for possible amplification from DNA or contamination. Patient samples are measured in triplicate.
The single-genome sequencing assay (SGS), now widely used and considered to be non-labor intensive 3, 7, 12, 14, 15
,is a limiting dilution assay, in which endpoint diluted cDNA product is spread over a 96-well plate. According to a Poisson distribution, when less than 1/3 of the wells give product, there is an 80% chance of the PCR product being resultant of amplification from a single cDNA molecule. SGS has the advantage over cloning of not being subjected to resampling and not being biased by PCR-introduced recombination 19
. However, the amplification success of SCA and SGS depend on primer design. Both assays were developed for HIV-1 subtype B, but can be adapted for other subtypes and other regions of the genome by changing primers, probes, and standards.
Immunology, Issue 55, single genome sequencing, SGS, real-time PCR, single-copy assay, SCA, HIV-1, ultra-sensitive, RNA extraction
Genotyping of Plant and Animal Samples without Prior DNA Purification
Institutions: Thermo Fisher Scientific.
The Direct PCR approach facilitates PCR amplification directly from small amounts of unpurified samples, and is demonstrated here for several plant and animal tissues (Figure 1
). Direct PCR is based on specially engineered Thermo Scientific Phusion and Phire DNA Polymerases, which include a double-stranded DNA binding domain that gives them unique properties such as high tolerance of inhibitors.
PCR-based target DNA detection has numerous applications in plant research, including plant genotype analysis and verification of transgenes. PCR from plant tissues traditionally involves an initial DNA isolation step, which may require expensive or toxic reagents. The process is time consuming and increases the risk of cross contamination1, 2
. Conversely, by using Thermo Scientific Phire Plant Direct PCR Kit the target DNA can be easily detected, without prior DNA extraction. In the model demonstrated here, an example of derived cleaved amplified polymorphic sequence analysis (dCAPS)3,4
is performed directly from Arabidopsis
plant leaves. dCAPS genotyping assays can be used to identify single nucleotide polymorphisms (SNPs) by SNP allele-specific restriction endonuclease digestion3
Some plant samples tend to be more challenging when using Direct PCR methods as they contain components that interfere with PCR, such as phenolic compounds. In these cases, an additional step to remove the compounds is traditionally required2,5
. Here, this problem is overcome by using a quick and easy dilution protocol followed by Direct PCR amplification (Figure 1
). Fifteen year-old oak leaves are used as a model for challenging plants as the specimen contains high amounts of phenolic compounds including tannins.
Gene transfer into mice is broadly used to study the roles of genes in development, physiology and human disease. The use of these animals requires screening for the presence of the transgene, usually with PCR. Traditionally, this involves a time consuming DNA isolation step, during which DNA for PCR analysis is purified from ear, tail or toe tissues6,7
. However, with the Thermo Scientific Phire Animal Tissue Direct PCR Kit transgenic mice can be genotyped without prior DNA purification. In this protocol transgenic mouse genotyping is achieved directly from mouse ear tissues, as demonstrated here for a challenging example where only one primer set is used for amplification of two fragments differing greatly in size.
Genetics, Issue 67, Molecular Biology, Plant Biology, Medicine, Direct PCR, DNA amplification, DNA purification, dCAPS, PCR-based target DNA detection, genotyping, Arabidopsis, oak, mouse tissues
Examination of Thymic Positive and Negative Selection by Flow Cytometry
Institutions: University of Alberta.
A healthy immune system requires that T cells respond to foreign antigens while remaining tolerant to self-antigens. Random rearrangement of the T cell receptor (TCR) α and β loci generates a T cell repertoire with vast diversity in antigen specificity, both to self and foreign. Selection of the repertoire during development in the thymus is critical for generating safe and useful T cells. Defects in thymic selection contribute to the development of autoimmune and immunodeficiency disorders1-4
T cell progenitors enter the thymus as double negative (DN) thymocytes that do not express CD4 or CD8 co-receptors. Expression of the αβTCR and both co-receptors occurs at the double positive (DP) stage. Interaction of the αβTCR with self-peptide-MHC (pMHC) presented by thymic cells determines the fate of the DP thymocyte. High affinity interactions lead to negative selection and elimination of self-reactive thymocytes. Low affinity interactions result in positive selection and development of CD4 or CD8 single positive (SP) T cells capable of recognizing foreign antigens presented by self-MHC5
Positive selection can be studied in mice with a polyclonal (wildtype) TCR repertoire by observing the generation of mature T cells. However, they are not ideal for the study of negative selection, which involves deletion of small antigen-specific populations. Many model systems have been used to study negative selection but vary in their ability to recapitulate physiological events6
. For example, in vitro
stimulation of thymocytes lacks the thymic environment that is intimately involved in selection, while administration of exogenous antigen can lead to non-specific deletion of thymocytes7-9
. Currently, the best tools for studying in vivo
negative selection are mice that express a transgenic TCR specific for endogenous self-antigen. However, many classical TCR transgenic models are characterized by premature expression of the transgenic TCRα chain at the DN stage, resulting in premature negative selection. Our lab has developed the HYcd4
model, in which the transgenic HY TCRα is conditionally expressed at the DP stage, allowing negative selection to occur during the DP to SP transition as occurs in wildtype mice10
Here, we describe a flow cytometry-based protocol to examine thymic positive and negative selection in the HYcd4
mouse model. While negative selection in HYcd4
mice is highly physiological, these methods can also be applied to other TCR transgenic models. We will also present general strategies for analyzing positive selection in a polyclonal repertoire applicable to any genetically manipulated mice.
Immunology, Issue 68, Medicine, Cellular Biology, Anatomy, Physiology, Thymus, T cell, negative selection, positive selection, autoimmunity, flow cytometry
Identifying DNA Mutations in Purified Hematopoietic Stem/Progenitor Cells
Institutions: UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio, UT Health Science Center at San Antonio.
In recent years, it has become apparent that genomic instability is tightly related to many developmental disorders, cancers, and aging. Given that stem cells are responsible for ensuring tissue homeostasis and repair throughout life, it is reasonable to hypothesize that the stem cell population is critical for preserving genomic integrity of tissues. Therefore, significant interest has arisen in assessing the impact of endogenous and environmental factors on genomic integrity in stem cells and their progeny, aiming to understand the etiology of stem-cell based diseases.
transgenic mice carry a recoverable λ phage vector encoding the LacI
reporter system, in which the LacI
gene serves as the mutation reporter. The result of a mutated LacI
gene is the production of β-galactosidase that cleaves a chromogenic substrate, turning it blue. The LacI
reporter system is carried in all cells, including stem/progenitor cells and can easily be recovered and used to subsequently infect E. coli
. After incubating infected E. coli
on agarose that contains the correct substrate, plaques can be scored; blue plaques indicate a mutant LacI
gene, while clear plaques harbor wild-type. The frequency of blue (among clear) plaques indicates the mutant frequency in the original cell population the DNA was extracted from. Sequencing the mutant LacI
gene will show the location of the mutations in the gene and the type of mutation.
transgenic mouse model is well-established as an in vivo
mutagenesis assay. Moreover, the mice and the reagents for the assay are commercially available. Here we describe in detail how this model can be adapted to measure the frequency of spontaneously occurring DNA mutants in stem cell-enriched Lin-
(LSK) cells and other subpopulations of the hematopoietic system.
Infection, Issue 84, In vivo mutagenesis, hematopoietic stem/progenitor cells, LacI mouse model, DNA mutations, E. coli
Isolation of Fidelity Variants of RNA Viruses and Characterization of Virus Mutation Frequency
Institutions: Institut Pasteur .
RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3
. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7
Immunology, Issue 52, Polymerase fidelity, RNA virus, mutation frequency, mutagen, RNA polymerase, viral evolution
The Green Monster Process for the Generation of Yeast Strains Carrying Multiple Gene Deletions
Institutions: J. Craig Venter Institute, J. Craig Venter Institute, University of Toronto, Mt Sinai Hospital.
Phenotypes for a gene deletion are often revealed only when the mutation is tested in a particular genetic background or environmental condition1,2
. There are examples where many genes need to be deleted to unmask hidden gene functions3,4
. Despite the potential for important discoveries, genetic interactions involving three or more genes are largely unexplored. Exhaustive searches of multi-mutant interactions would be impractical due to the sheer number of possible combinations of deletions. However, studies of selected sets of genes, such as sets of paralogs with a greater a priori
chance of sharing a common function, would be informative.
In the yeast Saccharomyces cerevisiae
, gene knockout is accomplished by replacing a gene with a selectable marker via homologous recombination. Because the number of markers is limited, methods have been developed for removing and reusing the same marker5,6,7,8,9,10
. However, sequentially engineering multiple mutations using these methods is time-consuming because the time required scales linearly with the number of deletions to be generated.
Here we describe the Green Monster method for routinely engineering multiple deletions in yeast11
. In this method, a green fluorescent protein (GFP) reporter integrated into deletions is used to quantitatively label strains according to the number of deletions contained in each strain (Figure 1
). Repeated rounds of assortment of GFP-marked deletions via yeast mating and meiosis coupled with flow-cytometric enrichment of strains carrying more of these deletions lead to the accumulation of deletions in strains (Figure 2
). Performing multiple processes in parallel, with each process incorporating one or more deletions per round, reduces the time required for strain construction.
The first step is to prepare haploid single-mutants termed 'ProMonsters,' each of which carries a GFP reporter in a deleted locus and one of the 'toolkit' loci—either Green Monster GMToolkit-a
or GMToolkit-α at the can1Δ
locus (Figure 3
). Using strains from the yeast deletion collection12
, GFP-marked deletions can be conveniently generated by replacing the common KanMX4
cassette existing in these strains with a universal GFP
fragment. Each GMToolkit contains: either the a
- or α-mating-type-specific haploid selection marker1
and exactly one of the two markers that, when both GMToolkits are present, collectively allow for selection of diploids.
The second step is to carry out the sexual cycling through which deletion loci can be combined within a single cell by the random assortment and/or meiotic recombination that accompanies each cycle of mating and sporulation.
Microbiology, Issue 70, Genetics, Synthetic Biology, Environmental Genomics, Genomics, Bioengineering, Biomedical Engineering, Cellular Biology, Multi-site genomic engineering, genetic interaction, green fluorescent protein, GFP, flow cytometry, Saccharomyces cerevisiae, yeast, Green Monster
Inhibitory Synapse Formation in a Co-culture Model Incorporating GABAergic Medium Spiny Neurons and HEK293 Cells Stably Expressing GABAA Receptors
Institutions: University College London.
Inhibitory neurons act in the central nervous system to regulate the dynamics and spatio-temporal co-ordination of neuronal networks. GABA (γ-aminobutyric acid) is the predominant inhibitory neurotransmitter in the brain. It is released from the presynaptic terminals of inhibitory neurons within highly specialized intercellular junctions known as synapses, where it binds to GABAA
Rs) present at the plasma membrane of the synapse-receiving, postsynaptic neurons. Activation of these GABA-gated ion channels leads to influx of chloride resulting in postsynaptic potential changes that decrease the probability that these neurons will generate action potentials.
During development, diverse types of inhibitory neurons with distinct morphological, electrophysiological and neurochemical characteristics have the ability to recognize their target neurons and form synapses which incorporate specific GABAA
Rs subtypes. This principle of selective innervation of neuronal targets raises the question as to how the appropriate synaptic partners identify each other.
To elucidate the underlying molecular mechanisms, a novel in vitro
co-culture model system was established, in which medium spiny GABAergic neurons, a highly homogenous population of neurons isolated from the embryonic striatum, were cultured with stably transfected HEK293 cell lines that express different GABAA
R subtypes. Synapses form rapidly, efficiently and selectively in this system, and are easily accessible for quantification. Our results indicate that various GABAA
R subtypes differ in their ability to promote synapse formation, suggesting that this reduced in vitro
model system can be used to reproduce, at least in part, the in vivo
conditions required for the recognition of the appropriate synaptic partners and formation of specific synapses. Here the protocols for culturing the medium spiny neurons and generating HEK293 cells lines expressing GABAA
Rs are first described, followed by detailed instructions on how to combine these two cell types in co-culture and analyze the formation of synaptic contacts.
Neuroscience, Issue 93, Developmental neuroscience, synaptogenesis, synaptic inhibition, co-culture, stable cell lines, GABAergic, medium spiny neurons, HEK 293 cell line
An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations
Institutions: University of Oxford.
The number of significant genetic associations with common complex traits is constantly increasing. However, most of these associations have not been understood at molecular level. One of the mechanisms mediating the effect of DNA variants on phenotypes is gene expression, which has been shown to be particularly relevant for complex traits1
This method tests in a cellular context the effect of specific DNA sequences on gene expression. The principle is to measure the relative abundance of transcripts arising from the two alleles of a gene, analysing cells which carry one copy of the DNA sequences associated with disease (the risk variants)2,3
. Therefore, the cells used for this method should meet two fundamental genotypic requirements: they have to be heterozygous both for DNA risk variants and for DNA markers, typically coding polymorphisms, which can distinguish transcripts based on their chromosomal origin (Figure 1). DNA risk variants and DNA markers do not need to have the same allele frequency but the phase (haplotypic) relationship of the genetic markers needs to be understood. It is also important to choose cell types which express the gene of interest. This protocol refers specifically to the procedure adopted to extract nucleic acids from fibroblasts but the method is equally applicable to other cells types including primary cells.
DNA and RNA are extracted from the selected cell lines and cDNA is generated. DNA and cDNA are analysed with a primer extension assay, designed to target the coding DNA markers4
. The primer extension assay is carried out using the MassARRAY (Sequenom)5
platform according to the manufacturer's specifications. Primer extension products are then analysed by matrix-assisted laser desorption/ionization time of-flight mass spectrometry (MALDI-TOF/MS). Because the selected markers are heterozygous they will generate two peaks on the MS profiles. The area of each peak is proportional to the transcript abundance and can be measured with a function of the MassARRAY Typer software to generate an allelic ratio (allele 1: allele 2) calculation. The allelic ratio obtained for cDNA is normalized using that measured from genomic DNA, where the allelic ratio is expected to be 1:1 to correct for technical artifacts. Markers with a normalised allelic ratio significantly different to 1 indicate that the amount of transcript generated from the two chromosomes in the same cell is different, suggesting that the DNA variants associated with the phenotype have an effect on gene expression. Experimental controls should be used to confirm the results.
Cellular Biology, Issue 45, Gene expression, regulatory variant, haplotype, association study, primer extension, MALDI-TOF mass spectrometry, single nucleotide polymorphism, allele-specific
Label-free Isolation and Enrichment of Cells Through Contactless Dielectrophoresis
Institutions: Virginia Tech, Virginia Tech.
Dielectrophoresis (DEP) is the phenomenon by which polarized particles in a non-uniform electric field undergo translational motion, and can be used to direct the motion of microparticles in a surface marker-independent manner. Traditionally, DEP devices include planar metallic electrodes patterned in the sample channel. This approach can be expensive and requires a specialized cleanroom environment. Recently, a contact-free approach called contactless dielectrophoresis (cDEP) has been developed. This method utilizes the classic principle of DEP while avoiding direct contact between electrodes and sample by patterning fluidic electrodes and a sample channel from a single polydimethylsiloxane (PDMS) substrate, and has application as a rapid microfluidic strategy designed to sort and enrich microparticles. Unique to this method is that the electric field is generated via fluidic electrode channels containing a highly conductive fluid, which are separated from the sample channel by a thin insulating barrier. Because metal electrodes do not directly contact the sample, electrolysis, electrode delamination, and sample contamination are avoided. Additionally, this enables an inexpensive and simple fabrication process.
cDEP is thus well-suited for manipulating sensitive biological particles. The dielectrophoretic force acting upon the particles depends not only upon spatial gradients of the electric field generated by customizable design of the device geometry, but the intrinsic biophysical properties of the cell. As such, cDEP is a label-free technique that avoids depending upon surface-expressed molecular biomarkers that may be variably expressed within a population, while still allowing characterization, enrichment, and sorting of bioparticles.
Here, we demonstrate the basics of fabrication and experimentation using cDEP. We explain the simple preparation of a cDEP chip using soft lithography techniques. We discuss the experimental procedure for characterizing crossover frequency of a particle or cell, the frequency at which the dielectrophoretic force is zero. Finally, we demonstrate the use of this technique for sorting a mixture of ovarian cancer cells and fluorescing microspheres (beads).
Biomedical Engineering, Issue 79, Medicine, Cellular Biology, Molecular Biology, Bioengineering, Anatomy, Physiology, Biophysics, Physics, Microfluidics, Cell Separation, Microfluidic Analytical Techniques, Electrophoresis, Microchip, cancer diagnosis, cell enrichment, cell sorting, microfluidics, dielectrophoresis, Lab on a chip, cells, imaging
Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
Institutions: Memorial Sloan-Kettering Cancer Center, Memorial Sloan-Kettering Cancer Center.
Efforts to detect and investigate key oncogenic mutations have proven valuable to facilitate the appropriate treatment for cancer patients. The establishment of high-throughput, massively parallel "next-generation" sequencing has aided the discovery of many such mutations. To enhance the clinical and translational utility of this technology, platforms must be high-throughput, cost-effective, and compatible with formalin-fixed paraffin embedded (FFPE) tissue samples that may yield small amounts of degraded or damaged DNA. Here, we describe the preparation of barcoded and multiplexed DNA libraries followed by hybridization-based capture of targeted exons for the detection of cancer-associated mutations in fresh frozen and FFPE tumors by massively parallel sequencing. This method enables the identification of sequence mutations, copy number alterations, and select structural rearrangements involving all targeted genes. Targeted exon sequencing offers the benefits of high throughput, low cost, and deep sequence coverage, thus conferring high sensitivity for detecting low frequency mutations.
Molecular Biology, Issue 80, Molecular Diagnostic Techniques, High-Throughput Nucleotide Sequencing, Genetics, Neoplasms, Diagnosis, Massively parallel sequencing, targeted exon sequencing, hybridization capture, cancer, FFPE, DNA mutations
Quantitation and Analysis of the Formation of HO-Endonuclease Stimulated Chromosomal Translocations by Single-Strand Annealing in Saccharomyces cerevisiae
Institutions: Irell & Manella Graduate School of Biological Sciences, City of Hope Comprehensive Cancer Center and Beckman Research Institute, University of Southern California, Norris Comprehensive Cancer Center.
Genetic variation is frequently mediated by genomic rearrangements that arise through interaction between dispersed repetitive elements present in every eukaryotic genome. This process is an important mechanism for generating diversity between and within organisms1-3
. The human genome consists of approximately 40% repetitive sequence of retrotransposon origin, including a variety of LINEs and SINEs4
. Exchange events between these repetitive elements can lead to genome rearrangements, including translocations, that can disrupt gene dosage and expression that can result in autoimmune and cardiovascular diseases5
, as well as cancer in humans6-9
Exchange between repetitive elements occurs in a variety of ways. Exchange between sequences that share perfect (or near-perfect) homology occurs by a process called homologous recombination (HR). By contrast, non-homologous end joining (NHEJ) uses little-or-no sequence homology for exchange10,11
. The primary purpose of HR, in mitotic cells, is to repair double-strand breaks (DSBs) generated endogenously by aberrant DNA replication and oxidative lesions, or by exposure to ionizing radiation (IR), and other exogenous DNA damaging agents.
In the assay described here, DSBs are simultaneously created bordering recombination substrates at two different chromosomal loci in diploid cells by a galactose-inducible HO-endonuclease (Figure 1
). The repair of the broken chromosomes generates chromosomal translocations by single strand annealing (SSA), a process where homologous sequences adjacent to the chromosome ends are covalently joined subsequent to annealing. One of the substrates, his3-Δ3'
, contains a 3' truncated HIS3
allele and is located on one copy of chromosome XV at the native HIS3
locus. The second substrate, his3-Δ5'
, is located at the LEU2
locus on one copy of chromosome III, and contains a 5' truncated HIS3
allele. Both substrates are flanked by a HO endonuclease recognition site that can be targeted for incision by HO-endonuclease. HO endonuclease recognition sites native to the MAT
locus, on both copies of chromosome III, have been deleted in all strains. This prevents interaction between the recombination substrates and other broken chromosome ends from interfering in the assay. The KAN-MX
-marked galactose-inducible HO endonuclease expression cassette is inserted at the TRP1
locus on chromosome IV. The substrates share 311 bp or 60 bp of the HIS3
coding sequence that can be used by the HR machinery for repair by SSA. Cells that use these substrates to repair broken chromosomes by HR form an intact HIS3
allele and a tXV::III chromosomal translocation that can be selected for by the ability to grow on medium lacking histidine (Figure 2A
). Translocation frequency by HR is calculated by dividing the number of histidine prototrophic colonies that arise on selective medium by the total number of viable cells that arise after plating appropriate dilutions onto non-selective medium (Figure 2B
). A variety of DNA repair mutants have been used to study the genetic control of translocation formation by SSA using this system12-14
Genetics, Issue 55, translocation formation, HO-endonuclease, Genomic Southern blot, Chromosome blot, Pulsed-field gel electrophoresis, Homologous recombination, DNA double-strand breaks, Single-strand annealing
An Allelotyping PCR for Identifying Salmonella enterica serovars Enteritidis, Hadar, Heidelberg, and Typhimurium
Institutions: University of Georgia.
Current commercial PCRs tests for identifying Salmonella
target genes unique to this genus. However, there are two species, six subspecies, and over 2,500 different Salmonella
serovars, and not all are equal in their significance to public health. For example, finding S. enterica subspecies
IIIa Arizona on a table egg layer farm is insignificant compared to the isolation of S. enterica
subspecies I serovar Enteritidis, the leading cause of salmonellosis linked to the consumption of table eggs. Serovars are identified based on antigenic differences in lipopolysaccharide (LPS)(O antigen) and flagellin (H1 and H2 antigens). These antigenic differences are the outward appearance of the diversity of genes and gene alleles associated with this phenotype.
We have developed an allelotyping, multiplex PCR that keys on genetic differences between four major S. enterica
subspecies I serovars found in poultry and associated with significant human disease in the US. The PCR primer pairs were targeted to key genes or sequences unique to a specific Salmonella
serovar and designed to produce an amplicon with size specific for that gene or allele. Salmonella
serovar is assigned to an isolate based on the combination of PCR test results for specific LPS and flagellin gene alleles. The multiplex PCRs described in this article are specific for the detection of S. enterica
subspecies I serovars Enteritidis, Hadar, Heidelberg, and Typhimurium.
Here we demonstrate how to use the multiplex PCRs to identify serovar for a Salmonella
Immunology, Issue 53, PCR, Salmonella, multiplex, Serovar
Genetic Manipulation in Δku80 Strains for Functional Genomic Analysis of Toxoplasma gondii
Institutions: The Geisel School of Medicine at Dartmouth.
Targeted genetic manipulation using homologous recombination is the method of choice for functional genomic analysis to obtain a detailed view of gene function and phenotype(s). The development of mutant strains with targeted gene deletions, targeted mutations, complemented gene function, and/or tagged genes provides powerful strategies to address gene function, particularly if these genetic manipulations can be efficiently targeted to the gene locus of interest using integration mediated by double cross over homologous recombination.
Due to very high rates of nonhomologous recombination, functional genomic analysis of Toxoplasma gondii
has been previously limited by the absence of efficient methods for targeting gene deletions and gene replacements to specific genetic loci. Recently, we abolished the major pathway of nonhomologous recombination in type I and type II strains of T. gondii
by deleting the gene encoding the KU80 protein1,2
. The Δku80
strains behave normally during tachyzoite (acute) and bradyzoite (chronic) stages in vitro
and in vivo
and exhibit essentially a 100% frequency of homologous recombination. The Δku80
strains make functional genomic studies feasible on the single gene as well as on the genome scale1-4
Here, we report methods for using type I and type II Δku80Δhxgprt
strains to advance gene targeting approaches in T. gondii
. We outline efficient methods for generating gene deletions, gene replacements, and tagged genes by targeted insertion or deletion of the hypoxanthine-xanthine-guanine phosphoribosyltransferase (HXGPRT
) selectable marker. The described gene targeting protocol can be used in a variety of ways in Δku80
strains to advance functional analysis of the parasite genome and to develop single strains that carry multiple targeted genetic manipulations. The application of this genetic method and subsequent phenotypic assays will reveal fundamental and unique aspects of the biology of T. gondii
and related significant human pathogens that cause malaria (Plasmodium
sp.) and cryptosporidiosis (Cryptosporidium
Infectious Diseases, Issue 77, Genetics, Microbiology, Infection, Medicine, Immunology, Molecular Biology, Cellular Biology, Biomedical Engineering, Bioengineering, Genomics, Parasitology, Pathology, Apicomplexa, Coccidia, Toxoplasma, Genetic Techniques, Gene Targeting, Eukaryota, Toxoplasma gondii, genetic manipulation, gene targeting, gene deletion, gene replacement, gene tagging, homologous recombination, DNA, sequencing
Mapping Bacterial Functional Networks and Pathways in Escherichia Coli using Synthetic Genetic Arrays
Institutions: University of Toronto, University of Toronto, University of Regina.
Phenotypes are determined by a complex series of physical (e.g.
protein-protein) and functional (e.g.
gene-gene or genetic) interactions (GI)1
. While physical interactions can indicate which bacterial proteins are associated as complexes, they do not necessarily reveal pathway-level functional relationships1. GI screens, in which the growth of double mutants bearing two deleted or inactivated genes is measured and compared to the corresponding single mutants, can illuminate epistatic dependencies between loci and hence provide a means to query and discover novel functional relationships2
. Large-scale GI maps have been reported for eukaryotic organisms like yeast3-7
, but GI information remains sparse for prokaryotes8
, which hinders the functional annotation of bacterial genomes. To this end, we and others have developed high-throughput quantitative bacterial GI screening methods9, 10
Here, we present the key steps required to perform quantitative E. coli
Synthetic Genetic Array (eSGA) screening procedure on a genome-scale9
, using natural bacterial conjugation and homologous recombination to systemically generate and measure the fitness of large numbers of double mutants in a colony array format.
Briefly, a robot is used to transfer, through conjugation, chloramphenicol (Cm) - marked mutant alleles from engineered Hfr (High frequency of recombination) 'donor strains' into an ordered array of kanamycin (Kan) - marked F- recipient strains. Typically, we use loss-of-function single mutants bearing non-essential gene deletions (e.g.
the 'Keio' collection11
) and essential gene hypomorphic mutations (i.e.
alleles conferring reduced protein expression, stability, or activity9, 12, 13
) to query the functional associations of non-essential and essential genes, respectively. After conjugation and ensuing genetic exchange mediated by homologous recombination, the resulting double mutants are selected on solid medium containing both antibiotics. After outgrowth, the plates are digitally imaged and colony sizes are quantitatively scored using an in-house automated image processing system14
. GIs are revealed when the growth rate of a double mutant is either significantly better or worse than expected9
. Aggravating (or negative) GIs often result between loss-of-function mutations in pairs of genes from compensatory pathways that impinge on the same essential process2
. Here, the loss of a single gene is buffered, such that either single mutant is viable. However, the loss of both pathways is deleterious and results in synthetic lethality or sickness (i.e.
slow growth). Conversely, alleviating (or positive) interactions can occur between genes in the same pathway or protein complex2
as the deletion of either gene alone is often sufficient to perturb the normal function of the pathway or complex such that additional perturbations do not reduce activity, and hence growth, further. Overall, systematically identifying and analyzing GI networks can provide unbiased, global maps of the functional relationships between large numbers of genes, from which pathway-level information missed by other approaches can be inferred9
Genetics, Issue 69, Molecular Biology, Medicine, Biochemistry, Microbiology, Aggravating, alleviating, conjugation, double mutant, Escherichia coli, genetic interaction, Gram-negative bacteria, homologous recombination, network, synthetic lethality or sickness, suppression
Combining Magnetic Sorting of Mother Cells and Fluctuation Tests to Analyze Genome Instability During Mitotic Cell Aging in Saccharomyces cerevisiae
Institutions: Rensselaer Polytechnic Institute.
has been an excellent model system for examining mechanisms and consequences of genome instability. Information gained from this yeast model is relevant to many organisms, including humans, since DNA repair and DNA damage response factors are well conserved across diverse species. However, S. cerevisiae
has not yet been used to fully address whether the rate of accumulating mutations changes with increasing replicative (mitotic) age due to technical constraints. For instance, measurements of yeast replicative lifespan through micromanipulation involve very small populations of cells, which prohibit detection of rare mutations. Genetic methods to enrich for mother cells in populations by inducing death of daughter cells have been developed, but population sizes are still limited by the frequency with which random mutations that compromise the selection systems occur. The current protocol takes advantage of magnetic sorting of surface-labeled yeast mother cells to obtain large enough populations of aging mother cells to quantify rare mutations through phenotypic selections. Mutation rates, measured through fluctuation tests, and mutation frequencies are first established for young cells and used to predict the frequency of mutations in mother cells of various replicative ages. Mutation frequencies are then determined for sorted mother cells, and the age of the mother cells is determined using flow cytometry by staining with a fluorescent reagent that detects bud scars formed on their cell surfaces during cell division. Comparison of predicted mutation frequencies based on the number of cell divisions to the frequencies experimentally observed for mother cells of a given replicative age can then identify whether there are age-related changes in the rate of accumulating mutations. Variations of this basic protocol provide the means to investigate the influence of alterations in specific gene functions or specific environmental conditions on mutation accumulation to address mechanisms underlying genome instability during replicative aging.
Microbiology, Issue 92, Aging, mutations, genome instability, Saccharomyces cerevisiae, fluctuation test, magnetic sorting, mother cell, replicative aging
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
Infinium Assay for Large-scale SNP Genotyping Applications
Institutions: Oklahoma Medical Research Foundation.
Genotyping variants in the human genome has proven to be an efficient method to identify genetic associations with phenotypes. The distribution of variants within families or populations can facilitate identification of the genetic factors of disease. Illumina's panel of genotyping BeadChips allows investigators to genotype thousands or millions of single nucleotide polymorphisms (SNPs) or to analyze other genomic variants, such as copy number, across a large number of DNA samples. These SNPs can be spread throughout the genome or targeted in specific regions in order to maximize potential discovery. The Infinium assay has been optimized to yield high-quality, accurate results quickly. With proper setup, a single technician can process from a few hundred to over a thousand DNA samples per week, depending on the type of array. This assay guides users through every step, starting with genomic DNA and ending with the scanning of the array. Using propriety reagents, samples are amplified, fragmented, precipitated, resuspended, hybridized to the chip, extended by a single base, stained, and scanned on either an iScan or Hi Scan high-resolution optical imaging system. One overnight step is required to amplify the DNA. The DNA is denatured and isothermally amplified by whole-genome amplification; therefore, no PCR is required. Samples are hybridized to the arrays during a second overnight step. By the third day, the samples are ready to be scanned and analyzed. Amplified DNA may be stockpiled in large quantities, allowing bead arrays to be processed every day of the week, thereby maximizing throughput.
Basic Protocol, Issue 81, genomics, SNP, Genotyping, Infinium, iScan, HiScan, Illumina
An Affordable HIV-1 Drug Resistance Monitoring Method for Resource Limited Settings
Institutions: University of KwaZulu-Natal, Durban, South Africa, Jembi Health Systems, University of Amsterdam, Stanford Medical School.
HIV-1 drug resistance has the potential to seriously compromise the effectiveness and impact of antiretroviral therapy (ART). As ART programs in sub-Saharan Africa continue to expand, individuals on ART should be closely monitored for the emergence of drug resistance. Surveillance of transmitted drug resistance to track transmission of viral strains already resistant to ART is also critical. Unfortunately, drug resistance testing is still not readily accessible in resource limited settings, because genotyping is expensive and requires sophisticated laboratory and data management infrastructure. An open access genotypic drug resistance monitoring method to manage individuals and assess transmitted drug resistance is described. The method uses free open source software for the interpretation of drug resistance patterns and the generation of individual patient reports. The genotyping protocol has an amplification rate of greater than 95% for plasma samples with a viral load >1,000 HIV-1 RNA copies/ml. The sensitivity decreases significantly for viral loads <1,000 HIV-1 RNA copies/ml. The method described here was validated against a method of HIV-1 drug resistance testing approved by the United States Food and Drug Administration (FDA), the Viroseq genotyping method. Limitations of the method described here include the fact that it is not automated and that it also failed to amplify the circulating recombinant form CRF02_AG from a validation panel of samples, although it amplified subtypes A and B from the same panel.
Medicine, Issue 85, Biomedical Technology, HIV-1, HIV Infections, Viremia, Nucleic Acids, genetics, antiretroviral therapy, drug resistance, genotyping, affordable
Pyrosequencing: A Simple Method for Accurate Genotyping
Institutions: Washington University in St. Louis.
Pharmacogenetic research benefits first-hand from the abundance of information provided by the completion of the Human Genome Project. With such a tremendous amount of data available comes an explosion of genotyping methods. Pyrosequencing(R) is one of the most thorough yet simple methods to date used to analyze polymorphisms. It also has the ability to identify tri-allelic, indels, short-repeat polymorphisms, along with determining allele percentages for methylation or pooled sample assessment. In addition, there is a standardized control sequence that provides internal quality control. This method has led to rapid and efficient single-nucleotide polymorphism evaluation including many clinically relevant polymorphisms. The technique and methodology of Pyrosequencing is explained.
Cellular Biology, Issue 11, Springer Protocols, Pyrosequencing, genotype, polymorphism, SNP, pharmacogenetics, pharmacogenomics, PCR