RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7.
18 Related JoVE Articles!
A Quantitative Fitness Analysis Workflow
Institutions: Newcastle University Medical School.
Quantitative Fitness Analysis (QFA) is an experimental and computational workflow for comparing fitnesses of microbial cultures grown in parallel1,2,3,4
. QFA can be applied to focused observations of single cultures but is most useful for genome-wide genetic interaction or drug screens investigating up to thousands of independent cultures. The central experimental method is the inoculation of independent, dilute liquid microbial cultures onto solid agar plates which are incubated and regularly photographed. Photographs from each time-point are analyzed, producing quantitative cell density estimates, which are used to construct growth curves, allowing quantitative fitness measures to be derived. Culture fitnesses can be compared to quantify and rank genetic interaction strengths or drug sensitivities. The effect on culture fitness of any treatments added into substrate agar (e.g.
small molecules, antibiotics or nutrients) or applied to plates externally (e.g.
UV irradiation, temperature) can be quantified by QFA.
The QFA workflow produces growth rate estimates analogous to those obtained by spectrophotometric measurement of parallel liquid cultures in 96-well or 200-well plate readers. Importantly, QFA has significantly higher throughput compared with such methods. QFA cultures grow on a solid agar surface and are therefore well aerated during growth without the need for stirring or shaking.
QFA throughput is not as high as that of some Synthetic Genetic Array (SGA) screening methods5,6
. However, since QFA cultures are heavily diluted before being inoculated onto agar, QFA can capture more complete growth curves, including exponential and saturation phases3
. For example, growth curve observations allow culture doubling times to be estimated directly with high precision, as discussed previously1
Here we present a specific QFA protocol applied to thousands of S. cerevisiae
cultures which are automatically handled by robots during inoculation, incubation and imaging. Any of these automated steps can be replaced by an equivalent, manual procedure, with an associated reduction in throughput, and we also present a lower throughput manual protocol. The same QFA software tools can be applied to images captured in either workflow.
We have extensive experience applying QFA to cultures of the budding yeast S. cerevisiae
but we expect that QFA will prove equally useful for examining cultures of the fission yeast S. pombe
and bacterial cultures.
Physiology, Issue 66, Medicine, Robotic, microbial, culture, yeast, array, library, high-throughput, analysis, fitness, growth rate, quantitative, solid agar
Isolating Potentiated Hsp104 Variants Using Yeast Proteinopathy Models
Institutions: Perelman School of Medicine at the University of Pennsylvania.
Many protein-misfolding disorders can be modeled in the budding yeast Saccharomyces cerevisiae
. Proteins such as TDP-43 and FUS, implicated in amyotrophic lateral sclerosis, and α-synuclein, implicated in Parkinson’s disease, are toxic and form cytoplasmic aggregates in yeast. These features recapitulate protein pathologies observed in patients with these disorders. Thus, yeast are an ideal platform for isolating toxicity suppressors from libraries of protein variants. We are interested in applying protein disaggregases to eliminate misfolded toxic protein conformers. Specifically, we are engineering Hsp104, a hexameric AAA+ protein from yeast that is uniquely capable of solubilizing both disordered aggregates and amyloid and returning the proteins to their native conformations. While Hsp104 is highly conserved in eukaryotes and eubacteria, it has no known metazoan homologue. Hsp104 has only limited ability to eliminate disordered aggregates and amyloid fibers implicated in human disease. Thus, we aim to engineer Hsp104 variants to reverse the protein misfolding implicated in neurodegenerative disorders. We have developed methods to screen large libraries of Hsp104 variants for suppression of proteotoxicity in yeast. As yeast are prone to spontaneous nonspecific suppression of toxicity, a two-step screening process has been developed to eliminate false positives. Using these methods, we have identified a series of potentiated Hsp104 variants that potently suppress the toxicity and aggregation of TDP-43, FUS, and α-synuclein. Here, we describe this optimized protocol, which could be adapted to screen libraries constructed using any protein backbone for suppression of toxicity of any protein that is toxic in yeast.
Microbiology, Issue 93, Protein-misfolding disorders, yeast proteinopathy models, Hsp104, proteotoxicity, amyloid, disaggregation
Competitive Genomic Screens of Barcoded Yeast Libraries
Institutions: University of Toronto, University of Toronto, University of Toronto, National Human Genome Research Institute, NIH, Stanford University , University of Toronto.
By virtue of advances in next generation sequencing technologies, we have access to new genome sequences almost daily. The tempo of these advances is accelerating, promising greater depth and breadth. In light of these extraordinary advances, the need for fast, parallel methods to define gene function becomes ever more important. Collections of genome-wide deletion mutants in yeasts and E. coli
have served as workhorses for functional characterization of gene function, but this approach is not scalable, current gene-deletion approaches require each of the thousands of genes that comprise a genome to be deleted and verified. Only after this work is complete can we pursue high-throughput phenotyping. Over the past decade, our laboratory has refined a portfolio of competitive, miniaturized, high-throughput genome-wide assays that can be performed in parallel. This parallelization is possible because of the inclusion of DNA 'tags', or 'barcodes,' into each mutant, with the barcode serving as a proxy for the mutation and one can measure the barcode abundance to assess mutant fitness. In this study, we seek to fill the gap between DNA sequence and barcoded mutant collections. To accomplish this we introduce a combined transposon disruption-barcoding approach that opens up parallel barcode assays to newly sequenced, but poorly characterized microbes. To illustrate this approach we present a new Candida albicans
barcoded disruption collection and describe how both microarray-based and next generation sequencing-based platforms can be used to collect 10,000 - 1,000,000 gene-gene and drug-gene interactions in a single experiment.
Biochemistry, Issue 54, chemical biology, chemogenomics, chemical probes, barcode microarray, next generation sequencing
Large-scale Gene Knockdown in C. elegans Using dsRNA Feeding Libraries to Generate Robust Loss-of-function Phenotypes
Institutions: University of Massachusetts, Amherst, University of Massachusetts, Amherst, University of Massachusetts, Amherst.
RNA interference by feeding worms bacteria expressing dsRNAs has been a useful tool to assess gene function in C. elegans
. While this strategy works well when a small number of genes are targeted for knockdown, large scale feeding screens show variable knockdown efficiencies, which limits their utility. We have deconstructed previously published RNAi knockdown protocols and found that the primary source of the reduced knockdown can be attributed to the loss of dsRNA-encoding plasmids from the bacteria fed to the animals. Based on these observations, we have developed a dsRNA feeding protocol that greatly reduces or eliminates plasmid loss to achieve efficient, high throughput knockdown. We demonstrate that this protocol will produce robust, reproducible knock down of C. elegans
genes in multiple tissue types, including neurons, and will permit efficient knockdown in large scale screens. This protocol uses a commercially available dsRNA feeding library and describes all steps needed to duplicate the library and perform dsRNA screens. The protocol does not require the use of any sophisticated equipment, and can therefore be performed by any C. elegans
Developmental Biology, Issue 79, Caenorhabditis elegans (C. elegans), Gene Knockdown Techniques, C. elegans, dsRNA interference, gene knockdown, large scale feeding screen
Mutagenesis and Functional Selection Protocols for Directed Evolution of Proteins in E. coli
Institutions: University of California Santa Cruz - UCSC.
The efficient generation of genetic diversity represents an invaluable molecular tool that can be used to label DNA synthesis, to create unique molecular signatures, or to evolve proteins in the laboratory. Here, we present a protocol that allows the generation of large (>1011
) mutant libraries for a given target sequence. This method is based on replication of a ColE1 plasmid encoding the desired sequence by a low-fidelity variant of DNA polymerase I (LF-Pol I). The target plasmid is transformed into a mutator strain of E. coli
and plated on solid media, yielding between 0.2 and 1 mutations/kb, depending on the location of the target gene. Higher mutation frequencies are achieved by iterating this process of mutagenesis. Compared to alternative methods of mutagenesis, our protocol stands out for its simplicity, as no cloning or PCR are involved. Thus, our method is ideal for mutational labeling of plasmids or other Pol I templates or to explore large sections of sequence space for the evolution of activities not present in the original target. The tight spatial control that PCR or randomized oligonucleotide-based methods offer can also be achieved through subsequent cloning of specific sections of the library. Here we provide protocols showing how to create a random mutant library and how to establish drug-based selections in E. coli
to identify mutants exhibiting new biochemical activities.
Genetics, Issue 49, random mutagenesis, directed evolution, LB agar drug gradient, bacterial complementation, ColE1 plasmid, DNA polymerase I, replication fidelity, genetic adaptation, antimicrobials, methylating agents
The Importance of Correct Protein Concentration for Kinetics and Affinity Determination in Structure-function Analysis
Institutions: GE Healthcare Bio-Sciences AB.
In this study, we explore the interaction between the bovine cysteine protease inhibitor cystatin B and a catalytically inactive form of papain (Fig. 1), a plant cysteine protease, by real-time label-free analysis using Biacore X100. Several cystatin B variants with point mutations in areas of interaction with papain, are produced. For each cystatin B variant we determine its specific binding concentration using calibration-free concentration analysis (CFCA) and compare the values obtained with total protein concentration as determined by A280
. After that, the kinetics of each cystatin B variant binding to papain is measured using single-cycle kinetics (SCK). We show that one of the four cystatin B variants we examine is only partially active for binding. This partial activity, revealed by CFCA, translates to a significant difference in the association rate constant (ka
) and affinity (KD
), compared to the values calculated using total protein concentration. Using CFCA in combination with kinetic analysis in a structure-function study contributes to obtaining reliable results, and helps to make the right interpretation of the interaction mechanism.
Cellular Biology, Issue 37, Protein interaction, Surface Plasmon Resonance, Biacore X100, CFCA, Cystatin B, Papain
Identifying the Effects of BRCA1 Mutations on Homologous Recombination using Cells that Express Endogenous Wild-type BRCA1
Institutions: The Ohio State University, Tohoku University.
The functional analysis of missense mutations can be complicated by the presence in the cell of the endogenous protein. Structure-function analyses of the BRCA1 have been complicated by the lack of a robust assay for the full length BRCA1 protein and the difficulties inherent in working with cell lines that express hypomorphic BRCA1 protein1,2,3,4,5
. We developed a system whereby the endogenous BRCA1 protein in a cell was acutely depleted by RNAi targeting the 3'-UTR of the BRCA1 mRNA and replaced by co-transfecting a plasmid expressing a BRCA1 variant. One advantage of this procedure is that the acute silencing of BRCA1 and simultaneous replacement allow the cells to grow without secondary mutations or adaptations that might arise over time to compensate for the loss of BRCA1 function. This depletion and add-back procedure was done in a HeLa-derived cell line that was readily assayed for homologous recombination activity. The homologous recombination assay is based on a previously published method whereby a recombination substrate is integrated into the genome (Figure 1)6,7,8,9
. This recombination substrate has the rare-cutting I-SceI restriction enzyme site inside an inactive GFP allele, and downstream is a second inactive GFP allele. Transfection of the plasmid that expresses I-SceI results in a double-stranded break, which may be repaired by homologous recombination, and if homologous recombination does repair the break it creates an active GFP allele that is readily scored by flow cytometry for GFP protein expression. Depletion of endogenous BRCA1 resulted in an 8-10-fold reduction in homologous recombination activity, and add-back of wild-type plasmid fully restored homologous recombination function. When specific point mutants of full length BRCA1 were expressed from co-transfected plasmids, the effect of the specific missense mutant could be scored. As an example, the expression of the BRCA1(M18T) protein, a variant of unknown clinical significance10
, was expressed in these cells, it failed to restore BRCA1-dependent homologous recombination. By contrast, expression of another variant, also of unknown significance, BRCA1(I21V) fully restored BRCA1-dependent homologous recombination function. This strategy of testing the function of BRCA1 missense mutations has been applied to another biological system assaying for centrosome function (Kais et al, unpublished observations). Overall, this approach is suitable for the analysis of missense mutants in any gene that must be analyzed recessively.
Cell Biology, Issue 48, BRCA1, homologous recombination, breast cancer, RNA interference, DNA repair
High Throughput Screening of Fungal Endoglucanase Activity in Escherichia coli
Institutions: California Institute of Technology, California Institute of Technology.
Cellulase enzymes (endoglucanases, cellobiohydrolases, and β-glucosidases) hydrolyze cellulose into component sugars, which in turn can be converted into fuel alcohols1
. The potential for enzymatic hydrolysis of cellulosic biomass to provide renewable energy has intensified efforts to engineer cellulases for economical fuel production2
. Of particular interest are fungal cellulases3-8
, which are already being used industrially for foods and textiles processing.
Identifying active variants among a library of mutant cellulases is critical to the engineering process; active mutants can be further tested for improved properties and/or subjected to additional mutagenesis. Efficient engineering of fungal cellulases has been hampered by a lack of genetic tools for native organisms and by difficulties in expressing the enzymes in heterologous hosts. Recently, Morikawa and coworkers developed a method for expressing in E. coli
the catalytic domains of endoglucanases from H. jecorina3,9
, an important industrial fungus with the capacity to secrete cellulases in large quantities. Functional E. coli
expression has also been reported for cellulases from other fungi, including Macrophomina phaseolina10
and Phanerochaete chrysosporium11-12
We present a method for high throughput screening of fungal endoglucanase activity in E. coli
. (Fig 1
) This method uses the common microbial dye Congo Red (CR) to visualize enzymatic degradation of carboxymethyl cellulose (CMC) by cells growing on solid medium. The activity assay requires inexpensive reagents, minimal manipulation, and gives unambiguous results as zones of degradation (“halos”) at the colony site. Although a quantitative measure of enzymatic activity cannot be determined by this method, we have found that halo size correlates with total enzymatic activity in the cell. Further characterization of individual positive clones will determine , relative protein fitness.
Traditional bacterial whole cell CMC/CR activity assays13
involve pouring agar containing CMC onto colonies, which is subject to cross-contamination, or incubating cultures in CMC agar wells, which is less amenable to large-scale experimentation. Here we report an improved protocol that modifies existing wash methods14
for cellulase activity: cells grown on CMC agar plates are removed prior to CR staining. Our protocol significantly reduces cross-contamination and is highly scalable, allowing the rapid screening of thousands of clones. In addition to H. jecorina enzymes
, we have expressed and screened endoglucanase variants from the Thermoascus aurantiacus
and Penicillium decumbens
(shown in Figure 2
), suggesting that this protocol is applicable to enzymes from a range of organisms.
Molecular Biology, Issue 54, cellulase, endoglucanase, CMC, Congo Red
Massively Parallel Reporter Assays in Cultured Mammalian Cells
Institutions: Broad Institute.
The genetic reporter assay is a well-established and powerful tool for dissecting the relationship between DNA sequences and their gene regulatory activities. The potential throughput of this assay has, however, been limited by the need to individually clone and assay the activity of each sequence on interest using protein fluorescence or enzymatic activity as a proxy for regulatory activity. Advances in high-throughput DNA synthesis and sequencing technologies have recently made it possible to overcome these limitations by multiplexing the construction and interrogation of large libraries of reporter constructs. This protocol describes implementation of a Massively Parallel Reporter Assay (MPRA) that allows direct comparison of hundreds of thousands of putative regulatory sequences in a single cell culture dish.
Genetics, Issue 90, gene regulation, transcriptional regulation, sequence-activity mapping, reporter assay, library cloning, transfection, tag sequencing, mammalian cells
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
Monitoring Intraspecies Competition in a Bacterial Cell Population by Cocultivation of Fluorescently Labelled Strains
Institutions: Georg-August University.
Many microorganisms such as bacteria proliferate extremely fast and the populations may reach high cell densities. Small fractions of cells in a population always have accumulated mutations that are either detrimental or beneficial for the cell. If the fitness effect of a mutation provides the subpopulation with a strong selective growth advantage, the individuals of this subpopulation may rapidly outcompete and even completely eliminate their immediate fellows. Thus, small genetic changes and selection-driven accumulation of cells that have acquired beneficial mutations may lead to a complete shift of the genotype of a cell population. Here we present a procedure to monitor the rapid clonal expansion and elimination of beneficial and detrimental mutations, respectively, in a bacterial cell population over time by cocultivation of fluorescently labeled individuals of the Gram-positive model bacterium Bacillus subtilis
. The method is easy to perform and very illustrative to display intraspecies competition among the individuals in a bacterial cell population.
Cellular Biology, Issue 83, Bacillus subtilis, evolution, adaptation, selective pressure, beneficial mutation, intraspecies competition, fluorophore-labelling, Fluorescence Microscopy
Mapping Bacterial Functional Networks and Pathways in Escherichia Coli using Synthetic Genetic Arrays
Institutions: University of Toronto, University of Toronto, University of Regina.
Phenotypes are determined by a complex series of physical (e.g.
protein-protein) and functional (e.g.
gene-gene or genetic) interactions (GI)1
. While physical interactions can indicate which bacterial proteins are associated as complexes, they do not necessarily reveal pathway-level functional relationships1. GI screens, in which the growth of double mutants bearing two deleted or inactivated genes is measured and compared to the corresponding single mutants, can illuminate epistatic dependencies between loci and hence provide a means to query and discover novel functional relationships2
. Large-scale GI maps have been reported for eukaryotic organisms like yeast3-7
, but GI information remains sparse for prokaryotes8
, which hinders the functional annotation of bacterial genomes. To this end, we and others have developed high-throughput quantitative bacterial GI screening methods9, 10
Here, we present the key steps required to perform quantitative E. coli
Synthetic Genetic Array (eSGA) screening procedure on a genome-scale9
, using natural bacterial conjugation and homologous recombination to systemically generate and measure the fitness of large numbers of double mutants in a colony array format.
Briefly, a robot is used to transfer, through conjugation, chloramphenicol (Cm) - marked mutant alleles from engineered Hfr (High frequency of recombination) 'donor strains' into an ordered array of kanamycin (Kan) - marked F- recipient strains. Typically, we use loss-of-function single mutants bearing non-essential gene deletions (e.g.
the 'Keio' collection11
) and essential gene hypomorphic mutations (i.e.
alleles conferring reduced protein expression, stability, or activity9, 12, 13
) to query the functional associations of non-essential and essential genes, respectively. After conjugation and ensuing genetic exchange mediated by homologous recombination, the resulting double mutants are selected on solid medium containing both antibiotics. After outgrowth, the plates are digitally imaged and colony sizes are quantitatively scored using an in-house automated image processing system14
. GIs are revealed when the growth rate of a double mutant is either significantly better or worse than expected9
. Aggravating (or negative) GIs often result between loss-of-function mutations in pairs of genes from compensatory pathways that impinge on the same essential process2
. Here, the loss of a single gene is buffered, such that either single mutant is viable. However, the loss of both pathways is deleterious and results in synthetic lethality or sickness (i.e.
slow growth). Conversely, alleviating (or positive) interactions can occur between genes in the same pathway or protein complex2
as the deletion of either gene alone is often sufficient to perturb the normal function of the pathway or complex such that additional perturbations do not reduce activity, and hence growth, further. Overall, systematically identifying and analyzing GI networks can provide unbiased, global maps of the functional relationships between large numbers of genes, from which pathway-level information missed by other approaches can be inferred9
Genetics, Issue 69, Molecular Biology, Medicine, Biochemistry, Microbiology, Aggravating, alleviating, conjugation, double mutant, Escherichia coli, genetic interaction, Gram-negative bacteria, homologous recombination, network, synthetic lethality or sickness, suppression
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
Institutions: Washington University School of Medicine, Washington University School of Medicine, Washington University School of Medicine.
As DNA sequencing technology has markedly advanced in recent years2
, it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought3
. In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease4,5
. Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants6-8
. However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators.
To address this need, we have developed a pooled sequencing approach1,9
and a novel software package1
for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (https://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA.
Genetics, Issue 64, Genomics, Cancer Biology, Bioinformatics, Pooled DNA sequencing, SPLINTER, rare genetic variants, genetic screening, phenotype, high throughput, computational analysis, DNA, PCR, primers
A New Screening Method for the Directed Evolution of Thermostable Bacteriolytic Enzymes
Institutions: University of Maryland .
Directed evolution is defined as a method to harness natural selection in order to engineer proteins to acquire particular properties that are not associated with the protein in nature. Literature has provided numerous examples regarding the implementation of directed evolution to successfully alter molecular specificity and catalysis1
. The primary advantage of utilizing directed evolution instead of more rational-based approaches for molecular engineering relates to the volume and diversity of variants that can be screened2
. One possible application of directed evolution involves improving structural stability of bacteriolytic enzymes, such as endolysins. Bacteriophage encode and express endolysins to hydrolyze a critical covalent bond in the peptidoglycan (i.e.
cell wall) of bacteria, resulting in host cell lysis and liberation of progeny virions. Notably, these enzymes possess the ability to extrinsically induce lysis to susceptible bacteria in the absence of phage and furthermore have been validated both in vitro
and in vivo
for their therapeutic potential3-5
. The subject of our directed evolution study involves the PlyC endolysin, which is composed of PlyCA and PlyCB subunits6
. When purified and added extrinsically, the PlyC holoenzyme lyses group A streptococci (GAS) as well as other streptococcal groups in a matter of seconds and furthermore has been validated in vivo
. Significantly, monitoring residual enzyme kinetics after elevated temperature incubation provides distinct evidence that PlyC loses lytic activity abruptly at 45 °C, suggesting a short therapeutic shelf life, which may limit additional development of this enzyme. Further studies reveal the lack of thermal stability is only observed for the PlyCA subunit, whereas the PlyCB subunit is stable up to ~90 °C (unpublished observation). In addition to PlyC, there are several examples in literature that describe the thermolabile nature of endolysins. For example, the Staphylococcus aureus
endolysin LysK and Streptococcus pneumoniae
endolysins Cpl-1 and Pal lose activity spontaneously at 42 °C, 43.5 °C and 50.2 °C, respectively8-10
. According to the Arrhenius equation, which relates the rate of a chemical reaction to the temperature present in the particular system, an increase in thermostability will correlate with an increase in shelf life expectancy11
. Toward this end, directed evolution has been shown to be a useful tool for altering the thermal activity of various molecules in nature, but never has this particular technology been exploited successfully for the study of bacteriolytic enzymes. Likewise, successful accounts of progressing the structural stability of this particular class of antimicrobials altogether are nonexistent. In this video, we employ a novel methodology that uses an error-prone DNA polymerase followed by an optimized screening process using a 96 well microtiter plate format to identify mutations to the PlyCA subunit of the PlyC streptococcal endolysin that correlate to an increase in enzyme kinetic stability (Figure 1
). Results after just one round of random mutagenesis suggest the methodology is generating PlyC variants that retain more than twice the residual activity when compared to wild-type (WT) PlyC after elevated temperature treatment.
Immunology, Issue 69, Molecular Biology, Genetics, Microbiology, directed evolution, thermal behavior, thermostability, endolysin, enzybiotic, bacteriolytic, antimicrobial, therapeutic, PlyC
Following in Real Time the Impact of Pneumococcal Virulence Factors in an Acute Mouse Pneumonia Model Using Bioluminescent Bacteria
Institutions: University of Greifswald.
Pneumonia is one of the major health care problems in developing and industrialized countries and is associated with considerable morbidity and mortality. Despite advances in knowledge of this illness, the availability of intensive care units (ICU), and the use of potent antimicrobial agents and effective vaccines, the mortality rates remain high1
. Streptococcus pneumoniae
is the leading pathogen of community-acquired pneumonia (CAP) and one of the most common causes of bacteremia in humans. This pathogen is equipped with an armamentarium of surface-exposed adhesins and virulence factors contributing to pneumonia and invasive pneumococcal disease (IPD). The assessment of the in vivo
role of bacterial fitness or virulence factors is of utmost importance to unravel S. pneumoniae
pathogenicity mechanisms. Murine models of pneumonia, bacteremia, and meningitis are being used to determine the impact of pneumococcal factors at different stages of the infection. Here we describe a protocol to monitor in real-time pneumococcal dissemination in mice after intranasal or intraperitoneal infections with bioluminescent bacteria. The results show the multiplication and dissemination of pneumococci in the lower respiratory tract and blood, which can be visualized and evaluated using an imaging system and the accompanying analysis software.
Infection, Issue 84, Gram-Positive Bacteria, Streptococcus pneumoniae, Pneumonia, Bacterial, Respiratory Tract Infections, animal models, community-acquired pneumonia, invasive pneumococcal diseases, Pneumococci, bioimaging, virulence factor, dissemination, bioluminescence, IVIS Spectrum
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Population Replacement Strategies for Controlling Vector Populations and the Use of Wolbachia pipientis for Genetic Drive
Institutions: Johns Hopkins University.
In this video, Jason Rasgon discusses population replacement strategies to control vector-borne diseases such as malaria and dengue. "Population replacement" is the replacement of wild vector populations (that are competent to transmit pathogens) with those that are not competent to transmit pathogens. There are several theoretical strategies to accomplish this. One is to exploit the maternally-inherited symbiotic bacteria Wolbachia pipientis. Wolbachia is a widespread reproductive parasite that spreads in a selfish manner at the extent of its host's fitness. Jason Rasgon discusses, in detail, the basic biology of this bacterial symbiont and various ways to use it for control of vector-borne diseases.
Cellular Biology, Issue 5, mosquito, malaria, genetics, infectious disease, Wolbachia