Lysine methylation is an emerging post-translation modification and it has been identified on several histone and non-histone proteins, where it plays crucial roles in cell development and many diseases. Approximately 5,000 lysine methylation sites were identified on different proteins, which are set by few dozens of protein lysine methyltransferases. This suggests that each PKMT methylates multiple proteins, however till now only one or two substrates have been identified for several of these enzymes. To approach this problem, we have introduced peptide array based substrate specificity analyses of PKMTs. Peptide arrays are powerful tools to characterize the specificity of PKMTs because methylation of several substrates with different sequences can be tested on one array. We synthesized peptide arrays on cellulose membrane using an Intavis SPOT synthesizer and analyzed the specificity of various PKMTs. Based on the results, for several of these enzymes, novel substrates could be identified. For example, for NSD1 by employing peptide arrays, we showed that it methylates K44 of H4 instead of the reported H4K20 and in addition H1.5K168 is the highly preferred substrate over the previously known H3K36. Hence, peptide arrays are powerful tools to biochemically characterize the PKMTs.
23 Related JoVE Articles!
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
High-throughput Purification of Affinity-tagged Recombinant Proteins
Institutions: Imperial College London .
X-ray crystallography is the method of choice for obtaining a detailed view of the structure of proteins. Such studies need to be complemented by further biochemical analyses to obtain detailed insights into structure/function relationships. Advances in oligonucleotide- and gene synthesis technology make large-scale mutagenesis strategies increasingly feasible, including the substitution of target residues by all 19 other amino acids. Gain- or loss-of-function phenotypes then allow systematic conclusions to be drawn, such as the contribution of particular residues to catalytic activity, protein stability and/or protein-protein interaction specificity.
In order to attribute the different phenotypes to the nature of the mutation - rather than to fluctuating experimental conditions - it is vital to purify and analyse the proteins in a controlled and reproducible manner. High-throughput strategies and the automation of manual protocols on robotic liquid-handling platforms have created opportunities to perform such complex molecular biological procedures with little human intervention and minimal error rates1-5
Here, we present a general method for the purification of His-tagged recombinant proteins in a high-throughput manner. In a recent study, we applied this method to a detailed structure-function investigation of TFIIB, a component of the basal transcription machinery. TFIIB is indispensable for promoter-directed transcription in vitro
and is essential for the recruitment of RNA polymerase into a preinitiation complex6-8
. TFIIB contains a flexible linker domain that penetrates the active site cleft of RNA polymerase9-11
. This linker domain confers two biochemically quantifiable activities on TFIIB, namely (i) the stimulation of the catalytic activity during the 'abortive' stage of transcript initiation, and (ii) an additional contribution to the specific recruitment of RNA polymerase into the preinitiation complex4,5,12
. We exploited the high-throughput purification method to generate single, double and triple substitution and deletions mutations within the TFIIB linker and to subsequently analyse them in functional assays for their stimulation effect on the catalytic activity of RNA polymerase4
. Altogether, we generated, purified and analysed 381 mutants - a task which would have been time-consuming and laborious to perform manually. We produced and assayed the proteins in multiplicates which allowed us to appreciate any experimental variations and gave us a clear idea of the reproducibility of our results.
This method serves as a generic protocol for the purification of His-tagged proteins and has been successfully used to purify other recombinant proteins. It is currently optimised for the purification of 24 proteins but can be adapted to purify up to 96 proteins.
Biochemistry, Issue 66, Genetics, Molecular Biology, Bioinformatics, Recombinant proteins, histidine tag, affinity purification, high-throughput, automation
Identifying Protein-protein Interaction Sites Using Peptide Arrays
Institutions: The Hebrew University of Jerusalem.
Protein-protein interactions mediate most of the processes in the living cell and control homeostasis of the organism. Impaired protein interactions may result in disease, making protein interactions important drug targets. It is thus highly important to understand these interactions at the molecular level. Protein interactions are studied using a variety of techniques ranging from cellular and biochemical assays to quantitative biophysical assays, and these may be performed either with full-length proteins, with protein domains or with peptides. Peptides serve as excellent tools to study protein interactions since peptides can be easily synthesized and allow the focusing on specific interaction sites. Peptide arrays enable the identification of the interaction sites between two proteins as well as screening for peptides that bind the target protein for therapeutic purposes. They also allow high throughput SAR studies. For identification of binding sites, a typical peptide array usually contains partly overlapping 10-20 residues peptides derived from the full sequences of one or more partner proteins of the desired target protein. Screening the array for binding the target protein reveals the binding peptides, corresponding to the binding sites in the partner proteins, in an easy and fast method using only small amount of protein.
In this article we describe a protocol for screening peptide arrays for mapping the interaction sites between a target protein and its partners. The peptide array is designed based on the sequences of the partner proteins taking into account their secondary structures. The arrays used in this protocol were Celluspots arrays prepared by INTAVIS Bioanalytical Instruments. The array is blocked to prevent unspecific binding and then incubated with the studied protein. Detection using an antibody reveals the binding peptides corresponding to the specific interaction sites between the proteins.
Molecular Biology, Issue 93, peptides, peptide arrays, protein-protein interactions, binding sites, peptide synthesis, micro-arrays
Forward Genetic Approaches in Chlamydia trachomatis
Institutions: Duke University Medical Center.
, the etiological agent of sexually transmitted diseases and ocular infections, remains poorly characterized due to its intractability to experimental transformation with recombinant DNA. We developed an approach to perform genetic analysis in C. trachomatis
despite the lack of molecular genetic tools. Our method involves: i.) chemical mutagenesis to rapidly generate comprehensive libraries of genetically-defined mutants with distinct phenotypes; ii.) whole-genome sequencing (WGS) to map the underlying genetic lesions and to find associations between mutated gene(s) and a common phenotype; iii.) generation of recombinant strains through co-infection of mammalian cells with mutant and wild type bacteria. Accordingly, we were able to establish causal relationships between genotypes and phenotypes. The coupling of chemically-induced gene variation and WGS to establish correlative genotype–phenotype associations should be broadly applicable to the large list of medically and environmentally important microorganisms currently intractable to genetic analysis.
Immunology, Issue 80, genetics, chemical mutagenesis, whole genome sequencing
Rapid Synthesis and Screening of Chemically Activated Transcription Factors with GFP-based Reporters
Institutions: Princeton University, Princeton University, California Institute of Technology.
Synthetic biology aims to rationally design and build synthetic circuits with desired quantitative properties, as well as provide tools to interrogate the structure of native control circuits. In both cases, the ability to program gene expression in a rapid and tunable fashion, with no off-target effects, can be useful. We have constructed yeast strains containing the ACT1
promoter upstream of a URA3
cassette followed by the ligand-binding domain of the human estrogen receptor and VP16. By transforming this strain with a linear PCR product containing a DNA binding domain and selecting against the presence of URA3
, a constitutively expressed artificial transcription factor (ATF) can be generated by homologous recombination. ATFs engineered in this fashion can activate a unique target gene in the presence of inducer, thereby eliminating both the off-target activation and nonphysiological growth conditions found with commonly used conditional gene expression systems. A simple method for the rapid construction of GFP reporter plasmids that respond specifically to a native or artificial transcription factor of interest is also provided.
Genetics, Issue 81, transcription, transcription factors, artificial transcription factors, zinc fingers, Zif268, synthetic biology
Combinatorial Synthesis of and High-throughput Protein Release from Polymer Film and Nanoparticle Libraries
Institutions: Iowa State University.
Polyanhydrides are a class of biomaterials with excellent biocompatibility and drug delivery capabilities. While they have been studied extensively with conventional one-sample-at-a-time synthesis techniques, a more recent high-throughput approach has been developed enabling the synthesis and testing of large libraries of polyanhydrides1
. This will facilitate more efficient optimization and design process of these biomaterials for drug and vaccine delivery applications. The method in this work describes the combinatorial synthesis of biodegradable polyanhydride film and nanoparticle libraries and the high-throughput detection of protein release from these libraries. In this robotically operated method (Figure 1
), linear actuators and syringe pumps are controlled by LabVIEW, which enables a hands-free automated protocol, eliminating user error. Furthermore, this method enables the rapid fabrication of micro-scale polymer libraries, reducing the batch size while resulting in the creation of multivariant polymer systems. This combinatorial approach to polymer synthesis facilitates the synthesis of up to 15 different polymers in an equivalent amount of time it would take to synthesize one polymer conventionally. In addition, the combinatorial polymer library can be fabricated into blank or protein-loaded geometries including films or nanoparticles upon dissolution of the polymer library in a solvent and precipitation into a non-solvent (for nanoparticles) or by vacuum drying (for films). Upon loading a fluorochrome-conjugated protein into the polymer libraries, protein release kinetics can be assessed at high-throughput using a fluorescence-based detection method (Figures 2
) as described previously1
. This combinatorial platform has been validated with conventional methods2
and the polyanhydride film and nanoparticle libraries have been characterized with 1
H NMR and FTIR. The libraries have been screened for protein release kinetics, stability and antigenicity; in vitro
cellular toxicity, cytokine production, surface marker expression, adhesion, proliferation and differentiation; and in vivo
biodistribution and mucoadhesion1-11
. The combinatorial method developed herein enables high-throughput polymer synthesis and fabrication of protein-loaded nanoparticle and film libraries, which can, in turn, be screened in vitro
and in vivo
for optimization of biomaterial performance.
Bioengineering, Issue 67, combinatorial, high-throughput, polymer synthesis, polyanhydrides, nanoparticle fabrication, release kinetics, protein delivery
Orthogonal Protein Purification Facilitated by a Small Bispecific Affinity Tag
Institutions: Royal Institute of Technology.
Due to the high costs associated with purification of recombinant proteins the protocols need to be rationalized. For high-throughput efforts there is a demand for general methods that do not require target protein specific optimization1
. To achieve this, purification tags that genetically can be fused to the gene of interest are commonly used2
. The most widely used affinity handle is the hexa-histidine tag, which is suitable for purification under both native and denaturing conditions3
. The metabolic burden for producing the tag is low, but it does not provide as high specificity as competing affinity chromatography based strategies1,2
Here, a bispecific purification tag with two different binding sites on a 46 amino acid, small protein domain has been developed. The albumin-binding domain is derived from Streptococcal protein G and has a strong inherent affinity to human serum albumin (HSA). Eleven surface-exposed amino acids, not involved in albumin-binding4
, were genetically randomized to produce a combinatorial library. The protein library with the novel randomly arranged binding surface (Figure 1) was expressed on phage particles to facilitate selection of binders by phage display technology. Through several rounds of biopanning against a dimeric Z-domain derived from Staphylococcal protein A5
, a small, bispecific molecule with affinity for both HSA and the novel target was identified6
The novel protein domain, referred to as ABDz1, was evaluated as a purification tag for a selection of target proteins with different molecular weight, solubility and isoelectric point. Three target proteins were expressed in Escherishia coli
with the novel tag fused to their N-termini and thereafter affinity purified. Initial purification on either a column with immobilized HSA or Z-domain resulted in relatively pure products. Two-step affinity purification with the bispecific tag resulted in substantial improvement of protein purity. Chromatographic media with the Z-domain immobilized, for example MabSelect SuRe, are readily available for purification of antibodies and HSA can easily be chemically coupled to media to provide the second matrix.
This method is especially advantageous when there is a high demand on purity of the recovered target protein. The bifunctionality of the tag allows two different chromatographic steps to be used while the metabolic burden on the expression host is limited due to the small size of the tag. It provides a competitive alternative to so called combinatorial tagging where multiple tags are used in combination1,7
Molecular Biology, Issue 59, Affinity chromatography, albumin-binding domain, human serum albumin, Z-domain
Improving the Success Rate of Protein Crystallization by Random Microseed Matrix Screening
Institutions: University of Bristol, Douglas Instruments.
Random microseed matrix screening (rMMS) is a protein crystallization technique in which seed crystals are added to random screens. By increasing the likelihood that crystals will grow in the metastable zone of a protein's phase diagram, extra crystallization leads are often obtained, the quality of crystals produced may be increased, and a good supply of crystals for data collection and soaking experiments is provided. Here we describe a general method for rMMS that may be applied to either sitting drop or hanging drop vapor diffusion experiments, established either by hand or using liquid handling robotics, in 96-well or 24-well tray format.
Structural Biology, Issue 78, Crystallography, X-Ray, Biochemical Phenomena, Molecular Structure, Molecular Conformation, protein crystallization, seeding, protein structure
Electronic Tongue Generating Continuous Recognition Patterns for Protein Analysis
Institutions: Institut Nanosciences et Cryogénie, CEA-Grenoble, Université Paris-Sud, Institut de Biologie Structurale.
In current protocol, a combinatorial approach has been developed to simplify the design and production of sensing materials for the construction of electronic tongues (eT) for protein analysis. By mixing a small number of simple and easily accessible molecules with different physicochemical properties, used as building blocks (BBs), in varying and controlled proportions and allowing the mixtures to self-assemble on the gold surface of a prism, an array of combinatorial surfaces featuring appropriate properties for protein sensing was created. In this way, a great number of cross-reactive receptors can be rapidly and efficiently obtained. By combining such an array of combinatorial cross-reactive receptors (CoCRRs) with an optical detection system such as surface plasmon resonance imaging (SPRi), the obtained eT can monitor the binding events in real-time and generate continuous recognition patterns including 2D continuous evolution profile (CEP) and 3D continuous evolution landscape (CEL) for samples in liquid. Such an eT system is efficient for discrimination of common purified proteins.
Bioengineering, Issue 91, electronic tongue, combinatorial cross-reactive receptor, surface plasmon resonance imaging, pattern recognition, continuous evolution profiles, continuous evolution landscapes, protein analysis
Assessment of Immunologically Relevant Dynamic Tertiary Structural Features of the HIV-1 V3 Loop Crown R2 Sequence by ab initio Folding
Institutions: School of Medicine, New York University.
The antigenic diversity of HIV-1 has long been an obstacle to vaccine design, and this variability is especially pronounced in the V3 loop of the virus' surface envelope glycoprotein. We previously proposed that the crown of the V3 loop, although dynamic and sequence variable, is constrained throughout the population of HIV-1 viruses to an immunologically relevant β-hairpin tertiary structure. Importantly, there are thousands of different V3 loop crown sequences in circulating HIV-1 viruses, making 3D structural characterization of trends across the diversity of viruses difficult or impossible by crystallography or NMR. Our previous successful studies with folding of the V3 crown1, 2
used the ab initio
accessible in the ICM-Pro molecular modeling software package (Molsoft LLC, La Jolla, CA) and suggested that the crown of the V3 loop, specifically from positions 10 to 22, benefits sufficiently from the flexibility and length of its flanking stems to behave to a large degree as if it were an unconstrained peptide freely folding in solution. As such, rapid ab initio
folding of just this portion of the V3 loop of any individual strain of the 60,000+ circulating HIV-1 strains can be informative. Here, we folded the V3 loop of the R2 strain to gain insight into the structural basis of its unique properties. R2 bears a rare V3 loop sequence thought to be responsible for the exquisite sensitivity of this strain to neutralization by patient sera and monoclonal antibodies4, 5
. The strain mediates CD4-independent infection and appears to elicit broadly neutralizing antibodies. We demonstrate how evaluation of the results of the folding can be informative for associating observed structures in the folding with the immunological activities observed for R2.
Infection, Issue 43, HIV-1, structure-activity relationships, ab initio simulations, antibody-mediated neutralization, vaccine design
High Throughput Screening of Fungal Endoglucanase Activity in Escherichia coli
Institutions: California Institute of Technology, California Institute of Technology.
Cellulase enzymes (endoglucanases, cellobiohydrolases, and β-glucosidases) hydrolyze cellulose into component sugars, which in turn can be converted into fuel alcohols1
. The potential for enzymatic hydrolysis of cellulosic biomass to provide renewable energy has intensified efforts to engineer cellulases for economical fuel production2
. Of particular interest are fungal cellulases3-8
, which are already being used industrially for foods and textiles processing.
Identifying active variants among a library of mutant cellulases is critical to the engineering process; active mutants can be further tested for improved properties and/or subjected to additional mutagenesis. Efficient engineering of fungal cellulases has been hampered by a lack of genetic tools for native organisms and by difficulties in expressing the enzymes in heterologous hosts. Recently, Morikawa and coworkers developed a method for expressing in E. coli
the catalytic domains of endoglucanases from H. jecorina3,9
, an important industrial fungus with the capacity to secrete cellulases in large quantities. Functional E. coli
expression has also been reported for cellulases from other fungi, including Macrophomina phaseolina10
and Phanerochaete chrysosporium11-12
We present a method for high throughput screening of fungal endoglucanase activity in E. coli
. (Fig 1
) This method uses the common microbial dye Congo Red (CR) to visualize enzymatic degradation of carboxymethyl cellulose (CMC) by cells growing on solid medium. The activity assay requires inexpensive reagents, minimal manipulation, and gives unambiguous results as zones of degradation (“halos”) at the colony site. Although a quantitative measure of enzymatic activity cannot be determined by this method, we have found that halo size correlates with total enzymatic activity in the cell. Further characterization of individual positive clones will determine , relative protein fitness.
Traditional bacterial whole cell CMC/CR activity assays13
involve pouring agar containing CMC onto colonies, which is subject to cross-contamination, or incubating cultures in CMC agar wells, which is less amenable to large-scale experimentation. Here we report an improved protocol that modifies existing wash methods14
for cellulase activity: cells grown on CMC agar plates are removed prior to CR staining. Our protocol significantly reduces cross-contamination and is highly scalable, allowing the rapid screening of thousands of clones. In addition to H. jecorina enzymes
, we have expressed and screened endoglucanase variants from the Thermoascus aurantiacus
and Penicillium decumbens
(shown in Figure 2
), suggesting that this protocol is applicable to enzymes from a range of organisms.
Molecular Biology, Issue 54, cellulase, endoglucanase, CMC, Congo Red
Designing a Bio-responsive Robot from DNA Origami
Institutions: Bar-Ilan University.
Nucleic acids are astonishingly versatile. In addition to their natural role as storage medium for biological information1
, they can be utilized in parallel computing2,3
, recognize and bind molecular or cellular targets4,5
, catalyze chemical reactions6,7
, and generate calculated responses in a biological system8,9
. Importantly, nucleic acids can be programmed to self-assemble into 2D and 3D structures10-12
, enabling the integration of all these remarkable features in a single robot linking the sensing of biological cues to a preset response in order to exert a desired effect.
Creating shapes from nucleic acids was first proposed by Seeman13
, and several variations on this theme have since been realized using various techniques11,12,14,15
. However, the most significant is perhaps the one proposed by Rothemund, termed scaffolded DNA origami16
. In this technique, the folding of a long (>7,000 bases) single-stranded DNA 'scaffold'
is directed to a desired shape by hundreds of short complementary strands termed 'staples'
. Folding is carried out by temperature annealing ramp. This technique was successfully demonstrated in the creation of a diverse array of 2D shapes with remarkable precision and robustness. DNA origami was later extended to 3D as well17,18
The current paper will focus on the caDNAno 2.0 software19
developed by Douglas and colleagues. caDNAno is a robust, user-friendly CAD tool enabling the design of 2D and 3D DNA origami shapes with versatile features. The design process relies on a systematic and accurate abstraction scheme for DNA structures, making it relatively straightforward and efficient.
In this paper we demonstrate the design of a DNA origami nanorobot that has been recently described20
. This robot is 'robotic' in the sense that it links sensing to actuation, in order to perform a task. We explain how various sensing schemes can be integrated into the structure, and how this can be relayed to a desired effect. Finally we use Cando21
to simulate the mechanical properties of the designed shape. The concept we discuss can be adapted to multiple tasks and settings.
Bioengineering, Issue 77, Genetics, Biomedical Engineering, Molecular Biology, Medicine, Genomics, Nanotechnology, Nanomedicine, DNA origami, nanorobot, caDNAno, DNA, DNA Origami, nucleic acids, DNA structures, CAD, sequencing
The ChroP Approach Combines ChIP and Mass Spectrometry to Dissect Locus-specific Proteomic Landscapes of Chromatin
Institutions: European Institute of Oncology.
Chromatin is a highly dynamic nucleoprotein complex made of DNA and proteins that controls various DNA-dependent processes. Chromatin structure and function at specific regions is regulated by the local enrichment of histone post-translational modifications (hPTMs) and variants, chromatin-binding proteins, including transcription factors, and DNA methylation. The proteomic characterization of chromatin composition at distinct functional regions has been so far hampered by the lack of efficient protocols to enrich such domains at the appropriate purity and amount for the subsequent in-depth analysis by Mass Spectrometry (MS). We describe here a newly designed chromatin proteomics strategy, named ChroP (Chromatin Proteomics
), whereby a preparative chromatin immunoprecipitation is used to isolate distinct chromatin regions whose features, in terms of hPTMs, variants and co-associated non-histonic proteins, are analyzed by MS. We illustrate here the setting up of ChroP for the enrichment and analysis of transcriptionally silent heterochromatic regions, marked by the presence of tri-methylation of lysine 9 on histone H3. The results achieved demonstrate the potential of ChroP
in thoroughly characterizing the heterochromatin proteome and prove it as a powerful analytical strategy for understanding how the distinct protein determinants of chromatin interact and synergize to establish locus-specific structural and functional configurations.
Biochemistry, Issue 86, chromatin, histone post-translational modifications (hPTMs), epigenetics, mass spectrometry, proteomics, SILAC, chromatin immunoprecipitation , histone variants, chromatome, hPTMs cross-talks
Split-and-pool Synthesis and Characterization of Peptide Tertiary Amide Library
Institutions: The Scripps Research Institute.
Peptidomimetics are great sources of protein ligands. The oligomeric nature of these compounds enables us to access large synthetic libraries on solid phase by using combinatorial chemistry. One of the most well studied classes of peptidomimetics is peptoids. Peptoids are easy to synthesize and have been shown to be proteolysis-resistant and cell-permeable. Over the past decade, many useful protein ligands have been identified through screening of peptoid libraries. However, most of the ligands identified from peptoid libraries do not display high affinity, with rare exceptions. This may be due, in part, to the lack of chiral centers and conformational constraints in peptoid molecules. Recently, we described a new synthetic route to access peptide tertiary amides (PTAs). PTAs are a superfamily of peptidomimetics that include but are not limited to peptides, peptoids and N-methylated peptides. With side chains on both α-carbon and main chain nitrogen atoms, the conformation of these molecules are greatly constrained by sterical hindrance and allylic 1,3 strain. (Figure 1
) Our study suggests that these PTA molecules are highly structured in solution and can be used to identify protein ligands. We believe that these molecules can be a future source of high-affinity protein ligands. Here we describe the synthetic method combining the power of both split-and-pool and sub-monomer strategies to synthesize a sample one-bead one-compound (OBOC) library of PTAs.
Chemistry, Issue 88, Split-and-pool synthesis, peptide tertiary amide, PTA, peptoid, high-throughput screening, combinatorial library, solid phase, triphosgene (BTC), one-bead one-compound, OBOC
Isolation of Fidelity Variants of RNA Viruses and Characterization of Virus Mutation Frequency
Institutions: Institut Pasteur .
RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3
. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7
Immunology, Issue 52, Polymerase fidelity, RNA virus, mutation frequency, mutagen, RNA polymerase, viral evolution
Investigating Protein-protein Interactions in Live Cells Using Bioluminescence Resonance Energy Transfer
Institutions: Max Planck Institute for Psycholinguistics, Donders Institute for Brain, Cognition and Behaviour.
Assays based on Bioluminescence Resonance Energy Transfer (BRET) provide a sensitive and reliable means to monitor protein-protein interactions in live cells. BRET is the non-radiative transfer of energy from a 'donor' luciferase enzyme to an 'acceptor' fluorescent protein. In the most common configuration of this assay, the donor is Renilla reniformis
luciferase and the acceptor is Yellow Fluorescent Protein (YFP). Because the efficiency of energy transfer is strongly distance-dependent, observation of the BRET phenomenon requires that the donor and acceptor be in close proximity. To test for an interaction between two proteins of interest in cultured mammalian cells, one protein is expressed as a fusion with luciferase and the second as a fusion with YFP. An interaction between the two proteins of interest may bring the donor and acceptor sufficiently close for energy transfer to occur. Compared to other techniques for investigating protein-protein interactions, the BRET assay is sensitive, requires little hands-on time and few reagents, and is able to detect interactions which are weak, transient, or dependent on the biochemical environment found within a live cell. It is therefore an ideal approach for confirming putative interactions suggested by yeast two-hybrid or mass spectrometry proteomics studies, and in addition it is well-suited for mapping interacting regions, assessing the effect of post-translational modifications on protein-protein interactions, and evaluating the impact of mutations identified in patient DNA.
Cellular Biology, Issue 87, Protein-protein interactions, Bioluminescence Resonance Energy Transfer, Live cell, Transfection, Luciferase, Yellow Fluorescent Protein, Mutations
Genetically-encoded Molecular Probes to Study G Protein-coupled Receptors
Institutions: The Rockefeller University.
To facilitate structural and dynamic studies of G protein-coupled receptor (GPCR) signaling complexes, new approaches are required to introduce informative probes or labels into expressed receptors that do not perturb receptor function. We used amber codon suppression technology to genetically-encode the unnatural amino acid, p
-azido-L-phenylalanine (azF) at various targeted positions in GPCRs heterologously expressed in mammalian cells. The versatility of the azido group is illustrated here in different applications to study GPCRs in their native cellular environment or under detergent solubilized conditions. First, we demonstrate a cell-based targeted photocrosslinking technology to identify the residues in the ligand-binding pocket of GPCR where a tritium-labeled small-molecule ligand is crosslinked to a genetically-encoded azido amino acid. We then demonstrate site-specific modification of GPCRs by the bioorthogonal Staudinger-Bertozzi ligation reaction that targets the azido group using phosphine derivatives. We discuss a general strategy for targeted peptide-epitope tagging of expressed membrane proteins in-culture and its detection using a whole-cell-based ELISA approach. Finally, we show that azF-GPCRs can be selectively tagged with fluorescent probes. The methodologies discussed are general, in that they can in principle be applied to any amino acid position in any expressed GPCR to interrogate active signaling complexes.
Genetics, Issue 79, Receptors, G-Protein-Coupled, Protein Engineering, Signal Transduction, Biochemistry, Unnatural amino acid, site-directed mutagenesis, G protein-coupled receptor, targeted photocrosslinking, bioorthogonal labeling, targeted epitope tagging
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
In Vivo Modeling of the Morbid Human Genome using Danio rerio
Institutions: Duke University Medical Center, Duke University, Duke University Medical Center.
Here, we present methods for the development of assays to query potentially clinically significant nonsynonymous changes using in vivo
complementation in zebrafish. Zebrafish (Danio rerio
) are a useful animal system due to their experimental tractability; embryos are transparent to enable facile viewing, undergo rapid development ex vivo,
and can be genetically manipulated.1
These aspects have allowed for significant advances in the analysis of embryogenesis, molecular processes, and morphogenetic signaling. Taken together, the advantages of this vertebrate model make zebrafish highly amenable to modeling the developmental defects in pediatric disease, and in some cases, adult-onset disorders. Because the zebrafish genome is highly conserved with that of humans (~70% orthologous), it is possible to recapitulate human disease states in zebrafish. This is accomplished either through the injection of mutant human mRNA to induce dominant negative or gain of function alleles, or utilization of morpholino (MO) antisense oligonucleotides to suppress genes to mimic loss of function variants. Through complementation of MO-induced phenotypes with capped human mRNA, our approach enables the interpretation of the deleterious effect of mutations on human protein sequence based on the ability of mutant mRNA to rescue a measurable, physiologically relevant phenotype. Modeling of the human disease alleles occurs through microinjection of zebrafish embryos with MO and/or human mRNA at the 1-4 cell stage, and phenotyping up to seven days post fertilization (dpf). This general strategy can be extended to a wide range of disease phenotypes, as demonstrated in the following protocol. We present our established models for morphogenetic signaling, craniofacial, cardiac, vascular integrity, renal function, and skeletal muscle disorder phenotypes, as well as others.
Molecular Biology, Issue 78, Genetics, Biomedical Engineering, Medicine, Developmental Biology, Biochemistry, Anatomy, Physiology, Bioengineering, Genomics, Medical, zebrafish, in vivo, morpholino, human disease modeling, transcription, PCR, mRNA, DNA, Danio rerio, animal model
Sequence-specific Labeling of Nucleic Acids and Proteins with Methyltransferases and Cofactor Analogues
Institutions: RWTH Aachen University.
-Adenosyl-l-methionine (AdoMet or SAM)-dependent methyltransferases (MTase) catalyze the transfer of the activated methyl group from AdoMet to specific positions in DNA, RNA, proteins and small biomolecules. This natural methylation reaction can be expanded to a wide variety of alkylation reactions using synthetic cofactor analogues. Replacement of the reactive sulfonium center of AdoMet with an aziridine ring leads to cofactors which can be coupled with DNA by various DNA MTases. These aziridine cofactors can be equipped with reporter groups at different positions of the adenine moiety and used for S
of DNA (SMILing DNA). As a typical example we give a protocol for biotinylation of pBR322 plasmid DNA at the 5’-ATCGA
T-3’ sequence with the DNA MTase M.BseCI and the aziridine cofactor 6BAz in one step. Extension of the activated methyl group with unsaturated alkyl groups results in another class of AdoMet analogues which are used for m
ransfer of A
roups (mTAG). Since the extended side chains are activated by the sulfonium center and the unsaturated bond, these cofactors are called double-activated AdoMet analogues. These analogues not only function as cofactors for DNA MTases, like the aziridine cofactors, but also for RNA, protein and small molecule MTases. They are typically used for enzymatic modification of MTase substrates with unique functional groups which are labeled with reporter groups in a second chemical step. This is exemplified in a protocol for fluorescence labeling of histone H3 protein. A small propargyl group is transferred from the cofactor analogue SeAdoYn to the protein by the histone H3 lysine 4 (H3K4) MTase Set7/9 followed by click labeling of the alkynylated histone H3 with TAMRA azide. MTase-mediated labeling with cofactor analogues is an enabling technology for many exciting applications including identification and functional study of MTase substrates as well as DNA genotyping and methylation detection.
Biochemistry, Issue 93, S-adenosyl-l-methionine, AdoMet, SAM, aziridine cofactor, double activated cofactor, methyltransferase, DNA methylation, protein methylation, biotin labeling, fluorescence labeling, SMILing, mTAG
Principles of Site-Specific Recombinase (SSR) Technology
Institutions: Max Plank Institute for Molecular Cell Biology and Genetics, Dresden.
Site-specific recombinase (SSR) technology allows the manipulation of gene structure to explore gene function and has become an integral tool of molecular biology. Site-specific recombinases are proteins that bind to distinct DNA target sequences. The Cre/lox system was first described in bacteriophages during the 1980's. Cre recombinase is a Type I topoisomerase that catalyzes site-specific recombination of DNA between two loxP (locus of X-over P1) sites. The Cre/lox system does not require any cofactors. LoxP sequences contain distinct binding sites for Cre recombinases that surround a directional core sequence where recombination and rearrangement takes place. When cells contain loxP sites and express the Cre recombinase, a recombination event occurs. Double-stranded DNA is cut at both loxP sites by the Cre recombinase, rearranged, and ligated ("scissors and glue"). Products of the recombination event depend on the relative orientation of the asymmetric sequences.
SSR technology is frequently used as a tool to explore gene function. Here the gene of interest is flanked with Cre target sites loxP ("floxed"). Animals are then crossed with animals expressing the Cre recombinase under the control of a tissue-specific promoter. In tissues that express the Cre recombinase it binds to target sequences and excises the floxed gene. Controlled gene deletion allows the investigation of gene function in specific tissues and at distinct time points. Analysis of gene function employing SSR technology --- conditional mutagenesis -- has significant advantages over traditional knock-outs where gene deletion is frequently lethal.
Cellular Biology, Issue 15, Molecular Biology, Site-Specific Recombinase, Cre recombinase, Cre/lox system, transgenic animals, transgenic technology
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Molecular Evolution of the Tre Recombinase
Institutions: Max Plank Institute for Molecular Cell Biology and Genetics, Dresden.
Here we report the generation of Tre recombinase through directed, molecular evolution. Tre recombinase recognizes a pre-defined target sequence within the LTR sequences of the HIV-1 provirus, resulting in the excision and eradication of the provirus from infected human cells.
We started with Cre, a 38-kDa recombinase, that recognizes a 34-bp double-stranded DNA sequence known as loxP. Because Cre can effectively eliminate genomic sequences, we set out to tailor a recombinase that could remove the sequence between the 5'-LTR and 3'-LTR of an integrated HIV-1 provirus. As a first step we identified sequences within the LTR sites that were similar to loxP and tested for recombination activity. Initially Cre and mutagenized Cre libraries failed to recombine the chosen loxLTR sites of the HIV-1 provirus. As the start of any directed molecular evolution process requires at least residual activity, the original asymmetric loxLTR sequences were split into subsets and tested again for recombination activity. Acting as intermediates, recombination activity was shown with the subsets. Next, recombinase libraries were enriched through reiterative evolution cycles. Subsequently, enriched libraries were shuffled and recombined. The combination of different mutations proved synergistic and recombinases were created that were able to recombine loxLTR1 and loxLTR2. This was evidence that an evolutionary strategy through intermediates can be successful. After a total of 126 evolution cycles individual recombinases were functionally and structurally analyzed. The most active recombinase -- Tre -- had 19 amino acid changes as compared to Cre. Tre recombinase was able to excise the HIV-1 provirus from the genome HIV-1 infected HeLa cells (see "HIV-1 Proviral DNA Excision Using an Evolved Recombinase", Hauber J., Heinrich-Pette-Institute for Experimental Virology and Immunology, Hamburg, Germany). While still in its infancy, directed molecular evolution will allow the creation of custom enzymes that will serve as tools of "molecular surgery" and molecular medicine.
Cell Biology, Issue 15, HIV-1, Tre recombinase, Site-specific recombination, molecular evolution