Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
20 Related JoVE Articles!
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
The Importance of Correct Protein Concentration for Kinetics and Affinity Determination in Structure-function Analysis
Institutions: GE Healthcare Bio-Sciences AB.
In this study, we explore the interaction between the bovine cysteine protease inhibitor cystatin B and a catalytically inactive form of papain (Fig. 1), a plant cysteine protease, by real-time label-free analysis using Biacore X100. Several cystatin B variants with point mutations in areas of interaction with papain, are produced. For each cystatin B variant we determine its specific binding concentration using calibration-free concentration analysis (CFCA) and compare the values obtained with total protein concentration as determined by A280
. After that, the kinetics of each cystatin B variant binding to papain is measured using single-cycle kinetics (SCK). We show that one of the four cystatin B variants we examine is only partially active for binding. This partial activity, revealed by CFCA, translates to a significant difference in the association rate constant (ka
) and affinity (KD
), compared to the values calculated using total protein concentration. Using CFCA in combination with kinetic analysis in a structure-function study contributes to obtaining reliable results, and helps to make the right interpretation of the interaction mechanism.
Cellular Biology, Issue 37, Protein interaction, Surface Plasmon Resonance, Biacore X100, CFCA, Cystatin B, Papain
Monitoring the Assembly of a Secreted Bacterial Virulence Factor Using Site-specific Crosslinking
Institutions: National Institutes of Health.
This article describes a method to detect and analyze dynamic interactions between a protein of interest and other factors in vivo
. Our method is based on the amber suppression technology that was originally developed by Peter Schultz and colleagues1
. An amber mutation is first introduced at a specific codon of the gene encoding the protein of interest. The amber mutant is then expressed in E. coli
together with genes encoding an amber suppressor tRNA and an amino acyl-tRNA synthetase derived from Methanococcus jannaschii
. Using this system, the photo activatable amino acid analog p-benzoylphenylalanine (Bpa) is incorporated at the amber codon. Cells are then irradiated with ultraviolet light to covalently link the Bpa residue to proteins that are located within 3-8 Å. Photocrosslinking is performed in combination with pulse-chase labeling and immunoprecipitation of the protein of interest in order to monitor changes in protein-protein interactions that occur over a time scale of seconds to minutes. We optimized the procedure to study the assembly of a bacterial virulence factor that consists of two independent domains, a domain that is integrated into the outer membrane and a domain that is translocated into the extracellular space, but the method can be used to study many different assembly processes and biological pathways in both prokaryotic and eukaryotic cells. In principle interacting factors and even specific residues of interacting factors that bind to a protein of interest can be identified by mass spectrometry.
Immunology, Issue 82, Autotransporters, Bam complex, Molecular chaperones, protein-protein interactions, Site-specific photocrosslinking
Investigating Protein-protein Interactions in Live Cells Using Bioluminescence Resonance Energy Transfer
Institutions: Max Planck Institute for Psycholinguistics, Donders Institute for Brain, Cognition and Behaviour.
Assays based on Bioluminescence Resonance Energy Transfer (BRET) provide a sensitive and reliable means to monitor protein-protein interactions in live cells. BRET is the non-radiative transfer of energy from a 'donor' luciferase enzyme to an 'acceptor' fluorescent protein. In the most common configuration of this assay, the donor is Renilla reniformis
luciferase and the acceptor is Yellow Fluorescent Protein (YFP). Because the efficiency of energy transfer is strongly distance-dependent, observation of the BRET phenomenon requires that the donor and acceptor be in close proximity. To test for an interaction between two proteins of interest in cultured mammalian cells, one protein is expressed as a fusion with luciferase and the second as a fusion with YFP. An interaction between the two proteins of interest may bring the donor and acceptor sufficiently close for energy transfer to occur. Compared to other techniques for investigating protein-protein interactions, the BRET assay is sensitive, requires little hands-on time and few reagents, and is able to detect interactions which are weak, transient, or dependent on the biochemical environment found within a live cell. It is therefore an ideal approach for confirming putative interactions suggested by yeast two-hybrid or mass spectrometry proteomics studies, and in addition it is well-suited for mapping interacting regions, assessing the effect of post-translational modifications on protein-protein interactions, and evaluating the impact of mutations identified in patient DNA.
Cellular Biology, Issue 87, Protein-protein interactions, Bioluminescence Resonance Energy Transfer, Live cell, Transfection, Luciferase, Yellow Fluorescent Protein, Mutations
Nanomanipulation of Single RNA Molecules by Optical Tweezers
Institutions: University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York.
A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed.
Bioengineering, Issue 90, RNA folding, single-molecule, optical tweezers, nanomanipulation, RNA secondary structure, RNA tertiary structure
Determination of Protein-ligand Interactions Using Differential Scanning Fluorimetry
Institutions: University of Exeter.
A wide range of methods are currently available for determining the dissociation constant between a protein and interacting small molecules. However, most of these require access to specialist equipment, and often require a degree of expertise to effectively establish reliable experiments and analyze data. Differential scanning fluorimetry (DSF) is being increasingly used as a robust method for initial screening of proteins for interacting small molecules, either for identifying physiological partners or for hit discovery. This technique has the advantage that it requires only a PCR machine suitable for quantitative PCR, and so suitable instrumentation is available in most institutions; an excellent range of protocols are already available; and there are strong precedents in the literature for multiple uses of the method. Past work has proposed several means of calculating dissociation constants from DSF data, but these are mathematically demanding. Here, we demonstrate a method for estimating dissociation constants from a moderate amount of DSF experimental data. These data can typically be collected and analyzed within a single day. We demonstrate how different models can be used to fit data collected from simple binding events, and where cooperative binding or independent binding sites are present. Finally, we present an example of data analysis in a case where standard models do not apply. These methods are illustrated with data collected on commercially available control proteins, and two proteins from our research program. Overall, our method provides a straightforward way for researchers to rapidly gain further insight into protein-ligand interactions using DSF.
Biophysics, Issue 91, differential scanning fluorimetry, dissociation constant, protein-ligand interactions, StepOne, cooperativity, WcbI.
In Vitro Reconstitution of Light-harvesting Complexes of Plants and Green Algae
Institutions: VU University Amsterdam.
In plants and green algae, light is captured by the light-harvesting complexes (LHCs), a family of integral membrane proteins that coordinate chlorophylls and carotenoids. In vivo
, these proteins are folded with pigments to form complexes which are inserted in the thylakoid membrane of the chloroplast. The high similarity in the chemical and physical properties of the members of the family, together with the fact that they can easily lose pigments during isolation, makes their purification in a native state challenging. An alternative approach to obtain homogeneous preparations of LHCs was developed by Plumley and Schmidt in 19871
, who showed that it was possible to reconstitute these complexes in vitro
starting from purified pigments and unfolded apoproteins, resulting in complexes with properties very similar to that of native complexes. This opened the way to the use of bacterial expressed recombinant proteins for in vitro
reconstitution. The reconstitution method is powerful for various reasons: (1) pure preparations of individual complexes can be obtained, (2) pigment composition can be controlled to assess their contribution to structure and function, (3) recombinant proteins can be mutated to study the functional role of the individual residues (e.g.,
pigment binding sites) or protein domain (e.g.,
protein-protein interaction, folding). This method has been optimized in several laboratories and applied to most of the light-harvesting complexes. The protocol described here details the method of reconstituting light-harvesting complexes in vitro
currently used in our laboratory,
and examples describing applications of the method are provided.
Biochemistry, Issue 92, Reconstitution, Photosynthesis, Chlorophyll, Carotenoids, Light Harvesting Protein, Chlamydomonas reinhardtii, Arabidopsis thaliana
Mass Spectrometric Approaches to Study Protein Structure and Interactions in Lyophilized Powders
Institutions: Purdue University.
Amide hydrogen/deuterium exchange (ssHDX-MS) and side-chain photolytic labeling (ssPL-MS) followed by mass spectrometric analysis can be valuable for characterizing lyophilized formulations of protein therapeutics. Labeling followed by suitable proteolytic digestion allows the protein structure and interactions to be mapped with peptide-level resolution. Since the protein structural elements are stabilized by a network of chemical bonds from the main-chains and side-chains of amino acids, specific labeling of atoms in the amino acid residues provides insight into the structure and conformation of the protein. In contrast to routine methods used to study proteins in lyophilized solids (e.g.
, FTIR), ssHDX-MS and ssPL-MS provide quantitative and site-specific information. The extent of deuterium incorporation and kinetic parameters can be related to rapidly and slowly exchanging amide pools (Nfast
) and directly reflects the degree of protein folding and structure in lyophilized formulations. Stable photolytic labeling does not undergo back-exchange, an advantage over ssHDX-MS. Here, we provide detailed protocols for both ssHDX-MS and ssPL-MS, using myoglobin (Mb) as a model protein in lyophilized formulations containing either trehalose or sorbitol.
Chemistry, Issue 98, Amide hydrogen/deuterium exchange, photolytic labeling, mass spectrometry, lyophilized formulations, photo-leucine, solid-state, protein structure, protein conformation, protein dynamics, secondary structure, protein stability, excipients
Direct Protein Delivery to Mammalian Cells Using Cell-permeable Cys2-His2 Zinc-finger Domains
Institutions: The Scripps Research Institute, ShanghaiTech University.
Due to their modularity and ability to be reprogrammed to recognize a wide range of DNA sequences, Cys2
zinc-finger DNA-binding domains have emerged as useful tools for targeted genome engineering. Like many other DNA-binding proteins, zinc-fingers also possess the innate ability to cross cell membranes. We recently demonstrated that this intrinsic cell-permeability could be leveraged for intracellular protein delivery. Genetic fusion of zinc-finger motifs leads to efficient transport of protein and enzyme cargo into a broad range of mammalian cell types. Unlike other protein transduction technologies, delivery via zinc-finger domains does not inhibit enzyme activity and leads to high levels of cytosolic delivery. Here a detailed step-by-step protocol is presented for the implementation of zinc-finger technology for protein delivery into mammalian cells. Key steps for achieving high levels of intracellular zinc-finger-mediated delivery are highlighted and strategies for maximizing the performance of this system are discussed.
Molecular Biology, Issue 97, protein delivery, cell-penetrating peptide, zinc-finger, protein transduction domain, chemical biology, molecular biology
Investigating Receptor-ligand Systems of the Cellulosome with AFM-based Single-molecule Force Spectroscopy
Cellulosomes are discrete multienzyme complexes used by a subset of anaerobic bacteria and fungi to digest lignocellulosic substrates. Assembly of the enzymes onto the noncatalytic scaffold protein is directed by interactions among a family of related receptor-ligand pairs comprising interacting cohesin and dockerin modules. The extremely strong binding between cohesin and dockerin modules results in dissociation constants in the low picomolar to nanomolar range, which may hamper accurate off-rate measurements with conventional bulk methods. Single-molecule force spectroscopy (SMFS) with the atomic force microscope measures the response of individual biomolecules to force, and in contrast to other single-molecule manipulation methods (i.e.
optical tweezers), is optimal for studying high-affinity receptor-ligand interactions because of its ability to probe the high-force regime (>120 pN). Here we present our complete protocol for studying cellulosomal protein assemblies at the single-molecule level. Using a protein topology derived from the native cellulosome, we worked with enzyme-dockerin and carbohydrate binding module-cohesin (CBM-cohesin) fusion proteins, each with an accessible free thiol group at an engineered cysteine residue. We present our site-specific surface immobilization protocol, along with our measurement and data analysis procedure for obtaining detailed binding parameters for the high-affinity complex. We demonstrate how to quantify single subdomain unfolding forces, complex rupture forces, kinetic off-rates, and potential widths of the binding well. The successful application of these methods in characterizing the cohesin-dockerin interaction responsible for assembly of multidomain cellulolytic complexes is further described.
Bioengineering, Issue 82, biophysics, protein unfolding, atomic force microscopy, surface immobilization
Mouse Genome Engineering Using Designer Nucleases
Institutions: University of Zurich, University of Minnesota.
Transgenic mice carrying site-specific genome modifications (knockout, knock-in) are of vital importance for dissecting complex biological systems as well as for modeling human diseases and testing therapeutic strategies. Recent advances in the use of designer nucleases such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) 9 system for site-specific genome engineering open the possibility to perform rapid targeted genome modification in virtually any laboratory species without the need to rely on embryonic stem (ES) cell technology. A genome editing experiment typically starts with identification of designer nuclease target sites within a gene of interest followed by construction of custom DNA-binding domains to direct nuclease activity to the investigator-defined genomic locus. Designer nuclease plasmids are in vitro
transcribed to generate mRNA for microinjection of fertilized mouse oocytes. Here, we provide a protocol for achieving targeted genome modification by direct injection of TALEN mRNA into fertilized mouse oocytes.
Genetics, Issue 86, Oocyte microinjection, Designer nucleases, ZFN, TALEN, Genome Engineering
The Cell-based L-Glutathione Protection Assays to Study Endocytosis and Recycling of Plasma Membrane Proteins
Institutions: Children's Hospital of Pittsburgh of UPMC, University of Pittsburgh School of Medicine.
Membrane trafficking involves transport of proteins from the plasma membrane to the cell interior (i.e.
endocytosis) followed by trafficking to lysosomes for degradation or to the plasma membrane for recycling. The cell based L-glutathione protection assays can be used to study endocytosis and recycling of protein receptors, channels, transporters, and adhesion molecules localized at the cell surface. The endocytic assay requires labeling of cell surface proteins with a cell membrane impermeable biotin containing a disulfide bond and the N-hydroxysuccinimide (NHS) ester at 4 ºC - a temperature at which membrane trafficking does not occur. Endocytosis of biotinylated plasma membrane proteins is induced by incubation at 37 ºC. Next, the temperature is decreased again to 4 ºC to stop endocytic trafficking and the disulfide bond in biotin covalently attached to proteins that have remained at the plasma membrane is reduced with L-glutathione. At this point, only proteins that were endocytosed remain protected from L-glutathione and thus remain biotinylated. After cell lysis, biotinylated proteins are isolated with streptavidin agarose, eluted from agarose, and the biotinylated protein of interest is detected by western blotting. During the recycling assay, after biotinylation cells are incubated at 37 °C to load endocytic vesicles with biotinylated proteins and the disulfide bond in biotin covalently attached to proteins remaining at the plasma membrane is reduced with L-glutathione at 4 ºC as in the endocytic assay. Next, cells are incubated again at 37 °C to allow biotinylated proteins from endocytic vesicles to recycle to the plasma membrane. Cells are then incubated at 4 ºC, and the disulfide bond in biotin attached to proteins that recycled to the plasma membranes is reduced with L-glutathione. The biotinylated proteins protected from L-glutathione are those that did not recycle to the plasma membrane.
Basic Protocol, Issue 82, Endocytosis, recycling, plasma membrane, cell surface, EZLink, Sulfo-NHS-SS-Biotin, L-Glutathione, GSH, thiol group, disulfide bond, epithelial cells, cell polarization
Analyzing Protein Dynamics Using Hydrogen Exchange Mass Spectrometry
Institutions: University of Heidelberg.
All cellular processes depend on the functionality of proteins. Although the functionality of a given protein is the direct consequence of its unique amino acid sequence, it is only realized by the folding of the polypeptide chain into a single defined three-dimensional arrangement or more commonly into an ensemble of interconverting conformations. Investigating the connection between protein conformation and its function is therefore essential for a complete understanding of how proteins are able to fulfill their great variety of tasks. One possibility to study conformational changes a protein undergoes while progressing through its functional cycle is hydrogen-1
H-exchange in combination with high-resolution mass spectrometry (HX-MS). HX-MS is a versatile and robust method that adds a new dimension to structural information obtained by e.g.
crystallography. It is used to study protein folding and unfolding, binding of small molecule ligands, protein-protein interactions, conformational changes linked to enzyme catalysis, and allostery. In addition, HX-MS is often used when the amount of protein is very limited or crystallization of the protein is not feasible. Here we provide a general protocol for studying protein dynamics with HX-MS and describe as an example how to reveal the interaction interface of two proteins in a complex.
Chemistry, Issue 81, Molecular Chaperones, mass spectrometers, Amino Acids, Peptides, Proteins, Enzymes, Coenzymes, Protein dynamics, conformational changes, allostery, protein folding, secondary structure, mass spectrometry
Structure and Coordination Determination of Peptide-metal Complexes Using 1D and 2D 1H NMR
Institutions: The Hebrew University of Jerusalem, The Hebrew University of Jerusalem.
Copper (I) binding by metallochaperone transport proteins prevents copper oxidation and release of the toxic ions that may participate in harmful redox reactions. The Cu (I) complex of the peptide model of a Cu (I) binding metallochaperone protein, which includes the sequence MTCSGCSRPG (underlined is conserved), was determined in solution under inert conditions by NMR spectroscopy.
NMR is a widely accepted technique for the determination of solution structures of proteins and peptides. Due to difficulty in crystallization to provide single crystals suitable for X-ray crystallography, the NMR technique is extremely valuable, especially as it provides information on the solution state rather than the solid state. Herein we describe all steps that are required for full three-dimensional structure determinations by NMR. The protocol includes sample preparation in an NMR tube, 1D and 2D data collection and processing, peak assignment and integration, molecular mechanics calculations, and structure analysis. Importantly, the analysis was first conducted without any preset metal-ligand bonds, to assure a reliable structure determination in an unbiased manner.
Chemistry, Issue 82, solution structure determination, NMR, peptide models, copper-binding proteins, copper complexes
Metabolic Labeling of Leucine Rich Repeat Kinases 1 and 2 with Radioactive Phosphate
Institutions: KU Leuven and Leuven Institute for Neuroscience and Disease (LIND).
Leucine rich repeat kinases 1 and 2 (LRRK1 and LRRK2) are paralogs which share a similar domain organization, including a serine-threonine kinase domain, a Ras of complex proteins domain (ROC), a C-terminal of ROC domain (COR), and leucine-rich and ankyrin-like repeats at the N-terminus. The precise cellular roles of LRRK1 and LRRK2 have yet to be elucidated, however LRRK1 has been implicated in tyrosine kinase receptor signaling1,2
, while LRRK2 is implicated in the pathogenesis of Parkinson's disease3,4
. In this report, we present a protocol to label the LRRK1 and LRRK2 proteins in cells with 32
P orthophosphate, thereby providing a means to measure the overall phosphorylation levels of these 2 proteins in cells. In brief, affinity tagged LRRK proteins are expressed in HEK293T cells which are exposed to medium containing 32
P-orthophosphate. The 32
P-orthophosphate is assimilated by the cells after only a few hours of incubation and all molecules in the cell containing phosphates are thereby radioactively labeled. Via the affinity tag (3xflag) the LRRK proteins are isolated from other cellular components by immunoprecipitation. Immunoprecipitates are then separated via SDS-PAGE, blotted to PVDF membranes and analysis of the incorporated phosphates is performed by autoradiography (32
P signal) and western detection (protein signal) of the proteins on the blots. The protocol can readily be adapted to monitor phosphorylation of any other protein that can be expressed in cells and isolated by immunoprecipitation.
Cellular Biology, Issue 79, biology (general), biochemistry, bioengineering (general), LRRK1, LRRK2, metabolic labeling, 32P orthophosphate, immunoprecipitation, autoradiography
In Vivo Modeling of the Morbid Human Genome using Danio rerio
Institutions: Duke University Medical Center, Duke University, Duke University Medical Center.
Here, we present methods for the development of assays to query potentially clinically significant nonsynonymous changes using in vivo
complementation in zebrafish. Zebrafish (Danio rerio
) are a useful animal system due to their experimental tractability; embryos are transparent to enable facile viewing, undergo rapid development ex vivo,
and can be genetically manipulated.1
These aspects have allowed for significant advances in the analysis of embryogenesis, molecular processes, and morphogenetic signaling. Taken together, the advantages of this vertebrate model make zebrafish highly amenable to modeling the developmental defects in pediatric disease, and in some cases, adult-onset disorders. Because the zebrafish genome is highly conserved with that of humans (~70% orthologous), it is possible to recapitulate human disease states in zebrafish. This is accomplished either through the injection of mutant human mRNA to induce dominant negative or gain of function alleles, or utilization of morpholino (MO) antisense oligonucleotides to suppress genes to mimic loss of function variants. Through complementation of MO-induced phenotypes with capped human mRNA, our approach enables the interpretation of the deleterious effect of mutations on human protein sequence based on the ability of mutant mRNA to rescue a measurable, physiologically relevant phenotype. Modeling of the human disease alleles occurs through microinjection of zebrafish embryos with MO and/or human mRNA at the 1-4 cell stage, and phenotyping up to seven days post fertilization (dpf). This general strategy can be extended to a wide range of disease phenotypes, as demonstrated in the following protocol. We present our established models for morphogenetic signaling, craniofacial, cardiac, vascular integrity, renal function, and skeletal muscle disorder phenotypes, as well as others.
Molecular Biology, Issue 78, Genetics, Biomedical Engineering, Medicine, Developmental Biology, Biochemistry, Anatomy, Physiology, Bioengineering, Genomics, Medical, zebrafish, in vivo, morpholino, human disease modeling, transcription, PCR, mRNA, DNA, Danio rerio, animal model
Determination of the Gas-phase Acidities of Oligopeptides
Institutions: University of the Pacific.
Amino acid residues located at different positions in folded proteins often exhibit different degrees of acidities. For example, a cysteine residue located at or near the N-terminus of a helix is often more acidic than that at or near the C-terminus 1-6
. Although extensive experimental studies on the acid-base properties of peptides have been carried out in the condensed phase, in particular in aqueous solutions 6-8
, the results are often complicated by solvent effects 7
. In fact, most of the active sites in proteins are located near the interior region where solvent effects have been minimized 9,10
. In order to understand intrinsic acid-base properties of peptides and proteins, it is important to perform the studies in a solvent-free environment.
We present a method to measure the acidities of oligopeptides in the gas-phase. We use a cysteine-containing oligopeptide, Ala3
CH), as the model compound. The measurements are based on the well-established extended Cooks kinetic method (Figure 1
. The experiments are carried out using a triple-quadrupole mass spectrometer interfaced with an electrospray ionization (ESI) ion source (Figure 2
). For each peptide sample, several reference acids are selected. The reference acids are structurally similar organic compounds with known gas-phase acidities. A solution of the mixture of the peptide and a reference acid is introduced into the mass spectrometer, and a gas-phase proton-bound anionic cluster of peptide-reference acid is formed. The proton-bound cluster is mass isolated and subsequently fragmented via collision-induced dissociation (CID) experiments. The resulting fragment ion abundances are analyzed using a relationship between the acidities and the cluster ion dissociation kinetics. The gas-phase acidity of the peptide is then obtained by linear regression of the thermo-kinetic plots 17,18
The method can be applied to a variety of molecular systems, including organic compounds, amino acids and their derivatives, oligonucleotides, and oligopeptides. By comparing the gas-phase acidities measured experimentally with those values calculated for different conformers, conformational effects on the acidities can be evaluated.
Chemistry, Issue 76, Biochemistry, Molecular Biology, Oligopeptide, gas-phase acidity, kinetic method, collision-induced dissociation, triple-quadrupole mass spectrometry, oligopeptides, peptides, mass spectrometry, MS
Thermodynamics of Membrane Protein Folding Measured by Fluorescence Spectroscopy
Institutions: University of California San Diego - UCSD.
Membrane protein folding is an emerging topic with both fundamental and health-related significance. The abundance of membrane proteins in cells underlies the need for comprehensive study of the folding of this ubiquitous family of proteins. Additionally, advances in our ability to characterize diseases associated with misfolded proteins have motivated significant experimental and theoretical efforts in the field of protein folding. Rapid progress in this important field is unfortunately hindered by the inherent challenges associated with membrane proteins and the complexity of the folding mechanism. Here, we outline an experimental procedure for measuring the thermodynamic property of the Gibbs free energy of unfolding in the absence of denaturant, ΔG°H2O
, for a representative integral membrane protein from E. coli
. This protocol focuses on the application of fluorescence spectroscopy to determine equilibrium populations of folded and unfolded states as a function of denaturant concentration. Experimental considerations for the preparation of synthetic lipid vesicles as well as key steps in the data analysis procedure are highlighted. This technique is versatile and may be pursued with different types of denaturant, including temperature and pH, as well as in various folding environments of lipids and micelles. The current protocol is one that can be generalized to any membrane or soluble protein that meets the set of criteria discussed below.
Bioengineering, Issue 50, tryptophan, peptides, Gibbs free energy, protein stability, vesicles
Assessment of Immunologically Relevant Dynamic Tertiary Structural Features of the HIV-1 V3 Loop Crown R2 Sequence by ab initio Folding
Institutions: School of Medicine, New York University.
The antigenic diversity of HIV-1 has long been an obstacle to vaccine design, and this variability is especially pronounced in the V3 loop of the virus' surface envelope glycoprotein. We previously proposed that the crown of the V3 loop, although dynamic and sequence variable, is constrained throughout the population of HIV-1 viruses to an immunologically relevant β-hairpin tertiary structure. Importantly, there are thousands of different V3 loop crown sequences in circulating HIV-1 viruses, making 3D structural characterization of trends across the diversity of viruses difficult or impossible by crystallography or NMR. Our previous successful studies with folding of the V3 crown1, 2
used the ab initio
accessible in the ICM-Pro molecular modeling software package (Molsoft LLC, La Jolla, CA) and suggested that the crown of the V3 loop, specifically from positions 10 to 22, benefits sufficiently from the flexibility and length of its flanking stems to behave to a large degree as if it were an unconstrained peptide freely folding in solution. As such, rapid ab initio
folding of just this portion of the V3 loop of any individual strain of the 60,000+ circulating HIV-1 strains can be informative. Here, we folded the V3 loop of the R2 strain to gain insight into the structural basis of its unique properties. R2 bears a rare V3 loop sequence thought to be responsible for the exquisite sensitivity of this strain to neutralization by patient sera and monoclonal antibodies4, 5
. The strain mediates CD4-independent infection and appears to elicit broadly neutralizing antibodies. We demonstrate how evaluation of the results of the folding can be informative for associating observed structures in the folding with the immunological activities observed for R2.
Infection, Issue 43, HIV-1, structure-activity relationships, ab initio simulations, antibody-mediated neutralization, vaccine design
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
Institutions: The Research Institute at Nationwide Children's Hospital.
Protein alignments are commonly used to evaluate the similarity of protein residues, and the derived consensus sequence used for identifying functional units (e.g.,
domains). Traditional consensus-building models fail to account for interpositional dependencies – functionally required covariation of residues that tend to appear simultaneously throughout evolution and across the phylogentic tree. These relationships can reveal important clues about the processes of protein folding, thermostability, and the formation of functional sites, which in turn can be used to inform the engineering of synthetic proteins. Unfortunately, these relationships essentially form sub-motifs which cannot be predicted by simple “majority rule” or even HMM-based consensus models, and the result can be a biologically invalid “consensus” which is not only never seen in nature but is less viable than any extant protein. We have developed a visual analytics tool, StickWRLD, which creates an interactive 3D representation of a protein alignment and clearly displays covarying residues. The user has the ability to pan and zoom, as well as dynamically change the statistical threshold underlying the identification of covariants. StickWRLD has previously been successfully used to identify functionally-required covarying residues in proteins such as Adenylate Kinase and in DNA sequences such as endonuclease target sites.
Chemistry, Issue 101, protein engineering, covariation, codependent residues, visualization