Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
23 Related JoVE Articles!
Photo-Induced Cross-Linking of Unmodified Proteins (PICUP) Applied to Amyloidogenic Peptides
Institutions: University of California, Los Angeles, University of California, Los Angeles, University of California, Los Angeles.
The assembly of amyloidogenic proteins into toxic oligomers is a seminal event in the pathogenesis of protein misfolding diseases, including Alzheimer's, Parkinson's, and Huntington's diseases, hereditary amyotrophic lateral sclerosis, and type 2 diabetes. Owing to the metastable nature of these protein assemblies, it is difficult to assess their oligomer size distribution quantitatively using classical methods, such as electrophoresis, chromatography, fluorescence, or dynamic light scattering. Oligomers of amyloidogenic proteins exist as metastable mixtures, in which the oligomers dissociate into monomers and associate into larger assemblies simultaneously. PICUP stabilizes oligomer populations by covalent cross-linking and when combined with fractionation methods, such as sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) or size-exclusion chromatography (SEC), PICUP provides snapshots of the oligomer size distributions that existed before cross-linking. Hence, PICUP enables visualization and quantitative analysis of metastable protein populations and can be used to monitor assembly and decipher relationships between sequence modifications and oligomerization1
. Mechanistically, PICUP involves photo-oxidation of Ru2+
in a tris(bipyridyl)Ru(II) complex (RuBpy) to Ru3+
by irradiation with visible light in the presence of an electron acceptor. Ru3+
is a strong one-electron oxidizer capable of abstracting an electron from a neighboring protein molecule, generating a protein radical1,2
. Radicals are unstable, highly-reactive species and therefore disappear rapidly through a variety of intra- and intermolecular reactions. A radical may utilize the high energy of an unpaired electron to react with another protein monomer forming a dimeric radical, which subsequently loses a hydrogen atom and forms a stable, covalently-linked dimer. The dimer may then react further through a similar mechanism with monomers or other dimers to form higher-order oligomers. Advantages of PICUP relative to other photo- or chemical cross-linking methods3,4
include short (≤1 s) exposure to non-destructive visible light, no need for pre facto
modification of the native sequence, and zero-length covalent cross-linking. In addition, PICUP enables cross-linking of proteins within wide pH and temperature ranges, including physiologic parameters. Here, we demonstrate application of PICUP to cross-linking of three amyloidogenic proteins the 40- and 42-residue amyloid β-protein variants (Aβ40 and Aβ42), and calcitonin, and a control protein, growth-hormone releasing factor (GRF).
Cross-linking, Issue 23, PICUP, amyloid β-protein, oligomer, amyloid, protein assembly
4D Imaging of Protein Aggregation in Live Cells
Institutions: Hebrew University of Jerusalem .
One of the key tasks of any living cell is maintaining the proper folding of newly synthesized proteins in the face of ever-changing environmental conditions and an intracellular environment that is tightly packed, sticky, and hazardous to protein stability1
. The ability to dynamically balance protein production, folding and degradation demands highly-specialized quality control machinery, whose absolute necessity is observed best when it malfunctions. Diseases such as ALS, Alzheimer's, Parkinson's, and certain forms of Cystic Fibrosis have a direct link to protein folding quality control components2
, and therefore future therapeutic development requires a basic understanding of underlying processes. Our experimental challenge is to understand how cells integrate damage signals and mount responses that are tailored to diverse circumstances.
The primary reason why protein misfolding represents an existential threat to the cell is the propensity of incorrectly folded proteins to aggregate, thus causing a global perturbation of the crowded and delicate intracellular folding environment1
. The folding health, or "proteostasis," of the cellular proteome is maintained, even under the duress of aging, stress and oxidative damage, by the coordinated action of different mechanistic units in an elaborate quality control system3,4
. A specialized machinery of molecular chaperones can bind non-native polypeptides and promote their folding into the native state1
, target them for degradation by the ubiquitin-proteasome system5
, or direct them to protective aggregation inclusions6-9
In eukaryotes, the cytosolic aggregation quality control load is partitioned between two compartments8-10
: the juxtanuclear quality control compartment (JUNQ) and the insoluble protein deposit (IPOD) (Figure 1
- model). Proteins that are ubiquitinated by the protein folding quality control machinery are delivered to the JUNQ, where they are processed for degradation by the proteasome. Misfolded proteins that are not ubiquitinated are diverted to the IPOD, where they are actively aggregated in a protective compartment.
Up until this point, the methodological paradigm of live-cell fluorescence microscopy has largely been to label proteins and track their locations in the cell at specific time-points and usually in two dimensions. As new technologies have begun to grant experimenters unprecedented access to the submicron scale in living cells, the dynamic architecture of the cytosol has come into view as a challenging new frontier for experimental characterization. We present a method for rapidly monitoring the 3D spatial distributions of multiple fluorescently labeled proteins in the yeast cytosol over time. 3D timelapse (4D imaging) is not merely a technical challenge; rather, it also facilitates a dramatic shift in the conceptual framework used to analyze cellular structure.
We utilize a cytosolic folding sensor protein in live yeast to visualize distinct fates for misfolded proteins in cellular aggregation quality control, using rapid 4D fluorescent imaging. The temperature sensitive mutant of the Ubc9 protein10-12
) is extremely effective both as a sensor of cellular proteostasis, and a physiological model for tracking aggregation quality control. As with most ts proteins, Ubc9ts
is fully folded and functional at permissive temperatures due to active cellular chaperones. Above 30 °C, or when the cell faces misfolding stress, Ubc9ts
misfolds and follows the fate of a native globular protein that has been misfolded due to mutation, heat denaturation, or oxidative damage. By fusing it to GFP or other fluorophores, it can be tracked in 3D as it forms Stress Foci, or is directed to JUNQ or IPOD.
Cellular Biology, Issue 74, Molecular Biology, Genetics, Proteins, Aggregation quality control, protein folding quality control, GFP, JUNQ (juxtanuclear quality control compartment), IPOD (insoluble protein deposit), proteostasis sensor, 4D live cell imaging, live cells, laser, cell biology, protein folding, Ubc9ts, yeast, assay, cell, imaging
Assessment of Immunologically Relevant Dynamic Tertiary Structural Features of the HIV-1 V3 Loop Crown R2 Sequence by ab initio Folding
Institutions: School of Medicine, New York University.
The antigenic diversity of HIV-1 has long been an obstacle to vaccine design, and this variability is especially pronounced in the V3 loop of the virus' surface envelope glycoprotein. We previously proposed that the crown of the V3 loop, although dynamic and sequence variable, is constrained throughout the population of HIV-1 viruses to an immunologically relevant β-hairpin tertiary structure. Importantly, there are thousands of different V3 loop crown sequences in circulating HIV-1 viruses, making 3D structural characterization of trends across the diversity of viruses difficult or impossible by crystallography or NMR. Our previous successful studies with folding of the V3 crown1, 2
used the ab initio
accessible in the ICM-Pro molecular modeling software package (Molsoft LLC, La Jolla, CA) and suggested that the crown of the V3 loop, specifically from positions 10 to 22, benefits sufficiently from the flexibility and length of its flanking stems to behave to a large degree as if it were an unconstrained peptide freely folding in solution. As such, rapid ab initio
folding of just this portion of the V3 loop of any individual strain of the 60,000+ circulating HIV-1 strains can be informative. Here, we folded the V3 loop of the R2 strain to gain insight into the structural basis of its unique properties. R2 bears a rare V3 loop sequence thought to be responsible for the exquisite sensitivity of this strain to neutralization by patient sera and monoclonal antibodies4, 5
. The strain mediates CD4-independent infection and appears to elicit broadly neutralizing antibodies. We demonstrate how evaluation of the results of the folding can be informative for associating observed structures in the folding with the immunological activities observed for R2.
Infection, Issue 43, HIV-1, structure-activity relationships, ab initio simulations, antibody-mediated neutralization, vaccine design
ReAsH/FlAsH Labeling and Image Analysis of Tetracysteine Sensor Proteins in Cells
Institutions: Bio21 Molecular Science and Biotechnology Institute.
Fluorescent proteins and dyes are essential tools for the study of protein trafficking, localization and function in cells. While fluorescent
proteins such as green fluorescence protein (GFP) have been extensively used as fusion partners to proteins to track the properties of a protein of interest1
developments with smaller tags enable new functionalities of proteins to be examined in cells such as conformational change and protein-association 2, 3
. One small
tag system involves a tetracysteine motif (CCXXCC) genetically inserted into a target protein, which binds to biarsenical dyes, ReAsH (red fluorescent) and FlAsH
(green fluorescent), with high specificity even in live cells 2
. The TC/biarsenical dye system offers far less steric constraints to the host protein than
fluorescent proteins which has enabled several new approaches to measure conformational change and protein-protein interactions 4-7
. We recently developed
a novel application of TC tags as sensors of oligomerization in cells expressing mutant huntingtin, which when mutated aggregates in neurons in Huntington
. Huntingtin was tagged with two fluorescent dyes, one a fluorescent protein to track protein location, and the second a TC tag which only
binds biarsenical dyes in monomers. Hence, changes in colocalization between protein and biarsenical dye reactivity enabled submicroscopic oligomer
content to be spatially mapped within cells. Here, we describe how to label TC-tagged proteins fused to a fluorescent protein (Cherry, GFP or CFP)
with FlAsH or ReAsH in live mammalian cells and how to quantify the two color fluorescence (Cherry/FlAsH, CFP/FlAsH or GFP/ReAsH combinations).
Cell Biology, Issue 54, tetracysteine, TC, ReAsH, FlAsH, biarsenical dyes, fluorescence, imaging, confocal microscopy, ImageJ, GFP
Isolating Potentiated Hsp104 Variants Using Yeast Proteinopathy Models
Institutions: Perelman School of Medicine at the University of Pennsylvania.
Many protein-misfolding disorders can be modeled in the budding yeast Saccharomyces cerevisiae
. Proteins such as TDP-43 and FUS, implicated in amyotrophic lateral sclerosis, and α-synuclein, implicated in Parkinson’s disease, are toxic and form cytoplasmic aggregates in yeast. These features recapitulate protein pathologies observed in patients with these disorders. Thus, yeast are an ideal platform for isolating toxicity suppressors from libraries of protein variants. We are interested in applying protein disaggregases to eliminate misfolded toxic protein conformers. Specifically, we are engineering Hsp104, a hexameric AAA+ protein from yeast that is uniquely capable of solubilizing both disordered aggregates and amyloid and returning the proteins to their native conformations. While Hsp104 is highly conserved in eukaryotes and eubacteria, it has no known metazoan homologue. Hsp104 has only limited ability to eliminate disordered aggregates and amyloid fibers implicated in human disease. Thus, we aim to engineer Hsp104 variants to reverse the protein misfolding implicated in neurodegenerative disorders. We have developed methods to screen large libraries of Hsp104 variants for suppression of proteotoxicity in yeast. As yeast are prone to spontaneous nonspecific suppression of toxicity, a two-step screening process has been developed to eliminate false positives. Using these methods, we have identified a series of potentiated Hsp104 variants that potently suppress the toxicity and aggregation of TDP-43, FUS, and α-synuclein. Here, we describe this optimized protocol, which could be adapted to screen libraries constructed using any protein backbone for suppression of toxicity of any protein that is toxic in yeast.
Microbiology, Issue 93, Protein-misfolding disorders, yeast proteinopathy models, Hsp104, proteotoxicity, amyloid, disaggregation
Synthesis of an Intein-mediated Artificial Protein Hydrogel
Institutions: Texas A&M University, College Station, Texas A&M University, College Station.
We present the synthesis of a highly stable protein hydrogel mediated by a split-intein-catalyzed protein trans
-splicing reaction. The building blocks of this hydrogel are two protein block-copolymers each containing a subunit of a trimeric protein that serves as a crosslinker and one half of a split intein. A highly hydrophilic random coil is inserted into one of the block-copolymers for water retention. Mixing of the two protein block copolymers triggers an intein trans
-splicing reaction, yielding a polypeptide unit with crosslinkers at either end that rapidly self-assembles into a hydrogel. This hydrogel is very stable under both acidic and basic conditions, at temperatures up to 50 °C, and in organic solvents. The hydrogel rapidly reforms after shear-induced rupture. Incorporation of a "docking station peptide" into the hydrogel building block enables convenient incorporation of "docking protein"-tagged target proteins. The hydrogel is compatible with tissue culture growth media, supports the diffusion of 20 kDa molecules, and enables the immobilization of bioactive globular proteins. The application of the intein-mediated protein hydrogel as an organic-solvent-compatible biocatalyst was demonstrated by encapsulating the horseradish peroxidase enzyme and corroborating its activity.
Bioengineering, Issue 83, split-intein, self-assembly, shear-thinning, enzyme, immobilization, organic synthesis
The ChroP Approach Combines ChIP and Mass Spectrometry to Dissect Locus-specific Proteomic Landscapes of Chromatin
Institutions: European Institute of Oncology.
Chromatin is a highly dynamic nucleoprotein complex made of DNA and proteins that controls various DNA-dependent processes. Chromatin structure and function at specific regions is regulated by the local enrichment of histone post-translational modifications (hPTMs) and variants, chromatin-binding proteins, including transcription factors, and DNA methylation. The proteomic characterization of chromatin composition at distinct functional regions has been so far hampered by the lack of efficient protocols to enrich such domains at the appropriate purity and amount for the subsequent in-depth analysis by Mass Spectrometry (MS). We describe here a newly designed chromatin proteomics strategy, named ChroP (Chromatin Proteomics
), whereby a preparative chromatin immunoprecipitation is used to isolate distinct chromatin regions whose features, in terms of hPTMs, variants and co-associated non-histonic proteins, are analyzed by MS. We illustrate here the setting up of ChroP for the enrichment and analysis of transcriptionally silent heterochromatic regions, marked by the presence of tri-methylation of lysine 9 on histone H3. The results achieved demonstrate the potential of ChroP
in thoroughly characterizing the heterochromatin proteome and prove it as a powerful analytical strategy for understanding how the distinct protein determinants of chromatin interact and synergize to establish locus-specific structural and functional configurations.
Biochemistry, Issue 86, chromatin, histone post-translational modifications (hPTMs), epigenetics, mass spectrometry, proteomics, SILAC, chromatin immunoprecipitation , histone variants, chromatome, hPTMs cross-talks
Specificity Analysis of Protein Lysine Methyltransferases Using SPOT Peptide Arrays
Institutions: Stuttgart University.
Lysine methylation is an emerging post-translation modification and it has been identified on several histone and non-histone proteins, where it plays crucial roles in cell development and many diseases. Approximately 5,000 lysine methylation sites were identified on different proteins, which are set by few dozens of protein lysine methyltransferases. This suggests that each PKMT methylates multiple proteins, however till now only one or two substrates have been identified for several of these enzymes. To approach this problem, we have introduced peptide array based substrate specificity analyses of PKMTs. Peptide arrays are powerful tools to characterize the specificity of PKMTs because methylation of several substrates with different sequences can be tested on one array. We synthesized peptide arrays on cellulose membrane using an Intavis SPOT synthesizer and analyzed the specificity of various PKMTs. Based on the results, for several of these enzymes, novel substrates could be identified. For example, for NSD1 by employing peptide arrays, we showed that it methylates K44 of H4 instead of the reported H4K20 and in addition H1.5K168 is the highly preferred substrate over the previously known H3K36. Hence, peptide arrays are powerful tools to biochemically characterize the PKMTs.
Biochemistry, Issue 93, Peptide arrays, solid phase peptide synthesis, SPOT synthesis, protein lysine methyltransferases, substrate specificity profile analysis, lysine methylation
Microwave-assisted Functionalization of Poly(ethylene glycol) and On-resin Peptides for Use in Chain Polymerizations and Hydrogel Formation
Institutions: University of Rochester, University of Rochester, University of Rochester Medical Center.
One of the main benefits to using poly(ethylene glycol) (PEG) macromers in hydrogel formation is synthetic versatility. The ability to draw from a large variety of PEG molecular weights and configurations (arm number, arm length, and branching pattern) affords researchers tight control over resulting hydrogel structures and properties, including Young’s modulus and mesh size. This video will illustrate a rapid, efficient, solvent-free, microwave-assisted method to methacrylate PEG precursors into poly(ethylene glycol) dimethacrylate (PEGDM). This synthetic method provides much-needed starting materials for applications in drug delivery and regenerative medicine. The demonstrated method is superior to traditional methacrylation methods as it is significantly faster and simpler, as well as more economical and environmentally friendly, using smaller amounts of reagents and solvents. We will also demonstrate an adaptation of this technique for on-resin methacrylamide functionalization of peptides. This on-resin method allows the N-terminus of peptides to be functionalized with methacrylamide groups prior to deprotection and cleavage from resin. This allows for selective addition of methacrylamide groups to the N-termini of the peptides while amino acids with reactive side groups (e.g.
primary amine of lysine, primary alcohol of serine, secondary alcohols of threonine, and phenol of tyrosine) remain protected, preventing functionalization at multiple sites. This article will detail common analytical methods (proton Nuclear Magnetic Resonance spectroscopy (;
H-NMR) and Matrix Assisted Laser Desorption Ionization Time of Flight mass spectrometry (MALDI-ToF)) to assess the efficiency of the functionalizations. Common pitfalls and suggested troubleshooting methods will be addressed, as will modifications of the technique which can be used to further tune macromer functionality and resulting hydrogel physical and chemical properties. Use of synthesized products for the formation of hydrogels for drug delivery and cell-material interaction studies will be demonstrated, with particular attention paid to modifying hydrogel composition to affect mesh size, controlling hydrogel stiffness and drug release.
Chemistry, Issue 80, Poly(ethylene glycol), peptides, polymerization, polymers, methacrylation, peptide functionalization, 1H-NMR, MALDI-ToF, hydrogels, macromer synthesis
Analyzing Protein Dynamics Using Hydrogen Exchange Mass Spectrometry
Institutions: University of Heidelberg.
All cellular processes depend on the functionality of proteins. Although the functionality of a given protein is the direct consequence of its unique amino acid sequence, it is only realized by the folding of the polypeptide chain into a single defined three-dimensional arrangement or more commonly into an ensemble of interconverting conformations. Investigating the connection between protein conformation and its function is therefore essential for a complete understanding of how proteins are able to fulfill their great variety of tasks. One possibility to study conformational changes a protein undergoes while progressing through its functional cycle is hydrogen-1
H-exchange in combination with high-resolution mass spectrometry (HX-MS). HX-MS is a versatile and robust method that adds a new dimension to structural information obtained by e.g.
crystallography. It is used to study protein folding and unfolding, binding of small molecule ligands, protein-protein interactions, conformational changes linked to enzyme catalysis, and allostery. In addition, HX-MS is often used when the amount of protein is very limited or crystallization of the protein is not feasible. Here we provide a general protocol for studying protein dynamics with HX-MS and describe as an example how to reveal the interaction interface of two proteins in a complex.
Chemistry, Issue 81, Molecular Chaperones, mass spectrometers, Amino Acids, Peptides, Proteins, Enzymes, Coenzymes, Protein dynamics, conformational changes, allostery, protein folding, secondary structure, mass spectrometry
Growth Assays to Assess Polyglutamine Toxicity in Yeast
Institutions: Boston Biomedical Research Institute.
Protein misfolding is associated with many human diseases, particularly neurodegenerative diseases, such as Alzheimer’s disease, Parkinson's disease, and Huntington's disease 1
. Huntington's disease (HD) is caused by the abnormal expansion of a polyglutamine (polyQ) region within the protein huntingtin. The polyQ-expanded huntingtin protein attains an aberrant conformation (i.e. it misfolds) and causes cellular toxicity 2
. At least eight further neurodegenerative diseases are caused by polyQ-expansions, including the Spinocerebellar Ataxias and Kennedy’s disease 3
The model organism yeast has facilitated significant insights into the cellular and molecular basis of polyQ-toxicity, including the impact of intra- and inter-molecular factors of polyQ-toxicity, and the identification of cellular pathways that are impaired in cells expressing polyQ-expansion proteins 3-8
. Importantly, many aspects of polyQ-toxicity that were found in yeast were reproduced in other experimental systems and to some extent in samples from HD patients, thus demonstrating the significance of the yeast model for the discovery of basic mechanisms underpinning polyQ-toxicity.
A direct and relatively simple way to determine polyQ-toxicity in yeast is to measure growth defects of yeast cells expressing polyQ-expansion proteins. This manuscript describes three complementary experimental approaches to determine polyQ-toxicity in yeast by measuring the growth of yeast cells expressing polyQ-expansion proteins. The first two experimental approaches monitor yeast growth on plates, the third approach monitors the growth of liquid yeast cultures using the BioscreenC instrument.
Furthermore, this manuscript describes experimental difficulties that can occur when handling yeast polyQ models and outlines strategies that will help to avoid or minimize these difficulties. The protocols described here can be used to identify and to characterize genetic pathways and small molecules that modulate polyQ-toxicity. Moreover, the described assays may serve as templates for accurate analyses of the toxicity caused by other disease-associated misfolded proteins in yeast models.
Molecular Biology, Issue 61, Protein misfolding, yeast, polyglutamine diseases, growth assays
Screening for Amyloid Aggregation by Semi-Denaturing Detergent-Agarose Gel Electrophoresis
Institutions: Whitehead Institute for Biomedical Research, MIT - Massachusetts Institute of Technology, Howard Hughes Medical Institute.
Amyloid aggregation is associated with numerous protein misfolding pathologies and underlies the infectious properties of prions, which are conformationally self-templating proteins that are thought to have beneficial roles in lower organisms. Amyloids have been notoriously difficult to study due to their insolubility and structural heterogeneity. However, resolution of amyloid polymers based on size and detergent insolubility has been made possible by Semi-Denaturing Detergent-Agarose Gel Electrophoresis (SDD-AGE). This technique is finding widespread use for the detection and characterization of amyloid conformational variants. Here, we demonstrate an adaptation of this technique that facilitates its use in large-scale applications, such as screens for novel prions and other amyloidogenic proteins. The new SDD-AGE method uses capillary transfer for greater reliability and ease of use, and allows any sized gel to be accomodated. Thus, a large number of samples, prepared from cells or purified proteins, can be processed simultaneously for the presence of SDS-insoluble conformers of tagged proteins.
Basic Protocols, Issue 17, biochemistry, SDD-AGE, amyloid, prion, aggregate
Rapid Generation of Amyloid from Native Proteins In vitro
Institutions: The University of Texas MD Anderson Cancer Center.
Proteins carry out crucial tasks in organisms by exerting functions elicited from their specific three dimensional folds. Although the native structures of polypeptides fulfill many purposes, it is now recognized that most proteins can adopt an alternative assembly of beta-sheet rich amyloid. Insoluble amyloid fibrils are initially associated with multiple human ailments, but they are increasingly shown as functional players participating in various important cellular processes. In addition, amyloid deposited in patient tissues contains nonproteinaceous
components, such as nucleic acids and glycosaminoglycans (GAGs). These cofactors can facilitate the formation of amyloid, resulting in the generation of different types of insoluble precipitates. By taking advantage of our understanding how proteins misfold via an intermediate stage of soluble amyloid precursor, we have devised a method to convert native proteins to amyloid fibrils in vitro
. This approach allows one to prepare amyloid in large quantities, examine the properties of amyloid generated from specific proteins, and evaluate the structural changes accompanying the conversion.
Biochemistry, Issue 82, amyloid, soluble protein oligomer, amyloid precursor, protein misfolding, amyloid fibril, protein aggregate
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
A High Throughput MHC II Binding Assay for Quantitative Analysis of Peptide Epitopes
Institutions: Dartmouth College, University of Rhode Island, Dartmouth College.
Biochemical assays with recombinant human MHC II molecules can provide rapid, quantitative insights into immunogenic epitope identification, deletion, or design1,2
. Here, a peptide-MHC II binding assay is scaled to 384-well format. The scaled down protocol reduces reagent costs by 75% and is higher throughput than previously described 96-well protocols1,3-5
. Specifically, the experimental design permits robust and reproducible analysis of up to 15 peptides against one MHC II allele per 384-well ELISA plate. Using a single liquid handling robot, this method allows one researcher to analyze approximately ninety test peptides in triplicate over a range of eight concentrations and four MHC II allele types in less than 48 hr. Others working in the fields of protein deimmunization or vaccine design and development may find the protocol to be useful in facilitating their own work. In particular, the step-by-step instructions and the visual format of JoVE should allow other users to quickly and easily establish this methodology in their own labs.
Biochemistry, Issue 85, Immunoassay, Protein Immunogenicity, MHC II, T cell epitope, High Throughput Screen, Deimmunization, Vaccine Design
Using Caenorhabditis elegans as a Model System to Study Protein Homeostasis in a Multicellular Organism
Institutions: Ben-Gurion University of the Negev.
The folding and assembly of proteins is essential for protein function, the long-term health of the cell, and longevity of the organism. Historically, the function and regulation of protein folding was studied in vitro
, in isolated tissue culture cells and in unicellular organisms. Recent studies have uncovered links between protein homeostasis (proteostasis), metabolism, development, aging, and temperature-sensing. These findings have led to the development of new tools for monitoring protein folding in the model metazoan organism Caenorhabditis elegans
. In our laboratory, we combine behavioral assays, imaging and biochemical approaches using temperature-sensitive or naturally occurring metastable proteins as sensors of the folding environment to monitor protein misfolding. Behavioral assays that are associated with the misfolding of a specific protein provide a simple and powerful readout for protein folding, allowing for the fast screening of genes and conditions that modulate folding. Likewise, such misfolding can be associated with protein mislocalization in the cell. Monitoring protein localization can, therefore, highlight changes in cellular folding capacity occurring in different tissues, at various stages of development and in the face of changing conditions. Finally, using biochemical tools ex vivo
, we can directly monitor protein stability and conformation. Thus, by combining behavioral assays, imaging and biochemical techniques, we are able to monitor protein misfolding at the resolution of the organism, the cell, and the protein, respectively.
Biochemistry, Issue 82, aging, Caenorhabditis elegans, heat shock response, neurodegenerative diseases, protein folding homeostasis, proteostasis, stress, temperature-sensitive
Expression, Isolation, and Purification of Soluble and Insoluble Biotinylated Proteins for Nerve Tissue Regeneration
Institutions: University of Akron.
Recombinant protein engineering has utilized Escherichia coli (E. coli)
expression systems for nearly 4 decades, and today E. coli
is still the most widely used host organism. The flexibility of the system allows for the addition of moieties such as a biotin tag (for streptavidin interactions) and larger functional proteins like green fluorescent protein or cherry red protein. Also, the integration of unnatural amino acids like metal ion chelators, uniquely reactive functional groups, spectroscopic probes, and molecules imparting post-translational modifications has enabled better manipulation of protein properties and functionalities. As a result this technique creates customizable fusion proteins that offer significant utility for various fields of research. More specifically, the biotinylatable protein sequence has been incorporated into many target proteins because of the high affinity interaction between biotin with avidin and streptavidin. This addition has aided in enhancing detection and purification of tagged proteins as well as opening the way for secondary applications such as cell sorting. Thus, biotin-labeled molecules show an increasing and widespread influence in bioindustrial and biomedical fields. For the purpose of our research we have engineered recombinant biotinylated fusion proteins containing nerve growth factor (NGF) and semaphorin3A (Sema3A) functional regions. We have reported previously how these biotinylated fusion proteins, along with other active protein sequences, can be tethered to biomaterials for tissue engineering and regenerative purposes. This protocol outlines the basics of engineering biotinylatable proteins at the milligram scale, utilizing a T7 lac
inducible vector and E. coli
expression hosts, starting from transformation to scale-up and purification.
Bioengineering, Issue 83, protein engineering, recombinant protein production, AviTag, BirA, biotinylation, pET vector system, E. coli, inclusion bodies, Ni-NTA, size exclusion chromatography
High Throughput Quantitative Expression Screening and Purification Applied to Recombinant Disulfide-rich Venom Proteins Produced in E. coli
Institutions: Aix-Marseille Université, Commissariat à l'énergie atomique et aux énergies alternatives (CEA) Saclay, France.
Escherichia coli (E. coli)
is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli
cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
Bioengineering, Issue 89, E. coli, expression, recombinant, high throughput (HTP), purification, auto-induction, immobilized metal affinity chromatography (IMAC), tobacco etch virus protease (TEV) cleavage, disulfide bond isomerase C (DsbC) fusion, disulfide bonds, animal venom proteins/peptides
Selection of Aptamers for Amyloid β-Protein, the Causative Agent of Alzheimer's Disease
Institutions: David Geffen School of Medicine, University of California, Los Angeles, University of California, Los Angeles.
Alzheimer's disease (AD) is a progressive, age-dependent, neurodegenerative disorder with an insidious course that renders its presymptomatic diagnosis difficult1
. Definite AD diagnosis is achieved only postmortem, thus establishing presymptomatic, early diagnosis of AD is crucial for developing and administering effective therapies2,3
Amyloid β-protein (Aβ) is central to AD pathogenesis. Soluble, oligomeric Aβ assemblies are believed to affect neurotoxicity underlying synaptic dysfunction and neuron loss in AD4,5
. Various forms of soluble Aβ assemblies have been described, however, their interrelationships and relevance to AD etiology and pathogenesis are complex and not well understood6
. Specific molecular recognition tools may unravel the relationships amongst Aβ assemblies and facilitate detection and characterization of these assemblies early in the disease course before symptoms emerge. Molecular recognition commonly relies on antibodies. However, an alternative class of molecular recognition tools, aptamers, offers important advantages relative to antibodies7,8
. Aptamers are oligonucleotides generated by in-vitro
selection: systematic evolution of ligands by exponential enrichment (SELEX)9,10
. SELEX is an iterative process that, similar to Darwinian evolution, allows selection, amplification, enrichment, and perpetuation of a property, e.g., avid, specific, ligand binding (aptamers) or catalytic activity (ribozymes and DNAzymes).
Despite emergence of aptamers as tools in modern biotechnology and medicine11
, they have been underutilized in the amyloid field. Few RNA or ssDNA aptamers have been selected against various forms of prion proteins (PrP)12-16
. An RNA aptamer generated against recombinant bovine PrP was shown to recognize bovine PrP-β17
, a soluble, oligomeric, β-sheet-rich conformational variant of full-length PrP that forms amyloid fibrils18
. Aptamers generated using monomeric and several forms of fibrillar β2
m) were found to bind fibrils of certain other amyloidogenic proteins besides β2
. Ylera et al
. described RNA aptamers selected against immobilized monomeric Aβ4020
. Unexpectedly, these aptamers bound fibrillar Aβ40. Altogether, these data raise several important questions. Why did aptamers selected against monomeric proteins recognize their polymeric forms? Could aptamers against monomeric and/or oligomeric forms of amyloidogenic proteins be obtained? To address these questions, we attempted to select aptamers for covalently-stabilized oligomeric Aβ4021
generated using photo-induced cross-linking of unmodified proteins (PICUP)22,23
. Similar to previous findings17,19,20
, these aptamers reacted with fibrils of Aβ and several other amyloidogenic proteins likely recognizing a potentially common amyloid structural aptatope21
. Here, we present the SELEX methodology used in production of these aptamers21
Neuroscience, Issue 39, Cellular Biology, Aptamer, RNA, amyloid β-protein, oligomer, amyloid fibrils, protein assembly
Test Samples for Optimizing STORM Super-Resolution Microscopy
Institutions: National Physical Laboratory.
STORM is a recently developed super-resolution microscopy technique with up to 10 times better resolution than standard fluorescence microscopy techniques. However, as the image is acquired in a very different way than normal, by building up an image molecule-by-molecule, there are some significant challenges for users in trying to optimize their image acquisition. In order to aid this process and gain more insight into how STORM works we present the preparation of 3 test samples and the methodology of acquiring and processing STORM super-resolution images with typical resolutions of between 30-50 nm. By combining the test samples with the use of the freely available rainSTORM processing software it is possible to obtain a great deal of information about image quality and resolution. Using these metrics it is then possible to optimize the imaging procedure from the optics, to sample preparation, dye choice, buffer conditions, and image acquisition settings. We also show examples of some common problems that result in poor image quality, such as lateral drift, where the sample moves during image acquisition and density related problems resulting in the 'mislocalization' phenomenon.
Molecular Biology, Issue 79, Genetics, Bioengineering, Biomedical Engineering, Biophysics, Basic Protocols, HeLa Cells, Actin Cytoskeleton, Coated Vesicles, Receptor, Epidermal Growth Factor, Actins, Fluorescence, Endocytosis, Microscopy, STORM, super-resolution microscopy, nanoscopy, cell biology, fluorescence microscopy, test samples, resolution, actin filaments, fiducial markers, epidermal growth factor, cell, imaging
Prediction of HIV-1 Coreceptor Usage (Tropism) by Sequence Analysis using a Genotypic Approach
Institutions: University of Cologne, Max Planck Institute for Informatics, Institute for Immune genetics, University of Duesseldorf, University of Essen, University of Cologne, Augustinerinnen Hospital.
Maraviroc (MVC) is the first licensed antiretroviral drug from the class of coreceptor antagonists. It binds to the host coreceptor CCR5, which is used by the majority of HIV strains in order to infect the human immune cells (Fig. 1). Other HIV isolates use a different coreceptor, the CXCR4. Which receptor is used, is determined in the virus by the Env protein (Fig. 2). Depending on the coreceptor used, the viruses are classified as R5 or X4, respectively. MVC binds to the CCR5 receptor inhibiting the entry of R5 viruses into the target cell. During the course of disease, X4 viruses may emerge and outgrow the R5 viruses. Determination of coreceptor usage (also called tropism) is therefore mandatory prior to administration of MVC, as demanded by EMA and FDA.
The studies for MVC efficiency MOTIVATE, MERIT and 1029 have been performed with the Trofile assay from Monogram, San Francisco, U.S.A. This is a high quality assay based on sophisticated recombinant tests. The acceptance for this test for daily routine is rather low outside of the U.S.A., since the European physicians rather tend to work with decentralized expert laboratories, which also provide concomitant resistance testing. These laboratories have undergone several quality assurance evaluations, the last one being presented in 20111
For several years now, we have performed tropism determinations based on sequence analysis from the HIV env-V3 gene region (V3)2
. This region carries enough information to perform a reliable prediction.
The genotypic determination of coreceptor usage presents advantages such as: shorter turnover time (equivalent to resistance testing), lower costs, possibility to adapt the results to the patients' needs and possibility of analysing clinical samples with very low or even undetectable viral load (VL), particularly since the number of samples analysed with VL<1000 copies/μl roughly increased in the last years (Fig. 3).
The main steps for tropism testing (Fig. 4) demonstrated in this video:
1. Collection of a blood sample
2. Isolation of the HIV RNA from the plasma and/or HIV proviral DNA from blood mononuclear cells
3. Amplification of the env
4. Amplification of the V3 region
5. Sequence reaction of the V3 amplicon
6. Purification of the sequencing samples
7. Sequencing the purified samples
8. Sequence editing
9. Sequencing data interpretation and tropism prediction
Immunology, Issue 58, HIV-1, coreceptor, coreceptor antagonist, prediction of coreceptor usage, tropism, R5, X4, maraviroc, MVC
Interview: Protein Folding and Studies of Neurodegenerative Diseases
Institutions: MIT - Massachusetts Institute of Technology.
In this interview, Dr. Lindquist describes relationships between protein folding, prion diseases and neurodegenerative disorders. The problem of the protein folding is at the core of the modern biology. In addition to their traditional biochemical functions, proteins can mediate transfer of biological information and therefore can be considered a genetic material. This recently discovered function of proteins has important implications for studies of human disorders. Dr. Lindquist also describes current experimental approaches to investigate the mechanism of neurodegenerative diseases based on genetic studies in model organisms.
Neuroscience, issue 17, protein folding, brain, neuron, prion, neurodegenerative disease, yeast, screen, Translational Research