Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
23 Related JoVE Articles!
ampliPHOX Colorimetric Detection on a DNA Microarray for Influenza
DNA microarrays have emerged as a powerful tool for pathogen detection.1-5
For instance, many examples of the ability to type and subtype influenza virus have been demonstrated.6-11
The identification and subtyping of influenza on DNA microarrays has applications in both public health and the clinic for early detection, rapid intervention, and minimizing the impact of an influenza pandemic. Traditional fluorescence is currently the most commonly used microarray detection method. However, as microarray technology progresses towards clinical use,1
replacing expensive instrumentation with low cost detection technology exhibiting similar performance characteristics to fluorescence will make microarray assays more attractive and cost-effective.
The ampliPHOX colorimetric detection technology is intended for research applications, and has a limit of detection within one order of magnitude of traditional fluorescence11
, with a main advantage being an approximate ten-fold lower instrument cost compared to the confocal microarray scanners required for fluorescence microarray detection. Another advantage is the compact size of the instrument which allows for portability and flexibility, unlike traditional fluorescence instruments. Because the polymerization technology is not as inherently linear as fluorescence detection, however, it is best suited for lower density microarray applications in which a yes/no answer for the presence of a certain sequence is desired, such as for pathogen detection arrays. Currently the maximum spot density compatible with ampliPHOX detection is ˜1800 spots/array. Because of the spot density limitations, higher density microarrays are not suitable for ampliPHOX detection.
Here, we present ampliPHOX colorimetric detection technology as a method of signal amplification on a low density microarray developed for the detection and characterization of influenza viruses (FluChip). Although this protocol uses the FluChip (a DNA microarray) as one specific application of ampliPHOX detection, any microarray incorporating biotinylated target can be labeled and detected in a similar manner. The microarray design and biotinylation of the target to be captured are the responsibility of the user. Once the biotinylated target has been captured on the array, ampliPHOX detection can be performed by first tagging the array with a streptavidin-label conjugate (ampliTAG). Upon light exposure using the ampliPHOX Reader instrument, polymerization of a monomer solution (ampliPHY) occurs only in regions containing ampliTAG-labeled targets. The polymer formed can be subsequently stained with a non-toxic solution to improve visual contrast, followed by imaging and analysis using a simple software package (ampliVIEW). The entire FluChip assay from un-extracted sample to result can be performed in about 6 hours, and the ampliPHOX detection steps described above can be completed in about 30 min.
Immunology, Issue 52, microarrays, colorimetric detection, ampliPHOX, diagnostic, low-density, pathogen detection, influenza
Rapid Generation of Amyloid from Native Proteins In vitro
Institutions: The University of Texas MD Anderson Cancer Center.
Proteins carry out crucial tasks in organisms by exerting functions elicited from their specific three dimensional folds. Although the native structures of polypeptides fulfill many purposes, it is now recognized that most proteins can adopt an alternative assembly of beta-sheet rich amyloid. Insoluble amyloid fibrils are initially associated with multiple human ailments, but they are increasingly shown as functional players participating in various important cellular processes. In addition, amyloid deposited in patient tissues contains nonproteinaceous
components, such as nucleic acids and glycosaminoglycans (GAGs). These cofactors can facilitate the formation of amyloid, resulting in the generation of different types of insoluble precipitates. By taking advantage of our understanding how proteins misfold via an intermediate stage of soluble amyloid precursor, we have devised a method to convert native proteins to amyloid fibrils in vitro
. This approach allows one to prepare amyloid in large quantities, examine the properties of amyloid generated from specific proteins, and evaluate the structural changes accompanying the conversion.
Biochemistry, Issue 82, amyloid, soluble protein oligomer, amyloid precursor, protein misfolding, amyloid fibril, protein aggregate
4D Imaging of Protein Aggregation in Live Cells
Institutions: Hebrew University of Jerusalem .
One of the key tasks of any living cell is maintaining the proper folding of newly synthesized proteins in the face of ever-changing environmental conditions and an intracellular environment that is tightly packed, sticky, and hazardous to protein stability1
. The ability to dynamically balance protein production, folding and degradation demands highly-specialized quality control machinery, whose absolute necessity is observed best when it malfunctions. Diseases such as ALS, Alzheimer's, Parkinson's, and certain forms of Cystic Fibrosis have a direct link to protein folding quality control components2
, and therefore future therapeutic development requires a basic understanding of underlying processes. Our experimental challenge is to understand how cells integrate damage signals and mount responses that are tailored to diverse circumstances.
The primary reason why protein misfolding represents an existential threat to the cell is the propensity of incorrectly folded proteins to aggregate, thus causing a global perturbation of the crowded and delicate intracellular folding environment1
. The folding health, or "proteostasis," of the cellular proteome is maintained, even under the duress of aging, stress and oxidative damage, by the coordinated action of different mechanistic units in an elaborate quality control system3,4
. A specialized machinery of molecular chaperones can bind non-native polypeptides and promote their folding into the native state1
, target them for degradation by the ubiquitin-proteasome system5
, or direct them to protective aggregation inclusions6-9
In eukaryotes, the cytosolic aggregation quality control load is partitioned between two compartments8-10
: the juxtanuclear quality control compartment (JUNQ) and the insoluble protein deposit (IPOD) (Figure 1
- model). Proteins that are ubiquitinated by the protein folding quality control machinery are delivered to the JUNQ, where they are processed for degradation by the proteasome. Misfolded proteins that are not ubiquitinated are diverted to the IPOD, where they are actively aggregated in a protective compartment.
Up until this point, the methodological paradigm of live-cell fluorescence microscopy has largely been to label proteins and track their locations in the cell at specific time-points and usually in two dimensions. As new technologies have begun to grant experimenters unprecedented access to the submicron scale in living cells, the dynamic architecture of the cytosol has come into view as a challenging new frontier for experimental characterization. We present a method for rapidly monitoring the 3D spatial distributions of multiple fluorescently labeled proteins in the yeast cytosol over time. 3D timelapse (4D imaging) is not merely a technical challenge; rather, it also facilitates a dramatic shift in the conceptual framework used to analyze cellular structure.
We utilize a cytosolic folding sensor protein in live yeast to visualize distinct fates for misfolded proteins in cellular aggregation quality control, using rapid 4D fluorescent imaging. The temperature sensitive mutant of the Ubc9 protein10-12
) is extremely effective both as a sensor of cellular proteostasis, and a physiological model for tracking aggregation quality control. As with most ts proteins, Ubc9ts
is fully folded and functional at permissive temperatures due to active cellular chaperones. Above 30 °C, or when the cell faces misfolding stress, Ubc9ts
misfolds and follows the fate of a native globular protein that has been misfolded due to mutation, heat denaturation, or oxidative damage. By fusing it to GFP or other fluorophores, it can be tracked in 3D as it forms Stress Foci, or is directed to JUNQ or IPOD.
Cellular Biology, Issue 74, Molecular Biology, Genetics, Proteins, Aggregation quality control, protein folding quality control, GFP, JUNQ (juxtanuclear quality control compartment), IPOD (insoluble protein deposit), proteostasis sensor, 4D live cell imaging, live cells, laser, cell biology, protein folding, Ubc9ts, yeast, assay, cell, imaging
High-throughput Detection Method for Influenza Virus
Institutions: Blood Research Institute, Mount Sinai School of Medicine , Blood Research Institute, City of Milwaukee Health Department Laboratory, Medical College of Wisconsin , Medical College of Wisconsin .
Influenza virus is a respiratory pathogen that causes a high degree of morbidity and mortality every year in multiple parts of the world. Therefore, precise diagnosis of the infecting strain and rapid high-throughput screening of vast numbers of clinical samples is paramount to control the spread of pandemic infections. Current clinical diagnoses of influenza infections are based on serologic testing, polymerase chain reaction, direct specimen immunofluorescence and cell culture 1,2
Here, we report the development of a novel diagnostic technique used to detect live influenza viruses. We used the mouse-adapted human A/PR/8/34 (PR8, H1N1) virus 3
to test the efficacy of this technique using MDCK cells 4
. MDCK cells (104
or 5 x 103
per well) were cultured in 96- or 384-well plates, infected with PR8 and viral proteins were detected using anti-M2 followed by an IR dye-conjugated secondary antibody. M2 5
and hemagglutinin 1
are two major marker proteins used in many different diagnostic assays. Employing IR-dye-conjugated secondary antibodies minimized the autofluorescence associated with other fluorescent dyes. The use of anti-M2 antibody allowed us to use the antigen-specific fluorescence intensity as a direct metric of viral quantity. To enumerate the fluorescence intensity, we used the LI-COR Odyssey-based IR scanner. This system uses two channel laser-based IR detections to identify fluorophores and differentiate them from background noise. The first channel excites at 680 nm and emits at 700 nm to help quantify the background. The second channel detects fluorophores that excite at 780 nm and emit at 800 nm. Scanning of PR8-infected MDCK cells in the IR scanner indicated a viral titer-dependent bright fluorescence. A positive correlation of fluorescence intensity to virus titer starting from 102
PFU could be consistently observed. Minimal but detectable positivity consistently seen with 102
PFU PR8 viral titers demonstrated the high sensitivity of the near-IR dyes. The signal-to-noise ratio was determined by comparing the mock-infected or isotype antibody-treated MDCK cells.
Using the fluorescence intensities from 96- or 384-well plate formats, we constructed standard titration curves. In these calculations, the first variable is the viral titer while the second variable is the fluorescence intensity. Therefore, we used the exponential distribution to generate a curve-fit to determine the polynomial relationship between the viral titers and fluorescence intensities. Collectively, we conclude that IR dye-based protein detection system can help diagnose infecting viral strains and precisely enumerate the titer of the infecting pathogens.
Immunology, Issue 60, Influenza virus, Virus titer, Epithelial cells
Using a Pan-Viral Microarray Assay (Virochip) to Screen Clinical Samples for Viral Pathogens
Institutions: University of California, San Francisco, University of California, San Francisco.
The diagnosis of viral causes of many infectious diseases is difficult due to the inherent sequence diversity of viruses as well as the ongoing emergence of novel viral pathogens, such as SARS coronavirus and 2009 pandemic H1N1 influenza virus, that are not detectable by traditional methods. To address these challenges, we have previously developed and validated a pan-viral microarray platform called the Virochip with the capacity to detect all known viruses as well as novel variants on the basis of conserved sequence homology1
. Using the Virochip, we have identified the full spectrum of viruses associated with respiratory infections, including cases of unexplained critical illness in hospitalized patients, with a sensitivity equivalent to or superior to conventional clinical testing2-5
. The Virochip has also been used to identify novel viruses, including the SARS coronavirus6,7
, a novel rhinovirus clade5
, XMRV (a retrovirus linked to prostate cancer)8
, avian bornavirus (the cause of a wasting disease in parrots)9
, and a novel cardiovirus in children with respiratory and diarrheal illness10
. The current version of the Virochip has been ported to an Agilent microarray platform and consists of ~36,000 probes derived from over ~1,500 viruses in GenBank as of December of 2009. Here we demonstrate the steps involved in processing a Virochip assay from start to finish (~24 hour turnaround time), including sample nucleic acid extraction, PCR amplification using random primers, fluorescent dye incorporation, and microarray hybridization, scanning, and analysis.
Immunology, Issue 50, virus, microarray, Virochip, viral detection, genomics, clinical diagnostics, viral discovery, metagenomics, novel pathogen discovery
The Xenopus Oocyte Cut-open Vaseline Gap Voltage-clamp Technique With Fluorometry
Institutions: Washington University in St. Louis.
The cut-open oocyte Vaseline gap (COVG) voltage clamp technique allows for analysis of electrophysiological and kinetic properties of heterologous ion channels in oocytes. Recordings from the cut-open setup are particularly useful for resolving low magnitude gating currents, rapid ionic current activation, and deactivation. The main benefits over the two-electrode voltage clamp (TEVC) technique include increased clamp speed, improved signal-to-noise ratio, and the ability to modulate the intracellular and extracellular milieu.
Here, we employ the human cardiac sodium channel (hNaV
1.5), expressed in Xenopus
oocytes, to demonstrate the cut-open setup and protocol as well as modifications that are required to add voltage clamp fluorometry capability.
The properties of fast activating ion channels, such as hNaV
1.5, cannot be fully resolved near room temperature using TEVC, in which the entirety of the oocyte membrane is clamped, making voltage control difficult. However, in the cut-open technique, isolation of only a small portion of the cell membrane allows for the rapid clamping required to accurately record fast kinetics while preventing channel run-down associated with patch clamp techniques.
In conjunction with the COVG technique, ion channel kinetics and electrophysiological properties can be further assayed by using voltage clamp fluorometry, where protein motion is tracked via cysteine conjugation of extracellularly applied fluorophores, insertion of genetically encoded fluorescent proteins, or the incorporation of unnatural amino acids into the region of interest1
. This additional data yields kinetic information about voltage-dependent conformational rearrangements of the protein via changes in the microenvironment surrounding the fluorescent molecule.
Developmental Biology, Issue 85, Voltage clamp, Cut-open, Oocyte, Voltage Clamp Fluorometry, Sodium Channels, Ionic Currents, Xenopus laevis
Structure and Coordination Determination of Peptide-metal Complexes Using 1D and 2D 1H NMR
Institutions: The Hebrew University of Jerusalem, The Hebrew University of Jerusalem.
Copper (I) binding by metallochaperone transport proteins prevents copper oxidation and release of the toxic ions that may participate in harmful redox reactions. The Cu (I) complex of the peptide model of a Cu (I) binding metallochaperone protein, which includes the sequence MTCSGCSRPG (underlined is conserved), was determined in solution under inert conditions by NMR spectroscopy.
NMR is a widely accepted technique for the determination of solution structures of proteins and peptides. Due to difficulty in crystallization to provide single crystals suitable for X-ray crystallography, the NMR technique is extremely valuable, especially as it provides information on the solution state rather than the solid state. Herein we describe all steps that are required for full three-dimensional structure determinations by NMR. The protocol includes sample preparation in an NMR tube, 1D and 2D data collection and processing, peak assignment and integration, molecular mechanics calculations, and structure analysis. Importantly, the analysis was first conducted without any preset metal-ligand bonds, to assure a reliable structure determination in an unbiased manner.
Chemistry, Issue 82, solution structure determination, NMR, peptide models, copper-binding proteins, copper complexes
Proton Transfer and Protein Conformation Dynamics in Photosensitive Proteins by Time-resolved Step-scan Fourier-transform Infrared Spectroscopy
Institutions: Freie Universität Berlin.
Monitoring the dynamics of protonation and protein backbone conformation changes during the function of a protein is an essential step towards understanding its mechanism. Protonation and conformational changes affect the vibration pattern of amino acid side chains and of the peptide bond, respectively, both of which can be probed by infrared (IR) difference spectroscopy. For proteins whose function can be repetitively and reproducibly triggered by light, it is possible to obtain infrared difference spectra with (sub)microsecond resolution over a broad spectral range using the step-scan Fourier transform infrared technique. With ~102
repetitions of the photoreaction, the minimum number to complete a scan at reasonable spectral resolution and bandwidth, the noise level in the absorption difference spectra can be as low as ~10-4
, sufficient to follow the kinetics of protonation changes from a single amino acid. Lower noise levels can be accomplished by more data averaging and/or mathematical processing. The amount of protein required for optimal results is between 5-100 µg, depending on the sampling technique used. Regarding additional requirements, the protein needs to be first concentrated in a low ionic strength buffer and then dried to form a film. The protein film is hydrated prior to the experiment, either with little droplets of water or under controlled atmospheric humidity. The attained hydration level (g of water / g of protein) is gauged from an IR absorption spectrum. To showcase the technique, we studied the photocycle of the light-driven proton-pump bacteriorhodopsin in its native purple membrane environment, and of the light-gated ion channel channelrhodopsin-2 solubilized in detergent.
Biophysics, Issue 88, bacteriorhodopsin, channelrhodopsin, attenuated total reflection, proton transfer, protein dynamics, infrared spectroscopy, time-resolved spectroscopy, step-scan, membrane proteins, singular value decomposition
Monitoring Activation of the Antiviral Pattern Recognition Receptors RIG-I And PKR By Limited Protease Digestion and Native PAGE
Institutions: Philipps-University Marburg.
Host defenses to virus infection are dependent on a rapid detection by pattern recognition receptors (PRRs) of the innate immune system. In the cytoplasm, the PRRs RIG-I and PKR bind to specific viral RNA ligands. This first mediates conformational switching and oligomerization, and then enables activation of an antiviral interferon response. While methods to measure antiviral host gene expression are well established, methods to directly monitor the activation states of RIG-I and PKR are only partially and less well established.
Here, we describe two methods to monitor RIG-I and PKR stimulation upon infection with an established interferon inducer, the Rift Valley fever virus mutant clone 13 (Cl 13). Limited trypsin digestion allows to analyze alterations in protease sensitivity, indicating conformational changes of the PRRs. Trypsin digestion of lysates from mock infected cells results in a rapid degradation of RIG-I and PKR, whereas Cl 13 infection leads to the emergence of a protease-resistant RIG-I fragment. Also PKR shows a virus-induced partial resistance to trypsin digestion, which coincides with its hallmark phosphorylation at Thr 446. The formation of RIG-I and PKR oligomers was validated by native polyacrylamide gel electrophoresis (PAGE). Upon infection, there is a strong accumulation of RIG-I and PKR oligomeric complexes, whereas these proteins remained as monomers in mock infected samples.
Limited protease digestion and native PAGE, both coupled to western blot analysis, allow a sensitive and direct measurement of two diverse steps of RIG-I and PKR activation. These techniques are relatively easy and quick to perform and do not require expensive equipment.
Infectious Diseases, Issue 89, innate immune response, virus infection, pathogen recognition receptor, RIG-I, PKR, IRF-3, limited protease digestion, conformational switch, native PAGE, oligomerization
Optimized Negative Staining: a High-throughput Protocol for Examining Small and Asymmetric Protein Structure by Electron Microscopy
Institutions: The Molecular Foundry.
Structural determination of proteins is rather challenging for proteins with molecular masses between 40 - 200 kDa. Considering that more than half of natural proteins have a molecular mass between 40 - 200 kDa1,2
, a robust and high-throughput method with a nanometer resolution capability is needed. Negative staining (NS) electron microscopy (EM) is an easy, rapid, and qualitative approach which has frequently been used in research laboratories to examine protein structure and protein-protein interactions. Unfortunately, conventional NS protocols often generate structural artifacts on proteins, especially with lipoproteins that usually form presenting rouleaux artifacts. By using images of lipoproteins from cryo-electron microscopy (cryo-EM) as a standard, the key parameters in NS specimen preparation conditions were recently screened and reported as the optimized NS protocol (OpNS), a modified conventional NS protocol 3
. Artifacts like rouleaux can be greatly limited by OpNS, additionally providing high contrast along with reasonably high‐resolution (near 1 nm) images of small and asymmetric proteins. These high-resolution and high contrast images are even favorable for an individual protein (a single object, no average) 3D reconstruction, such as a 160 kDa antibody, through the method of electron tomography4,5
. Moreover, OpNS can be a high‐throughput tool to examine hundreds of samples of small proteins. For example, the previously published mechanism of 53 kDa cholesteryl ester transfer protein (CETP) involved the screening and imaging of hundreds of samples 6
. Considering cryo-EM rarely successfully images proteins less than 200 kDa has yet to publish any study involving screening over one hundred sample conditions, it is fair to call OpNS a high-throughput method for studying small proteins. Hopefully the OpNS protocol presented here can be a useful tool to push the boundaries of EM and accelerate EM studies into small protein structure, dynamics and mechanisms.
Environmental Sciences, Issue 90, small and asymmetric protein structure, electron microscopy, optimized negative staining
Genomic MRI - a Public Resource for Studying Sequence Patterns within Genomic DNA
Institutions: University of Toledo Health Science Campus.
Non-coding genomic regions in complex eukaryotes, including intergenic areas, introns, and untranslated segments of exons, are profoundly non-random in their nucleotide composition and consist of a complex mosaic of sequence patterns. These patterns include so-called Mid-Range Inhomogeneity (MRI) regions -- sequences 30-10000 nucleotides in length that are enriched by a particular base or combination of bases (e.g. (G+T)-rich, purine-rich, etc.). MRI regions are associated with unusual (non-B-form) DNA structures that are often involved in regulation of gene expression, recombination, and other genetic processes (Fedorova & Fedorov 2010). The existence of a strong fixation bias within MRI regions against mutations that tend to reduce their sequence inhomogeneity additionally supports the functionality and importance of these genomic sequences (Prakash et al.
Here we demonstrate a freely available Internet resource -- the Genomic MRI
program package -- designed for computational analysis of genomic sequences in order to find and characterize various MRI patterns within them (Bechtel et al.
2008). This package also allows generation of randomized sequences with various properties and level of correspondence to the natural input DNA sequences. The main goal of this resource is to facilitate examination of vast regions of non-coding DNA that are still scarcely investigated and await thorough exploration and recognition.
Genetics, Issue 51, bioinformatics, computational biology, genomics, non-randomness, signals, gene regulation, DNA conformation
Assessment of Immunologically Relevant Dynamic Tertiary Structural Features of the HIV-1 V3 Loop Crown R2 Sequence by ab initio Folding
Institutions: School of Medicine, New York University.
The antigenic diversity of HIV-1 has long been an obstacle to vaccine design, and this variability is especially pronounced in the V3 loop of the virus' surface envelope glycoprotein. We previously proposed that the crown of the V3 loop, although dynamic and sequence variable, is constrained throughout the population of HIV-1 viruses to an immunologically relevant β-hairpin tertiary structure. Importantly, there are thousands of different V3 loop crown sequences in circulating HIV-1 viruses, making 3D structural characterization of trends across the diversity of viruses difficult or impossible by crystallography or NMR. Our previous successful studies with folding of the V3 crown1, 2
used the ab initio
accessible in the ICM-Pro molecular modeling software package (Molsoft LLC, La Jolla, CA) and suggested that the crown of the V3 loop, specifically from positions 10 to 22, benefits sufficiently from the flexibility and length of its flanking stems to behave to a large degree as if it were an unconstrained peptide freely folding in solution. As such, rapid ab initio
folding of just this portion of the V3 loop of any individual strain of the 60,000+ circulating HIV-1 strains can be informative. Here, we folded the V3 loop of the R2 strain to gain insight into the structural basis of its unique properties. R2 bears a rare V3 loop sequence thought to be responsible for the exquisite sensitivity of this strain to neutralization by patient sera and monoclonal antibodies4, 5
. The strain mediates CD4-independent infection and appears to elicit broadly neutralizing antibodies. We demonstrate how evaluation of the results of the folding can be informative for associating observed structures in the folding with the immunological activities observed for R2.
Infection, Issue 43, HIV-1, structure-activity relationships, ab initio simulations, antibody-mediated neutralization, vaccine design
Expression of Functional Recombinant Hemagglutinin and Neuraminidase Proteins from the Novel H7N9 Influenza Virus Using the Baculovirus Expression System
Institutions: Icahn School of Medicine at Mount Sinai, Icahn School of Medicine at Mount Sinai, Icahn School of Medicine at Mount Sinai.
The baculovirus expression system is a powerful tool for expression of recombinant proteins. Here we use it to produce correctly folded and glycosylated versions of the influenza A virus surface glycoproteins - the hemagglutinin (HA) and the neuraminidase (NA). As an example, we chose the HA and NA proteins expressed by the novel H7N9 virus that recently emerged in China. However the protocol can be easily adapted for HA and NA proteins expressed by any other influenza A and B virus strains. Recombinant HA (rHA) and NA (rNA) proteins are important reagents for immunological assays such as ELISPOT and ELISA, and are also in wide use for vaccine standardization, antibody discovery, isolation and characterization. Furthermore, recombinant NA molecules can be used to screen for small molecule inhibitors and are useful for characterization of the enzymatic function of the NA, as well as its sensitivity to antivirals. Recombinant HA proteins are also being tested as experimental vaccines in animal models, and a vaccine based on recombinant HA was recently licensed by the FDA for use in humans. The method we describe here to produce these molecules is straight forward and can facilitate research in influenza laboratories, since it allows for production of large amounts of proteins fast and at a low cost. Although here we focus on influenza virus surface glycoproteins, this method can also be used to produce other viral and cellular surface proteins.
Infection, Issue 81, Influenza A virus, Orthomyxoviridae Infections, Influenza, Human, Influenza in Birds, Influenza Vaccines, hemagglutinin, neuraminidase, H7N9, baculovirus, insect cells, recombinant protein expression
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Analyzing Protein Dynamics Using Hydrogen Exchange Mass Spectrometry
Institutions: University of Heidelberg.
All cellular processes depend on the functionality of proteins. Although the functionality of a given protein is the direct consequence of its unique amino acid sequence, it is only realized by the folding of the polypeptide chain into a single defined three-dimensional arrangement or more commonly into an ensemble of interconverting conformations. Investigating the connection between protein conformation and its function is therefore essential for a complete understanding of how proteins are able to fulfill their great variety of tasks. One possibility to study conformational changes a protein undergoes while progressing through its functional cycle is hydrogen-1
H-exchange in combination with high-resolution mass spectrometry (HX-MS). HX-MS is a versatile and robust method that adds a new dimension to structural information obtained by e.g.
crystallography. It is used to study protein folding and unfolding, binding of small molecule ligands, protein-protein interactions, conformational changes linked to enzyme catalysis, and allostery. In addition, HX-MS is often used when the amount of protein is very limited or crystallization of the protein is not feasible. Here we provide a general protocol for studying protein dynamics with HX-MS and describe as an example how to reveal the interaction interface of two proteins in a complex.
Chemistry, Issue 81, Molecular Chaperones, mass spectrometers, Amino Acids, Peptides, Proteins, Enzymes, Coenzymes, Protein dynamics, conformational changes, allostery, protein folding, secondary structure, mass spectrometry
RNA Secondary Structure Prediction Using High-throughput SHAPE
Institutions: Frederick National Laboratory for Cancer Research.
Understanding the function of RNA involved in biological processes requires a thorough knowledge of RNA structure. Toward this end, the methodology dubbed "high-throughput selective 2' hydroxyl acylation analyzed by primer extension", or SHAPE, allows prediction of RNA secondary structure with single nucleotide resolution. This approach utilizes chemical probing agents that preferentially acylate single stranded or flexible regions of RNA in aqueous solution. Sites of chemical modification are detected by reverse transcription of the modified RNA, and the products of this reaction are fractionated by automated capillary electrophoresis (CE). Since reverse transcriptase pauses at those RNA nucleotides modified by the SHAPE reagents, the resulting cDNA library indirectly maps those ribonucleotides that are single stranded in the context of the folded RNA. Using ShapeFinder software, the electropherograms produced by automated CE are processed and converted into nucleotide reactivity tables that are themselves converted into pseudo-energy constraints used in the RNAStructure (v5.3) prediction algorithm. The two-dimensional RNA structures obtained by combining SHAPE probing with in silico
RNA secondary structure prediction have been found to be far more accurate than structures obtained using either method alone.
Genetics, Issue 75, Molecular Biology, Biochemistry, Virology, Cancer Biology, Medicine, Genomics, Nucleic Acid Probes, RNA Probes, RNA, High-throughput SHAPE, Capillary electrophoresis, RNA structure, RNA probing, RNA folding, secondary structure, DNA, nucleic acids, electropherogram, synthesis, transcription, high throughput, sequencing
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Using Caenorhabditis elegans as a Model System to Study Protein Homeostasis in a Multicellular Organism
Institutions: Ben-Gurion University of the Negev.
The folding and assembly of proteins is essential for protein function, the long-term health of the cell, and longevity of the organism. Historically, the function and regulation of protein folding was studied in vitro
, in isolated tissue culture cells and in unicellular organisms. Recent studies have uncovered links between protein homeostasis (proteostasis), metabolism, development, aging, and temperature-sensing. These findings have led to the development of new tools for monitoring protein folding in the model metazoan organism Caenorhabditis elegans
. In our laboratory, we combine behavioral assays, imaging and biochemical approaches using temperature-sensitive or naturally occurring metastable proteins as sensors of the folding environment to monitor protein misfolding. Behavioral assays that are associated with the misfolding of a specific protein provide a simple and powerful readout for protein folding, allowing for the fast screening of genes and conditions that modulate folding. Likewise, such misfolding can be associated with protein mislocalization in the cell. Monitoring protein localization can, therefore, highlight changes in cellular folding capacity occurring in different tissues, at various stages of development and in the face of changing conditions. Finally, using biochemical tools ex vivo
, we can directly monitor protein stability and conformation. Thus, by combining behavioral assays, imaging and biochemical techniques, we are able to monitor protein misfolding at the resolution of the organism, the cell, and the protein, respectively.
Biochemistry, Issue 82, aging, Caenorhabditis elegans, heat shock response, neurodegenerative diseases, protein folding homeostasis, proteostasis, stress, temperature-sensitive
Optimization and Utilization of Agrobacterium-mediated Transient Protein Production in Nicotiana
Institutions: Fraunhofer USA Center for Molecular Biotechnology.
-mediated transient protein production in plants is a promising approach to produce vaccine antigens and therapeutic proteins within a short period of time. However, this technology is only just beginning to be applied to large-scale production as many technological obstacles to scale up are now being overcome. Here, we demonstrate a simple and reproducible method for industrial-scale transient protein production based on vacuum infiltration of Nicotiana
plants with Agrobacteria
carrying launch vectors. Optimization of Agrobacterium
cultivation in AB medium allows direct dilution of the bacterial culture in Milli-Q water, simplifying the infiltration process. Among three tested species of Nicotiana
, N. excelsiana
× N. excelsior
) was selected as the most promising host due to the ease of infiltration, high level of reporter protein production, and about two-fold higher biomass production under controlled environmental conditions. Induction of Agrobacterium
harboring pBID4-GFP (Tobacco mosaic virus
-based) using chemicals such as acetosyringone and monosaccharide had no effect on the protein production level. Infiltrating plant under 50 to 100 mbar for 30 or 60 sec resulted in about 95% infiltration of plant leaf tissues. Infiltration with Agrobacterium
laboratory strain GV3101 showed the highest protein production compared to Agrobacteria
laboratory strains LBA4404 and C58C1 and wild-type Agrobacteria
strains at6, at10, at77 and A4. Co-expression of a viral RNA silencing suppressor, p23 or p19, in N. benthamiana
resulted in earlier accumulation and increased production (15-25%) of target protein (influenza virus hemagglutinin).
Plant Biology, Issue 86, Agroinfiltration, Nicotiana benthamiana, transient protein production, plant-based expression, viral vector, Agrobacteria
High Throughput Quantitative Expression Screening and Purification Applied to Recombinant Disulfide-rich Venom Proteins Produced in E. coli
Institutions: Aix-Marseille Université, Commissariat à l'énergie atomique et aux énergies alternatives (CEA) Saclay, France.
Escherichia coli (E. coli)
is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli
cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
Bioengineering, Issue 89, E. coli, expression, recombinant, high throughput (HTP), purification, auto-induction, immobilized metal affinity chromatography (IMAC), tobacco etch virus protease (TEV) cleavage, disulfide bond isomerase C (DsbC) fusion, disulfide bonds, animal venom proteins/peptides
Nanomanipulation of Single RNA Molecules by Optical Tweezers
Institutions: University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York, University at Albany, State University of New York.
A large portion of the human genome is transcribed but not translated. In this post genomic era, regulatory functions of RNA have been shown to be increasingly important. As RNA function often depends on its ability to adopt alternative structures, it is difficult to predict RNA three-dimensional structures directly from sequence. Single-molecule approaches show potentials to solve the problem of RNA structural polymorphism by monitoring molecular structures one molecule at a time. This work presents a method to precisely manipulate the folding and structure of single RNA molecules using optical tweezers. First, methods to synthesize molecules suitable for single-molecule mechanical work are described. Next, various calibration procedures to ensure the proper operations of the optical tweezers are discussed. Next, various experiments are explained. To demonstrate the utility of the technique, results of mechanically unfolding RNA hairpins and a single RNA kissing complex are used as evidence. In these examples, the nanomanipulation technique was used to study folding of each structural domain, including secondary and tertiary, independently. Lastly, the limitations and future applications of the method are discussed.
Bioengineering, Issue 90, RNA folding, single-molecule, optical tweezers, nanomanipulation, RNA secondary structure, RNA tertiary structure
Determination of Protein-ligand Interactions Using Differential Scanning Fluorimetry
Institutions: University of Exeter.
A wide range of methods are currently available for determining the dissociation constant between a protein and interacting small molecules. However, most of these require access to specialist equipment, and often require a degree of expertise to effectively establish reliable experiments and analyze data. Differential scanning fluorimetry (DSF) is being increasingly used as a robust method for initial screening of proteins for interacting small molecules, either for identifying physiological partners or for hit discovery. This technique has the advantage that it requires only a PCR machine suitable for quantitative PCR, and so suitable instrumentation is available in most institutions; an excellent range of protocols are already available; and there are strong precedents in the literature for multiple uses of the method. Past work has proposed several means of calculating dissociation constants from DSF data, but these are mathematically demanding. Here, we demonstrate a method for estimating dissociation constants from a moderate amount of DSF experimental data. These data can typically be collected and analyzed within a single day. We demonstrate how different models can be used to fit data collected from simple binding events, and where cooperative binding or independent binding sites are present. Finally, we present an example of data analysis in a case where standard models do not apply. These methods are illustrated with data collected on commercially available control proteins, and two proteins from our research program. Overall, our method provides a straightforward way for researchers to rapidly gain further insight into protein-ligand interactions using DSF.
Biophysics, Issue 91, differential scanning fluorimetry, dissociation constant, protein-ligand interactions, StepOne, cooperativity, WcbI.
Interview: Protein Folding and Studies of Neurodegenerative Diseases
Institutions: MIT - Massachusetts Institute of Technology.
In this interview, Dr. Lindquist describes relationships between protein folding, prion diseases and neurodegenerative disorders. The problem of the protein folding is at the core of the modern biology. In addition to their traditional biochemical functions, proteins can mediate transfer of biological information and therefore can be considered a genetic material. This recently discovered function of proteins has important implications for studies of human disorders. Dr. Lindquist also describes current experimental approaches to investigate the mechanism of neurodegenerative diseases based on genetic studies in model organisms.
Neuroscience, issue 17, protein folding, brain, neuron, prion, neurodegenerative disease, yeast, screen, Translational Research