The aim of de novo protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
16 Related JoVE Articles!
Scalable Nanohelices for Predictive Studies and Enhanced 3D Visualization
Institutions: University of California Merced, University of California Merced.
Spring-like materials are ubiquitous in nature and of interest in nanotechnology for energy harvesting, hydrogen storage, and biological sensing applications. For predictive simulations, it has become increasingly important to be able to model the structure of nanohelices accurately. To study the effect of local structure on the properties of these complex geometries one must develop realistic models. To date, software packages are rather limited in creating atomistic helical models. This work focuses on producing atomistic models of silica glass (SiO2
) nanoribbons and nanosprings for molecular dynamics (MD) simulations. Using an MD model of “bulk” silica glass, two computational procedures to precisely create the shape of nanoribbons and nanosprings are presented. The first method employs the AWK programming language and open-source software to effectively carve various shapes of silica nanoribbons from the initial bulk model, using desired dimensions and parametric equations to define a helix. With this method, accurate atomistic silica nanoribbons can be generated for a range of pitch values and dimensions. The second method involves a more robust code which allows flexibility in modeling nanohelical structures. This approach utilizes a C++ code particularly written to implement pre-screening methods as well as the mathematical equations for a helix, resulting in greater precision and efficiency when creating nanospring models. Using these codes, well-defined and scalable nanoribbons and nanosprings suited for atomistic simulations can be effectively created. An added value in both open-source codes is that they can be adapted to reproduce different helical structures, independent of material. In addition, a MATLAB graphical user interface (GUI) is used to enhance learning through visualization and interaction for a general user with the atomistic helical structures. One application of these methods is the recent study of nanohelices via MD simulations for mechanical energy harvesting purposes.
Physics, Issue 93, Helical atomistic models; open-source coding; graphical user interface; visualization software; molecular dynamics simulations; graphical processing unit accelerated simulations.
Helical Organization of Blood Coagulation Factor VIII on Lipid Nanotubes
Institutions: University of Texas Medical Branch, University of Texas Medical Branch, University of Texas Medical Branch.
Cryo-electron microscopy (Cryo-EM)1
is a powerful approach to investigate the functional structure of proteins and complexes in a hydrated state and membrane environment2
Coagulation Factor VIII (FVIII)3
is a multi-domain blood plasma glycoprotein. Defect or deficiency of FVIII is the cause for Hemophilia type A - a severe bleeding disorder. Upon proteolytic activation, FVIII binds to the serine protease Factor IXa on the negatively charged platelet membrane, which is critical for normal blood clotting4
. Despite the pivotal role FVIII plays in coagulation, structural information for its membrane-bound state is incomplete5
. Recombinant FVIII concentrate is the most effective drug against Hemophilia type A and commercially available FVIII can be expressed as human or porcine, both forming functional complexes with human Factor IXa6,7
In this study we present a combination of Cryo-electron microscopy (Cryo-EM), lipid nanotechnology and structure analysis applied to resolve the membrane-bound structure of two highly homologous FVIII forms: human and porcine. The methodology developed in our laboratory to helically organize the two functional recombinant FVIII forms on negatively charged lipid nanotubes (LNT) is described. The representative results demonstrate that our approach is sufficiently sensitive to define the differences in the helical organization between the two highly homologous in sequence (86% sequence identity) proteins. Detailed protocols for the helical organization, Cryo-EM and electron tomography (ET) data acquisition are given. The two-dimensional (2D) and three-dimensional (3D) structure analysis applied to obtain the 3D reconstructions of human and porcine FVIII-LNT is discussed. The presented human and porcine FVIII-LNT structures show the potential of the proposed methodology to calculate the functional, membrane-bound organization of blood coagulation Factor VIII at high resolution.
Bioengineering, Issue 88, Cryo-electron microscopy, Lipid nanotubes, Helical assembly, Membrane-bound organization, Coagulation factor VIII
Designing a Bio-responsive Robot from DNA Origami
Institutions: Bar-Ilan University.
Nucleic acids are astonishingly versatile. In addition to their natural role as storage medium for biological information1
, they can be utilized in parallel computing2,3
, recognize and bind molecular or cellular targets4,5
, catalyze chemical reactions6,7
, and generate calculated responses in a biological system8,9
. Importantly, nucleic acids can be programmed to self-assemble into 2D and 3D structures10-12
, enabling the integration of all these remarkable features in a single robot linking the sensing of biological cues to a preset response in order to exert a desired effect.
Creating shapes from nucleic acids was first proposed by Seeman13
, and several variations on this theme have since been realized using various techniques11,12,14,15
. However, the most significant is perhaps the one proposed by Rothemund, termed scaffolded DNA origami16
. In this technique, the folding of a long (>7,000 bases) single-stranded DNA 'scaffold'
is directed to a desired shape by hundreds of short complementary strands termed 'staples'
. Folding is carried out by temperature annealing ramp. This technique was successfully demonstrated in the creation of a diverse array of 2D shapes with remarkable precision and robustness. DNA origami was later extended to 3D as well17,18
The current paper will focus on the caDNAno 2.0 software19
developed by Douglas and colleagues. caDNAno is a robust, user-friendly CAD tool enabling the design of 2D and 3D DNA origami shapes with versatile features. The design process relies on a systematic and accurate abstraction scheme for DNA structures, making it relatively straightforward and efficient.
In this paper we demonstrate the design of a DNA origami nanorobot that has been recently described20
. This robot is 'robotic' in the sense that it links sensing to actuation, in order to perform a task. We explain how various sensing schemes can be integrated into the structure, and how this can be relayed to a desired effect. Finally we use Cando21
to simulate the mechanical properties of the designed shape. The concept we discuss can be adapted to multiple tasks and settings.
Bioengineering, Issue 77, Genetics, Biomedical Engineering, Molecular Biology, Medicine, Genomics, Nanotechnology, Nanomedicine, DNA origami, nanorobot, caDNAno, DNA, DNA Origami, nucleic acids, DNA structures, CAD, sequencing
Nucleocapsid Annealing-Mediated Electrophoresis (NAME) Assay Allows the Rapid Identification of HIV-1 Nucleocapsid Inhibitors
Institutions: University of Padova, SUNY Albany.
RNA or DNA folded in stable tridimensional folding are interesting targets in the development of antitumor or antiviral drugs. In the case of HIV-1, viral proteins involved in the regulation of the virus activity recognize several nucleic acids. The nucleocapsid protein NCp7 (NC) is a key protein regulating several processes during virus replication. NC is in fact a chaperone destabilizing the secondary structures of RNA and DNA and facilitating their annealing. The inactivation of NC is a new approach and an interesting target for anti-HIV therapy. The N
lectrophoresis (NAME) assay was developed to identify molecules able to inhibit the melting and annealing of RNA and DNA folded in thermodynamically stable tridimensional conformations, such as hairpin structures of TAR and cTAR elements of HIV, by the nucleocapsid protein of HIV-1. The new assay employs either the recombinant or the synthetic protein, and oligonucleotides without the need of their previous labeling. The analysis of the results is achieved by standard polyacrylamide gel electrophoresis (PAGE) followed by conventional nucleic acid staining. The protocol reported in this work describes how to perform the NAME assay with the full-length protein or its truncated version lacking the basic N-terminal domain, both competent as nucleic acids chaperones, and how to assess the inhibition of NC chaperone activity by a threading intercalator. Moreover, NAME can be performed in two different modes, useful to obtain indications on the putative mechanism of action of the identified NC inhibitors.
Immunology, Issue 95, HIV-1, Nucleocapsid protein, NCp7, TAR-RNA, DNA, oligonucleotides, annealing, Gel electrophoresis, NAME
Do's and Don'ts of Cryo-electron Microscopy: A Primer on Sample Preparation and High Quality Data Collection for Macromolecular 3D Reconstruction
Institutions: Virginia Commonwealth University.
Cryo-electron microscopy (cryoEM) entails flash-freezing a thin layer of sample on a support, and then visualizing the sample in its frozen hydrated state by transmission electron microscopy (TEM). This can be achieved with very low quantity of protein and in the buffer of choice, without the use of any stain, which is very useful to determine structure-function correlations of macromolecules. When combined with single-particle image processing, the technique has found widespread usefulness for 3D structural determination of purified macromolecules.
The protocol presented here explains how to perform cryoEM and examines the causes of most commonly encountered problems for rational troubleshooting; following all these steps should lead to acquisition of high quality cryoEM images. The technique requires access to the electron microscope instrument and to a vitrification device. Knowledge of the 3D reconstruction concepts and software is also needed for computerized image processing. Importantly, high quality results depend on finding the right purification conditions leading to a uniform population of structurally intact macromolecules.
The ability of cryoEM to visualize macromolecules combined with the versatility of single particle image processing has proven very successful for structural determination of large proteins and macromolecular machines in their near-native state, identification of their multiple components by 3D difference mapping, and creation of pseudo-atomic structures by docking of x-ray structures. The relentless development of cryoEM instrumentation and image processing techniques for the last 30 years has resulted in the possibility to generate de novo
3D reconstructions at atomic resolution level.
Structural Biology, Issue 95, 3D electron microscopy, cryo-electron microscopy, membrane proteins, ryanodine receptor, single particle image processing, transmission electron microscopy
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Imaging Denatured Collagen Strands In vivo and Ex vivo via Photo-triggered Hybridization of Caged Collagen Mimetic Peptides
Institutions: University of Utah, Johns Hopkins University School of Medicine, Johns Hopkins University.
Collagen is a major structural component of the extracellular matrix that supports tissue formation and maintenance. Although collagen remodeling is an integral part of normal tissue renewal, excessive amount of remodeling activity is involved in tumors, arthritis, and many other pathological conditions. During collagen remodeling, the triple helical structure of collagen molecules is disrupted by proteases in the extracellular environment. In addition, collagens present in many histological tissue samples are partially denatured by the fixation and preservation processes. Therefore, these denatured collagen strands can serve as effective targets for biological imaging. We previously developed a caged collagen mimetic peptide (CMP) that can be photo-triggered to hybridize with denatured collagen strands by forming triple helical structure, which is unique to collagens. The overall goals of this procedure are i)
to image denatured collagen strands resulting from normal remodeling activities in vivo
, and ii)
to visualize collagens in ex vivo
tissue sections using the photo-triggered caged CMPs. To achieve effective hybridization and successful in vivo
and ex vivo
imaging, fluorescently labeled caged CMPs are either photo-activated immediately before intravenous injection, or are directly activated on tissue sections. Normal skeletal collagen remolding in nude mice and collagens in prefixed mouse cornea tissue sections are imaged in this procedure. The imaging method based on the CMP-collagen hybridization technology presented here could lead to deeper understanding of the tissue remodeling process, as well as allow development of new diagnostics for diseases associated with high collagen remodeling activity.
Bioengineering, Issue 83, collagen remodeling, triple helix, near infrared fluorescence, bioimaging, tissue staining
Structure and Coordination Determination of Peptide-metal Complexes Using 1D and 2D 1H NMR
Institutions: The Hebrew University of Jerusalem, The Hebrew University of Jerusalem.
Copper (I) binding by metallochaperone transport proteins prevents copper oxidation and release of the toxic ions that may participate in harmful redox reactions. The Cu (I) complex of the peptide model of a Cu (I) binding metallochaperone protein, which includes the sequence MTCSGCSRPG (underlined is conserved), was determined in solution under inert conditions by NMR spectroscopy.
NMR is a widely accepted technique for the determination of solution structures of proteins and peptides. Due to difficulty in crystallization to provide single crystals suitable for X-ray crystallography, the NMR technique is extremely valuable, especially as it provides information on the solution state rather than the solid state. Herein we describe all steps that are required for full three-dimensional structure determinations by NMR. The protocol includes sample preparation in an NMR tube, 1D and 2D data collection and processing, peak assignment and integration, molecular mechanics calculations, and structure analysis. Importantly, the analysis was first conducted without any preset metal-ligand bonds, to assure a reliable structure determination in an unbiased manner.
Chemistry, Issue 82, solution structure determination, NMR, peptide models, copper-binding proteins, copper complexes
Chromatin Immunoprecipitation Assay for Tissue-specific Genes using Early-stage Mouse Embryos
Institutions: University of Massachusetts Medical School.
Chromatin immunoprecipitation (ChIP) is a powerful tool to identify protein:chromatin interactions that occur in the context of living cells 1-3
. This technique has been widely exploited in tissue culture cells, and to a lesser extent, in primary tissue. The application of ChIP to rodent embryonic tissue, especially at early times of development, is complicated by the limited amount of tissue and the heterogeneity of cell and tissue types in the embryo. Here we present a method to perform ChIP using a dissociated embryonic day 8.5 (E8.5) embryo. Sheared chromatin from a single E8.5 embryo can be divided into up to five aliquots, which allows the investigator sufficient material for controls and for investigation of specific protein:chromatin interactions.
We have utilized this technique to begin to document protein:chromatin interactions during the specification of tissue-specific gene expression programs. The heterogeneity of cell types in an embryo necessarily restricts the application of this technique because the result is the detection of protein:chromatin interactions without distinguishing whether the interactions occur in all, a subset of, or a single cell type(s). However, examination of tissue-specific genes during or following the onset of tissue-specific gene expression is feasible for two reasons. First, immunoprecipitation of tissue specific factors necessarily isolates chromatin from the cell type where the factor is expressed. Second, immunoprecipitation of coactivators and histones containing post-translational modifications that are associated with gene activation should only be found at genes and gene regulatory sequences in the cell type where the gene is being or has been activated. The technique should be applicable to the study of most tissue-specific gene activation events.
In the example described below, we utilized E8.5 and E9.5 mouse embryos to examine factor binding at a skeletal muscle specific gene promoter. Somites, which are the precursor tissues from which the skeletal muscles of the trunk and limbs will form, are present at E8.5-9.54,5
. Myogenin is a regulatory factor required for skeletal muscle differentiation 6-9
. The data demonstrate that myogenin is associated with its own promoter in E8.5 and E9.5 embryos. Because myogenin is only expressed in somites at this stage of development 6,10
, the data indicate that myogenin interactions with its own promoter have already occurred in skeletal muscle precursor cells in E8.5 embryos.
Developmental Biology, Issue 50, Myogenesis, Chromatin, Gene Regulation, Chromatin Immunoprecipitation, Embryo, Mouse
Acquiring Fluorescence Time-lapse Movies of Budding Yeast and Analyzing Single-cell Dynamics using GRAFTS
Institutions: Massachusetts Institute of Technology.
Fluorescence time-lapse microscopy has become a powerful tool in the study of many biological processes at the single-cell level. In particular, movies depicting the temporal dependence of gene expression provide insight into the dynamics of its regulation; however, there are many technical challenges to obtaining and analyzing fluorescence movies of single cells. We describe here a simple protocol using a commercially available microfluidic culture device to generate such data, and a MATLAB-based, graphical user interface (GUI) -based software package to quantify the fluorescence images. The software segments and tracks cells, enables the user to visually curate errors in the data, and automatically assigns lineage and division times. The GUI further analyzes the time series to produce whole cell traces as well as their first and second time derivatives. While the software was designed for S. cerevisiae
, its modularity and versatility should allow it to serve as a platform for studying other cell types with few modifications.
Microbiology, Issue 77, Cellular Biology, Molecular Biology, Genetics, Biophysics, Saccharomyces cerevisiae, Microscopy, Fluorescence, Cell Biology, microscopy/fluorescence and time-lapse, budding yeast, gene expression dynamics, segmentation, lineage tracking, image tracking, software, yeast, cells, imaging
Extracellularly Identifying Motor Neurons for a Muscle Motor Pool in Aplysia californica
Institutions: Case Western Reserve University , Case Western Reserve University , Case Western Reserve University .
In animals with large identified neurons (e.g.
mollusks), analysis of motor pools is done using intracellular techniques1,2,3,4
. Recently, we developed a technique to extracellularly stimulate and record individual neurons in Aplysia californica5
. We now describe a protocol for using this technique to uniquely identify and characterize motor neurons within a motor pool.
This extracellular technique has advantages. First, extracellular electrodes can stimulate and record neurons through the sheath5
, so it does not need to be removed. Thus, neurons will be healthier in extracellular experiments than in intracellular ones. Second, if ganglia are rotated by appropriate pinning of the sheath, extracellular electrodes can access neurons on both sides of the ganglion, which makes it easier and more efficient to identify multiple neurons in the same preparation. Third, extracellular electrodes do not need to penetrate cells, and thus can be easily moved back and forth among neurons, causing less damage to them. This is especially useful when one tries to record multiple neurons during repeating motor patterns that may only persist for minutes. Fourth, extracellular electrodes are more flexible than intracellular ones during muscle movements. Intracellular electrodes may pull out and damage neurons during muscle contractions. In contrast, since extracellular electrodes are gently pressed onto the sheath above neurons, they usually stay above the same neuron during muscle contractions, and thus can be used in more intact preparations.
To uniquely identify motor neurons for a motor pool (in particular, the I1/I3 muscle in Aplysia
) using extracellular electrodes, one can use features that do not require intracellular measurements as criteria: soma size and location, axonal projection, and muscle innervation4,6,7
. For the particular motor pool used to illustrate the technique, we recorded from buccal nerves 2 and 3 to measure axonal projections, and measured the contraction forces of the I1/I3 muscle to determine the pattern of muscle innervation for the individual motor neurons.
We demonstrate the complete process of first identifying motor neurons using muscle innervation, then characterizing their timing during motor patterns, creating a simplified diagnostic method for rapid identification. The simplified and more rapid diagnostic method is superior for more intact preparations, e.g.
in the suspended buccal mass preparation8
or in vivo9
. This process can also be applied in other motor pools10,11,12
or in other animal systems2,3,13,14
Neuroscience, Issue 73, Physiology, Biomedical Engineering, Anatomy, Behavior, Neurobiology, Animal, Neurosciences, Neurophysiology, Electrophysiology, Aplysia, Aplysia californica, California sea slug, invertebrate, feeding, buccal mass, ganglia, motor neurons, neurons, extracellular stimulation and recordings, extracellular electrodes, animal model
Analyzing and Building Nucleic Acid Structures with 3DNA
Institutions: Rutgers - The State University of New Jersey, Columbia University .
The 3DNA software package is a popular and versatile bioinformatics tool with capabilities to analyze, construct, and visualize three-dimensional nucleic acid structures. This article presents detailed protocols for a subset of new and popular features available in 3DNA, applicable to both individual structures and ensembles of related structures. Protocol 1 lists the set of instructions needed to download and install the software. This is followed, in Protocol 2, by the analysis of a nucleic acid structure, including the assignment of base pairs and the determination of rigid-body parameters that describe the structure and, in Protocol 3, by a description of the reconstruction of an atomic model of a structure from its rigid-body parameters. The most recent version of 3DNA, version 2.1, has new features for the analysis and manipulation of ensembles of structures, such as those deduced from nuclear magnetic resonance (NMR) measurements and molecular dynamic (MD) simulations; these features are presented in Protocols 4 and 5. In addition to the 3DNA stand-alone software package, the w3DNA web server, located at http://w3dna.rutgers.edu, provides a user-friendly interface to selected features of the software. Protocol 6 demonstrates a novel feature of the site for building models of long DNA molecules decorated with bound proteins at user-specified locations.
Genetics, Issue 74, Molecular Biology, Biochemistry, Bioengineering, Biophysics, Genomics, Chemical Biology, Quantitative Biology, conformational analysis, DNA, high-resolution structures, model building, molecular dynamics, nucleic acid structure, RNA, visualization, bioinformatics, three-dimensional, 3DNA, software
Determination of the Gas-phase Acidities of Oligopeptides
Institutions: University of the Pacific.
Amino acid residues located at different positions in folded proteins often exhibit different degrees of acidities. For example, a cysteine residue located at or near the N-terminus of a helix is often more acidic than that at or near the C-terminus 1-6
. Although extensive experimental studies on the acid-base properties of peptides have been carried out in the condensed phase, in particular in aqueous solutions 6-8
, the results are often complicated by solvent effects 7
. In fact, most of the active sites in proteins are located near the interior region where solvent effects have been minimized 9,10
. In order to understand intrinsic acid-base properties of peptides and proteins, it is important to perform the studies in a solvent-free environment.
We present a method to measure the acidities of oligopeptides in the gas-phase. We use a cysteine-containing oligopeptide, Ala3
CH), as the model compound. The measurements are based on the well-established extended Cooks kinetic method (Figure 1
. The experiments are carried out using a triple-quadrupole mass spectrometer interfaced with an electrospray ionization (ESI) ion source (Figure 2
). For each peptide sample, several reference acids are selected. The reference acids are structurally similar organic compounds with known gas-phase acidities. A solution of the mixture of the peptide and a reference acid is introduced into the mass spectrometer, and a gas-phase proton-bound anionic cluster of peptide-reference acid is formed. The proton-bound cluster is mass isolated and subsequently fragmented via collision-induced dissociation (CID) experiments. The resulting fragment ion abundances are analyzed using a relationship between the acidities and the cluster ion dissociation kinetics. The gas-phase acidity of the peptide is then obtained by linear regression of the thermo-kinetic plots 17,18
The method can be applied to a variety of molecular systems, including organic compounds, amino acids and their derivatives, oligonucleotides, and oligopeptides. By comparing the gas-phase acidities measured experimentally with those values calculated for different conformers, conformational effects on the acidities can be evaluated.
Chemistry, Issue 76, Biochemistry, Molecular Biology, Oligopeptide, gas-phase acidity, kinetic method, collision-induced dissociation, triple-quadrupole mass spectrometry, oligopeptides, peptides, mass spectrometry, MS
A Protocol for Computer-Based Protein Structure and Function Prediction
Institutions: University of Michigan , University of Kansas.
Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
Biochemistry, Issue 57, On-line server, I-TASSER, protein structure prediction, function prediction
Structure of HIV-1 Capsid Assemblies by Cryo-electron Microscopy and Iterative Helical Real-space Reconstruction
Institutions: University of Pittsburgh School of Medicine.
Cryo-electron microscopy (cryo-EM), combined with image processing, is an increasingly powerful tool for structure determination of macromolecular protein complexes and assemblies. In fact, single particle electron microscopy1
and two-dimensional (2D) electron crystallography2
have become relatively routine methodologies and a large number of structures have been solved using these methods. At the same time, image processing and three-dimensional (3D) reconstruction of helical objects has rapidly developed, especially, the iterative helical real-space reconstruction (IHRSR) method3
, which uses single particle analysis tools in conjunction with helical symmetry. Many biological entities function in filamentous or helical forms, including actin filaments4
, amyloid fibers6
, tobacco mosaic viruses7
, and bacteria flagella8
, and, because a 3D density map of a helical entity can be attained from a single projection image, compared to the many images required for 3D reconstruction of a non-helical object, with the IHRSR method, structural analysis of such flexible and disordered helical assemblies is now attainable.
In this video article, we provide detailed protocols for obtaining a 3D density map of a helical protein assembly (HIV-1 capsid9
is our example), including protocols for cryo-EM specimen preparation, low dose data collection by cryo-EM, indexing of helical diffraction patterns, and image processing and 3D reconstruction using IHRSR. Compared to other techniques, cryo-EM offers optimal specimen preservation under near native conditions. Samples are embedded in a thin layer of vitreous ice, by rapid freezing, and imaged in electron microscopes at liquid nitrogen temperature, under low dose conditions to minimize the radiation damage. Sample images are obtained under near native conditions at the expense of low signal and low contrast in the recorded micrographs. Fortunately, the process of helical reconstruction has largely been automated, with the exception of indexing the helical diffraction pattern. Here, we describe an approach to index helical structure and determine helical symmetries (helical parameters) from digitized micrographs, an essential step for 3D helical reconstruction. Briefly, we obtain an initial 3D density map by applying the IHRSR method. This initial map is then iteratively refined by introducing constraints for the alignment parameters of each segment, thus controlling their degrees of freedom. Further improvement is achieved by correcting for the contrast transfer function (CTF) of the electron microscope (amplitude and phase correction) and by optimizing the helical symmetry of the assembly.
Immunology, Issue 54, cryo-electron microscopy, helical indexing, helical real-space reconstruction, tubular assemblies, HIV-1 capsid
Self-assembly of Complex Two-dimensional Shapes from Single-stranded DNA Tiles
Institutions: Tsinghua University, Harvard University, Harvard Medical School.
Current methods in DNA nano-architecture have successfully engineered a variety of 2D and 3D structures using principles of self-assembly. In this article, we describe detailed protocols on how to fabricate sophisticated 2D shapes through the self-assembly of uniquely addressable single-stranded DNA tiles which act as molecular pixels on a molecular canvas. Each single-stranded tile (SST) is a 42-nucleotide DNA strand composed of four concatenated modular domains which bind to four neighbors during self-assembly. The molecular canvas is a rectangle structure self-assembled from SSTs. A prescribed complex 2D shape is formed by selecting the constituent molecular pixels (SSTs) from a 310-pixel molecular canvas and then subjecting the corresponding strands to one-pot annealing. Due to the modular nature of the SST approach we demonstrate the scalability, versatility and robustness of this method. Compared with alternative methods, the SST method enables a wider selection of information polymers and sequences through the use of de novo
designed and synthesized short DNA strands.
Chemistry, Issue 99, self-assembly, DNA tiles, single-stranded tiles, molecular canvas, molecular pixel, programmable nanostructures, DNA nanotechnology