Many researchers, across incredibly diverse foci, are applying phylogenetics to their research question(s). However, many researchers are new to this topic and so it presents inherent problems. Here we compile a practical introduction to phylogenetics for nonexperts. We outline in a step-by-step manner, a pipeline for generating reliable phylogenies from gene sequence datasets. We begin with a user-guide for similarity search tools via online interfaces as well as local executables. Next, we explore programs for generating multiple sequence alignments followed by protocols for using software to determine best-fit models of evolution. We then outline protocols for reconstructing phylogenetic relationships via maximum likelihood and Bayesian criteria and finally describe tools for visualizing phylogenetic trees. While this is not by any means an exhaustive description of phylogenetic approaches, it does provide the reader with practical starting information on key software applications commonly utilized by phylogeneticists. The vision for this article would be that it could serve as a practical training tool for researchers embarking on phylogenetic studies and also serve as an educational resource that could be incorporated into a classroom or teaching-lab.
23 Related JoVE Articles!
An Experimental and Bioinformatics Protocol for RNA-seq Analyses of Photoperiodic Diapause in the Asian Tiger Mosquito, Aedes albopictus
Institutions: Georgetown University, The Ohio State University.
Photoperiodic diapause is an important adaptation that allows individuals to escape harsh seasonal environments via a series of physiological changes, most notably developmental arrest and reduced metabolism. Global gene expression profiling via RNA-Seq can provide important insights into the transcriptional mechanisms of photoperiodic diapause. The Asian tiger mosquito, Aedes albopictus
, is an outstanding organism for studying the transcriptional bases of diapause due to its ease of rearing, easily induced diapause, and the genomic resources available. This manuscript presents a general experimental workflow for identifying diapause-induced transcriptional differences in A. albopictus.
Rearing techniques, conditions necessary to induce diapause and non-diapause development, methods to estimate percent diapause in a population, and RNA extraction and integrity assessment for mosquitoes are documented. A workflow to process RNA-Seq data from Illumina sequencers culminates in a list of differentially expressed genes. The representative results demonstrate that this protocol can be used to effectively identify genes differentially regulated at the transcriptional level in A. albopictus
due to photoperiodic differences. With modest adjustments, this workflow can be readily adapted to study the transcriptional bases of diapause or other important life history traits in other mosquitoes.
Genetics, Issue 93, Aedes albopictus Asian tiger mosquito, photoperiodic diapause, RNA-Seq de novo transcriptome assembly, mosquito husbandry
Isolation of Translating Ribosomes Containing Peptidyl-tRNAs for Functional and Structural Analyses
Institutions: University of Alabama Huntsville, Stanford University .
Recently, structural and biochemical studies have detailed many of the molecular events that occur in the ribosome during inhibition of protein synthesis by antibiotics and during nascent polypeptide synthesis. Some of these antibiotics, and regulatory nascent polypeptides mostly in the form of peptidyl-tRNAs, inhibit either peptide bond formation or translation termination1-7
. These inhibitory events can stop the movement of the ribosome, a phenomenon termed "translational arrest". Translation arrest induced by either an antibiotic or a nascent polypeptide has been shown to regulate the expression of genes involved in diverse cellular functions such as cell growth, antibiotic resistance, protein translocation and cell metabolism8-13
. Knowledge of how antibiotics and regulatory nascent polypeptides alter ribosome function is essential if we are to understand the complete role of the ribosome in translation, in every organism.
Here, we describe a simple methodology that can be used to purify, exclusively, for analysis, those ribosomes translating a specific mRNA and containing a specific peptidyl-tRNA14
. This procedure is based on selective isolation of translating ribosomes bound to a biotin-labeled mRNA. These translational complexes are separated from other ribosomes in the same mixture, using streptavidin paramagnetic beads (SMB) and a magnetic field (MF). Biotin-labeled mRNAs are synthesized by run-off transcription assays using as templates PCR-generated DNA fragments that contain T7 transcriptional promoters. T7 RNA polymerase incorporates biotin-16-UMP from biotin-UTP; under our conditions approximately ten biotin-16-UMP molecules are incorporated in a 600 nt mRNA with a 25% UMP content. These biotin-labeled mRNAs are then isolated, and used in in vitro
translation assays performed with release factor 2 (RF2)-depleted cell-free extracts obtained from Escherichia coli
strains containing wild type or mutant ribosomes. Ribosomes translating the biotin-labeled mRNA sequences are stalled at the stop codon region, due to the absence of the RF2 protein, which normally accomplishes translation termination. Stalled ribosomes containing the newly synthesized peptidyl-tRNA are isolated and removed from the translation reactions using SMB and an MF. These beads only bind biotin-containing messages.
The isolated, translational complexes, can be used to analyze the structural and functional features of wild type or mutant ribosomal components, or peptidyl-tRNA sequences, as well as determining ribosome interaction with antibiotics or other molecular factors 1,14-16
. To examine the function of these isolated ribosome complexes, peptidyl-transferase assays can be performed in the presence of the antibiotic puromycin1
. To study structural changes in translational complexes, well established procedures can be used, such as i) crosslinking to specific amino acids14
and/or ii) alkylation protection assays1,14,17
Molecular Biology, Issue 48, Ribosome stalling, ribosome isolation, peptidyl-tRNA, in vitro translation, RNA chemical modification, puromycin, antibiotics.
Quantitative Analyses of all Influenza Type A Viral Hemagglutinins and Neuraminidases using Universal Antibodies in Simple Slot Blot Assays
Institutions: Health canada, The State Food and Drug Administration, Beijing, University of Ottawa, King Abdulaziz University, Public Health Agency of Canada.
Hemagglutinin (HA) and neuraminidase (NA) are two surface proteins of influenza viruses which are known to play important roles in the viral life cycle and the induction of protective immune responses1,2
. As the main target for neutralizing antibodies, HA is currently used as the influenza vaccine potency marker and is measured by single radial immunodiffusion (SRID)3
. However, the dependence of SRID on the availability of the corresponding subtype-specific antisera causes a minimum of 2-3 months delay for the release of every new vaccine. Moreover, despite evidence that NA also induces protective immunity4
, the amount of NA in influenza vaccines is not yet standardized due to a lack of appropriate reagents or analytical method5
. Thus, simple alternative methods capable of quantifying HA and NA antigens are desirable for rapid release and better quality control of influenza vaccines.
Universally conserved regions in all available influenza A HA and NA sequences were identified by bioinformatics analyses6-7
. One sequence (designated as Uni-1) was identified in the only universally conserved epitope of HA, the fusion peptide6
, while two conserved sequences were identified in neuraminidases, one close to the enzymatic active site (designated as HCA-2) and the other close to the N-terminus (designated as HCA-3)7
. Peptides with these amino acid sequences were synthesized and used to immunize rabbits for the production of antibodies. The antibody against the Uni-1 epitope of HA was able to bind to 13 subtypes of influenza A HA (H1-H13) while the antibodies against the HCA-2 and HCA-3 regions of NA were capable of binding all 9 NA subtypes. All antibodies showed remarkable specificity against the viral sequences as evidenced by the observation that no cross-reactivity to allantoic proteins was detected. These universal antibodies were then used to develop slot blot assays to quantify HA and NA in influenza A vaccines without the need for specific antisera7,8
. Vaccine samples were applied onto a PVDF membrane using a slot blot apparatus along with reference standards diluted to various concentrations. For the detection of HA, samples and standard were first diluted in Tris-buffered saline (TBS) containing 4M urea while for the measurement of NA they were diluted in TBS containing 0.01% Zwittergent as these conditions significantly improved the detection sensitivity. Following the detection of the HA and NA antigens by immunoblotting with their respective universal antibodies, signal intensities were quantified by densitometry. Amounts of HA and NA in the vaccines were then calculated using a standard curve established with the signal intensities of the various concentrations of the references used.
Given that these antibodies bind to universal epitopes in HA or NA, interested investigators could use them as research tools in immunoassays other than the slot blot only.
Immunology, Issue 50, Virology, influenza, hemagglutinin, neuraminidase, quantification, universal antibody
Growth Assays to Assess Polyglutamine Toxicity in Yeast
Institutions: Boston Biomedical Research Institute.
Protein misfolding is associated with many human diseases, particularly neurodegenerative diseases, such as Alzheimer’s disease, Parkinson's disease, and Huntington's disease 1
. Huntington's disease (HD) is caused by the abnormal expansion of a polyglutamine (polyQ) region within the protein huntingtin. The polyQ-expanded huntingtin protein attains an aberrant conformation (i.e. it misfolds) and causes cellular toxicity 2
. At least eight further neurodegenerative diseases are caused by polyQ-expansions, including the Spinocerebellar Ataxias and Kennedy’s disease 3
The model organism yeast has facilitated significant insights into the cellular and molecular basis of polyQ-toxicity, including the impact of intra- and inter-molecular factors of polyQ-toxicity, and the identification of cellular pathways that are impaired in cells expressing polyQ-expansion proteins 3-8
. Importantly, many aspects of polyQ-toxicity that were found in yeast were reproduced in other experimental systems and to some extent in samples from HD patients, thus demonstrating the significance of the yeast model for the discovery of basic mechanisms underpinning polyQ-toxicity.
A direct and relatively simple way to determine polyQ-toxicity in yeast is to measure growth defects of yeast cells expressing polyQ-expansion proteins. This manuscript describes three complementary experimental approaches to determine polyQ-toxicity in yeast by measuring the growth of yeast cells expressing polyQ-expansion proteins. The first two experimental approaches monitor yeast growth on plates, the third approach monitors the growth of liquid yeast cultures using the BioscreenC instrument.
Furthermore, this manuscript describes experimental difficulties that can occur when handling yeast polyQ models and outlines strategies that will help to avoid or minimize these difficulties. The protocols described here can be used to identify and to characterize genetic pathways and small molecules that modulate polyQ-toxicity. Moreover, the described assays may serve as templates for accurate analyses of the toxicity caused by other disease-associated misfolded proteins in yeast models.
Molecular Biology, Issue 61, Protein misfolding, yeast, polyglutamine diseases, growth assays
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
Analysis of Translation Initiation During Stress Conditions by Polysome Profiling
Institutions: Laval University, CHU de Quebec Research Center.
Precise control of mRNA translation is fundamental for eukaryotic cell homeostasis, particularly in response to physiological and pathological stress. Alterations of this program can lead to the growth of damaged cells, a hallmark of cancer development, or to premature cell death such as seen in neurodegenerative diseases. Much of what is known concerning the molecular basis for translational control has been obtained from polysome analysis using a density gradient fractionation system. This technique relies on ultracentrifugation of cytoplasmic extracts on a linear sucrose gradient. Once the spin is completed, the system allows fractionation and quantification of centrifuged zones corresponding to different translating ribosomes populations, thus resulting in a polysome profile. Changes in the polysome profile are indicative of changes or defects in translation initiation that occur in response to various types of stress. This technique also allows to assess the role of specific proteins on translation initiation, and to measure translational activity of specific mRNAs. Here we describe our protocol to perform polysome profiles in order to assess translation initiation of eukaryotic cells and tissues under either normal or stress growth conditions.
Cellular Biology, Issue 87, Translation initiation, polysome profile, sucrose gradient, protein and RNA isolation, stress conditions
Assessment of Selective mRNA Translation in Mammalian Cells by Polysome Profiling
Institutions: University of Ottawa, Montreal Neurological Institute, University of Ottawa.
Regulation of protein synthesis represents a key control point in cellular response to stress. In particular, discreet RNA regulatory elements were shown to allow to selective translation of specific mRNAs, which typically encode for proteins required for a particular stress response. Identification of these mRNAs, as well as the characterization of regulatory mechanisms responsible for selective translation has been at the forefront of molecular biology for some time. Polysome profiling is a cornerstone method in these studies. The goal of polysome profiling is to capture mRNA translation by immobilizing actively translating ribosomes on different transcripts and separate the resulting polyribosomes by ultracentrifugation on a sucrose gradient, thus allowing for a distinction between highly translated transcripts and poorly translated ones. These can then be further characterized by traditional biochemical and molecular biology methods. Importantly, combining polysome profiling with high throughput genomic approaches allows for a large scale analysis of translational regulation.
Cellular Biology, Issue 92, cellular stress, translation initiation, internal ribosome entry site, polysome, RT-qPCR, gradient
In vivo Interrogation of Central Nervous System Translatome by Polyribosome Fractionation
Institutions: German Cancer Research Center (DKFZ).
Multiple processes are involved in gene expression including transcription, translation and stability of mRNAs and proteins. Each of these steps are tightly regulated, affecting the final dynamics of protein abundance. Various regulatory mechanisms exist at the translation step, rendering mRNA levels alone an unreliable indicator of gene expression. In addition, local regulation of mRNA translation has been particularly implicated in neuronal functions, shifting 'translatomics' to the focus of attention in neurobiology. The presented method can be used to bridge transcriptomics and proteomics.
Here we describe essential modifications to the technique of polyribosome fractionation, which interrogates the translatome based on the association of actively translated mRNAs to multiple ribosomes and their differential sedimentation in sucrose gradients. Traditionally, working with in vivo
samples, particularly of the central nervous system (CNS), has proven challenging due to the restricted amounts of material and the presence of fatty tissue components. In order to address this, the described protocol is specifically optimized for use with minimal amount of CNS material, as demonstrated by the use of single mouse spinal cord and brain. Briefly, CNS tissues are extracted and translating ribosomes are immobilized on mRNAs with cycloheximide. Myelin flotation is then performed to remove lipid rich components. Fractionation is performed on a sucrose gradient where mRNAs are separated according to their ribosomal loading. Isolated fractions are suitable for a range of downstream assays, including new genome wide assay technologies.
Neuroscience, Issue 86, central nervous system, CNS, translation, polyribosome fractionation, RNA, Brain, spinal cord, microarray, next-generation sequencing, gradient, translatome
The Encapsulation of Cell-free Transcription and Translation Machinery in Vesicles for the Construction of Cellular Mimics
Institutions: University of Trento.
As interest shifts from individual molecules to systems of molecules, an increasing number of laboratories have sought to build from the bottom up cellular mimics that better represent the complexity of cellular life. To date there are a number of paths that could be taken to build compartmentalized cellular mimics, including the exploitation of water-in-oil emulsions, microfluidic devices, and vesicles. Each of the available options has specific advantages and disadvantages. For example, water-in-oil emulsions give high encapsulation efficiency but do not mimic well the permeability barrier of living cells. The primary advantage of the methods described herein is that they are all easy and cheap to implement. Transcription-translation machinery is encapsulated inside of phospholipid vesicles through a process that exploits common instrumentation, such as a centrifugal evaporator and an extruder. Reactions are monitored by fluorescence spectroscopy. The protocols can be adapted for recombinant protein expression, the construction of cellular mimics, the exploration of the minimum requirements for cellular life, or the assembly of genetic circuitry.
Bioengineering, Issue 80, synthetic biology, minimal cell, protocell, artificial cell, cell-free, in vitro transcription-translation, liposome, vesicle
Stable Isotopic Profiling of Intermediary Metabolic Flux in Developing and Adult Stage Caenorhabditis elegans
Institutions: The Children's Hospital of Philadelphia, University of Pennsylvania.
Stable isotopic profiling has long permitted sensitive investigations of the metabolic consequences of genetic mutations and/or pharmacologic therapies in cellular and mammalian models. Here, we describe detailed methods to perform stable isotopic profiling of intermediary metabolism and metabolic flux in the nematode, Caenorhabditis elegans
. Methods are described for profiling whole worm free amino acids, labeled carbon dioxide, labeled organic acids, and labeled amino acids in animals exposed to stable isotopes either from early development on nematode growth media agar plates or beginning as young adults while exposed to various pharmacologic treatments in liquid culture. Free amino acids are quantified by high performance liquid chromatography (HPLC) in whole worm aliquots extracted in 4% perchloric acid. Universally labeled 13
C-glucose or 1,6-13
-glucose is utilized as the stable isotopic precursor whose labeled carbon is traced by mass spectrometry in carbon dioxide (both atmospheric and dissolved) as well as in metabolites indicative of flux through glycolysis, pyruvate metabolism, and the tricarboxylic acid cycle. Representative results are included to demonstrate effects of isotope exposure time, various bacterial clearing protocols, and alternative worm disruption methods in wild-type nematodes, as well as the relative extent of isotopic incorporation in mitochondrial complex III mutant worms (isp-1(qm150)
) relative to wild-type worms. Application of stable isotopic profiling in living nematodes provides a novel capacity to investigate at the whole animal level real-time metabolic alterations that are caused by individual genetic disorders and/or pharmacologic therapies.
Developmental Biology, Issue 48, Stable isotope, amino acid quantitation, organic acid quantitation, nematodes, metabolism
Non-radioactive in situ Hybridization Protocol Applicable for Norway Spruce and a Range of Plant Species
Institutions: Uppsala University, Swedish University of Agricultural Sciences.
The high-throughput expression analysis technologies available today give scientists an overflow of expression profiles but their resolution in terms of tissue specific expression is limited because of problems in dissecting individual tissues. Expression data needs to be confirmed and complemented with expression patterns using e.g. in situ
hybridization, a technique used to localize cell specific mRNA expression. The in situ
hybridization method is laborious, time-consuming and often requires extensive optimization depending on species and tissue. In situ
experiments are relatively more difficult to perform in woody species such as the conifer Norway spruce (Picea abies
). Here we present a modified DIG in situ
hybridization protocol, which is fast and applicable on a wide range of plant species including P. abies
. With just a few adjustments, including altered RNase treatment and proteinase K concentration, we could use the protocol to study tissue specific expression of homologous genes in male reproductive organs of one gymnosperm and two angiosperm species; P. abies, Arabidopsis thaliana
and Brassica napus
. The protocol worked equally well for the species and genes studied. AtAP3
were observed in second and third whorl floral organs in A. thaliana
and B. napus
and DAL13 in microsporophylls of male cones from P. abies
. For P. abies
the proteinase K concentration, used to permeablize the tissues, had to be increased to 3 g/ml instead of 1 g/ml, possibly due to more compact tissues and higher levels of phenolics and polysaccharides. For all species the RNase treatment was removed due to reduced signal strength without a corresponding increase in specificity. By comparing tissue specific expression patterns of homologous genes from both flowering plants and a coniferous tree we demonstrate that the DIG in situ
protocol presented here, with only minute adjustments, can be applied to a wide range of plant species. Hence, the protocol avoids both extensive species specific optimization and the laborious use of radioactively labeled probes in favor of DIG labeled probes. We have chosen to illustrate the technically demanding steps of the protocol in our film.
Anna Karlgren and Jenny Carlsson contributed equally to this study.
Corresponding authors: Anna Karlgren at Anna.Karlgren@ebc.uu.se and Jens F. Sundström at Jens.Sundstrom@vbsg.slu.se
Plant Biology, Issue 26, RNA, expression analysis, Norway spruce, Arabidopsis, rapeseed, conifers
Metabolic Labeling and Membrane Fractionation for Comparative Proteomic Analysis of Arabidopsis thaliana Suspension Cell Cultures
Institutions: Max Plank Institute of Molecular Plant Physiology, University of Hohenheim.
Plasma membrane microdomains are features based on the physical properties of the lipid and sterol environment and have particular roles in signaling processes. Extracting sterol-enriched membrane microdomains from plant cells for proteomic analysis is a difficult task mainly due to multiple preparation steps and sources for contaminations from other cellular compartments. The plasma membrane constitutes only about 5-20% of all the membranes in a plant cell, and therefore isolation of highly purified plasma membrane fraction is challenging. A frequently used method involves aqueous two-phase partitioning in polyethylene glycol and dextran, which yields plasma membrane vesicles with a purity of 95% 1
. Sterol-rich membrane microdomains within the plasma membrane are insoluble upon treatment with cold nonionic detergents at alkaline pH. This detergent-resistant membrane fraction can be separated from the bulk plasma membrane by ultracentrifugation in a sucrose gradient 2
. Subsequently, proteins can be extracted from the low density band of the sucrose gradient by methanol/chloroform precipitation. Extracted protein will then be trypsin digested, desalted and finally analyzed by LC-MS/MS. Our extraction protocol for sterol-rich microdomains is optimized for the preparation of clean detergent-resistant membrane fractions from Arabidopsis thaliana
We use full metabolic labeling of Arabidopsis thaliana
suspension cell cultures with K15
as the only nitrogen source for quantitative comparative proteomic studies following biological treatment of interest 3
. By mixing equal ratios of labeled and unlabeled cell cultures for joint protein extraction the influence of preparation steps on final quantitative result is kept at a minimum. Also loss of material during extraction will affect both control and treatment samples in the same way, and therefore the ratio of light and heave peptide will remain constant. In the proposed method either labeled or unlabeled cell culture undergoes a biological treatment, while the other serves as control 4
Empty Value, Issue 79, Cellular Structures, Plants, Genetically Modified, Arabidopsis, Membrane Lipids, Intracellular Signaling Peptides and Proteins, Membrane Proteins, Isotope Labeling, Proteomics, plants, Arabidopsis thaliana, metabolic labeling, stable isotope labeling, suspension cell cultures, plasma membrane fractionation, two phase system, detergent resistant membranes (DRM), mass spectrometry, membrane microdomains, quantitative proteomics
A New Approach for the Comparative Analysis of Multiprotein Complexes Based on 15N Metabolic Labeling and Quantitative Mass Spectrometry
Institutions: University of Münster, Carnegie Institution for Science.
The introduced protocol provides a tool for the analysis of multiprotein complexes in the thylakoid membrane, by revealing insights into complex composition under different conditions. In this protocol the approach is demonstrated by comparing the composition of the protein complex responsible for cyclic electron flow (CEF) in Chlamydomonas reinhardtii
, isolated from genetically different strains. The procedure comprises the isolation of thylakoid membranes, followed by their separation into multiprotein complexes by sucrose density gradient centrifugation, SDS-PAGE, immunodetection and comparative, quantitative mass spectrometry (MS) based on differential metabolic labeling (14
N) of the analyzed strains. Detergent solubilized thylakoid membranes are loaded on sucrose density gradients at equal chlorophyll concentration. After ultracentrifugation, the gradients are separated into fractions, which are analyzed by mass-spectrometry based on equal volume. This approach allows the investigation of the composition within the gradient fractions and moreover to analyze the migration behavior of different proteins, especially focusing on ANR1, CAS, and PGRL1. Furthermore, this method is demonstrated by confirming the results with immunoblotting and additionally by supporting the findings from previous studies (the identification and PSI-dependent migration of proteins that were previously described to be part of the CEF-supercomplex such as PGRL1, FNR, and cyt f
). Notably, this approach is applicable to address a broad range of questions for which this protocol can be adopted and e.g.
used for comparative analyses of multiprotein complex composition isolated from distinct environmental conditions.
Microbiology, Issue 85, Sucrose density gradients, Chlamydomonas, multiprotein complexes, 15N metabolic labeling, thylakoids
A Protocol for Computer-Based Protein Structure and Function Prediction
Institutions: University of Michigan , University of Kansas.
Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
Biochemistry, Issue 57, On-line server, I-TASSER, protein structure prediction, function prediction
Analysis of Nephron Composition and Function in the Adult Zebrafish Kidney
Institutions: University of Notre Dame.
The zebrafish model has emerged as a relevant system to study kidney development, regeneration and disease. Both the embryonic and adult zebrafish kidneys are composed of functional units known as nephrons, which are highly conserved with other vertebrates, including mammals. Research in zebrafish has recently demonstrated that two distinctive phenomena transpire after adult nephrons incur damage: first, there is robust regeneration within existing nephrons that replaces the destroyed tubule epithelial cells; second, entirely new nephrons are produced from renal progenitors in a process known as neonephrogenesis. In contrast, humans and other mammals seem to have only a limited ability for nephron epithelial regeneration. To date, the mechanisms responsible for these kidney regeneration phenomena remain poorly understood. Since adult zebrafish kidneys undergo both nephron epithelial regeneration and neonephrogenesis, they provide an outstanding experimental paradigm to study these events. Further, there is a wide range of genetic and pharmacological tools available in the zebrafish model that can be used to delineate the cellular and molecular mechanisms that regulate renal regeneration. One essential aspect of such research is the evaluation of nephron structure and function. This protocol describes a set of labeling techniques that can be used to gauge renal composition and test nephron functionality in the adult zebrafish kidney. Thus, these methods are widely applicable to the future phenotypic characterization of adult zebrafish kidney injury paradigms, which include but are not limited to, nephrotoxicant exposure regimes or genetic methods of targeted cell death such as the nitroreductase mediated cell ablation technique. Further, these methods could be used to study genetic perturbations in adult kidney formation and could also be applied to assess renal status during chronic disease modeling.
Cellular Biology, Issue 90,
zebrafish; kidney; nephron; nephrology; renal; regeneration; proximal tubule; distal tubule; segment; mesonephros; physiology; acute kidney injury (AKI)
Polysome Fractionation and Analysis of Mammalian Translatomes on a Genome-wide Scale
Institutions: McGill University, Karolinska Institutet, McGill University.
mRNA translation plays a central role in the regulation of gene expression and represents the most energy consuming process in mammalian cells. Accordingly, dysregulation of mRNA translation is considered to play a major role in a variety of pathological states including cancer. Ribosomes also host chaperones, which facilitate folding of nascent polypeptides, thereby modulating function and stability of newly synthesized polypeptides. In addition, emerging data indicate that ribosomes serve as a platform for a repertoire of signaling molecules, which are implicated in a variety of post-translational modifications of newly synthesized polypeptides as they emerge from the ribosome, and/or components of translational machinery. Herein, a well-established method of ribosome fractionation using sucrose density gradient centrifugation is described. In conjunction with the in-house developed “anota” algorithm this method allows direct determination of differential translation of individual mRNAs on a genome-wide scale. Moreover, this versatile protocol can be used for a variety of biochemical studies aiming to dissect the function of ribosome-associated protein complexes, including those that play a central role in folding and degradation of newly synthesized polypeptides.
Biochemistry, Issue 87, Cells, Eukaryota, Nutritional and Metabolic Diseases, Neoplasms, Metabolic Phenomena, Cell Physiological Phenomena, mRNA translation, ribosomes,
protein synthesis, genome-wide analysis, translatome, mTOR, eIF4E, 4E-BP1
Protocols for Implementing an Escherichia coli Based TX-TL Cell-Free Expression System for Synthetic Biology
Institutions: California Institute of Technology, California Institute of Technology, Massachusetts Institute of Technology, University of Minnesota.
Ideal cell-free expression systems can theoretically emulate an in vivo
cellular environment in a controlled in vitro
This is useful for expressing proteins and genetic circuits in a controlled manner as well as for providing a prototyping environment for synthetic biology.2,3
To achieve the latter goal, cell-free expression systems that preserve endogenous Escherichia coli transcription-translation mechanisms are able to more accurately reflect in vivo
cellular dynamics than those based on T7 RNA polymerase transcription. We describe the preparation and execution of an efficient endogenous E. coli
based transcription-translation (TX-TL) cell-free expression system that can produce equivalent amounts of protein as T7-based systems at a 98% cost reduction to similar commercial systems.4,5
The preparation of buffers and crude cell extract are described, as well as the execution of a three tube TX-TL reaction. The entire protocol takes five days to prepare and yields enough material for up to 3000 single reactions in one preparation. Once prepared, each reaction takes under 8 hr from setup to data collection and analysis. Mechanisms of regulation and transcription exogenous to E. coli
, such as lac/tet repressors and T7 RNA polymerase, can be supplemented.6
Endogenous properties, such as mRNA and DNA degradation rates, can also be adjusted.7
The TX-TL cell-free expression system has been demonstrated for large-scale circuit assembly, exploring biological phenomena, and expression of proteins under both T7- and endogenous promoters.6,8
Accompanying mathematical models are available.9,10
The resulting system has unique applications in synthetic biology as a prototyping environment, or "TX-TL biomolecular breadboard."
Cellular Biology, Issue 79, Bioengineering, Synthetic Biology, Chemistry Techniques, Synthetic, Molecular Biology, control theory, TX-TL, cell-free expression, in vitro, transcription-translation, cell-free protein synthesis, synthetic biology, systems biology, Escherichia coli cell extract, biological circuits, biomolecular breadboard
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Using SecM Arrest Sequence as a Tool to Isolate Ribosome Bound Polypeptides
Institutions: Cleveland State University.
Extensive research has provided ample evidences suggesting that protein folding in the cell is a co-translational process1-5
. However, the exact pathway that polypeptide chain follows during co-translational folding to achieve its functional form is still an enigma. In order to understand this process and to determine the exact conformation of the co-translational folding intermediates, it is essential to develop techniques that allow the isolation of RNCs carrying nascent chains of predetermined sizes to allow their further structural analysis.
SecM (secretion monitor) is a 170 amino acid E. coli
protein that regulates expression of the downstream SecA (secretion driving) ATPase in the secM-secA
. Nakatogawa and Ito originally found that a 17 amino acid long sequence (150-FSTPVWISQAQGIRAG
P-166) in the C-terminal region of the SecM protein is sufficient and necessary to cause stalling of SecM elongation at Gly165, thereby producing peptidyl-glycyl-tRNA stably bound to the ribosomal P-site7-9
. More importantly, it was found that this 17 amino acid long sequence can be fused to the C-terminus of virtually any full-length and/or truncated protein thus allowing the production of RNCs carrying nascent chains of predetermined sizes7
. Thus, when fused or inserted into the target protein, SecM stalling sequence produces arrest of the polypeptide chain elongation and generates stable RNCs both in vivo
in E. coli
cells and in vitro
in a cell-free system. Sucrose gradient centrifugation is further utilized to isolate RNCs.
The isolated RNCs can be used to analyze structural and functional features of the co-translational folding intermediates. Recently, this technique has been successfully used to gain insights into the structure of several ribosome bound nascent chains10,11
. Here we describe the isolation of bovine Gamma-B Crystallin RNCs fused to SecM and generated in an in vitro
Molecular Biology, Issue 64, Ribosome, nascent polypeptides, co-translational protein folding, translational arrest, in vitro translation
Eukaryotic Polyribosome Profile Analysis
Institutions: University of Medicine and Dentistry of New Jersey, Robert Wood Johnson Medical School.
Protein synthesis is a complex cellular process that is regulated at many levels. For example, global translation can be inhibited at the initiation phase or the elongation phase by a variety of cellular stresses such as amino acid starvation or growth factor withdrawal. Alternatively, translation of individual mRNAs can be regulated by mRNA localization or the presence of cognate microRNAs. Studies of protein synthesis frequently utilize polyribosome analysis to shed light on the mechanisms of translation regulation or defects in protein synthesis. In this assay, mRNA/ribosome complexes are isolated from eukaryotic cells. A sucrose density gradient separates mRNAs bound to multiple ribosomes known as polyribosomes from mRNAs bound to a single ribosome or monosome. Fractionation of the gradients allows isolation and quantification of the different ribosomal populations and their associated mRNAs or proteins. Differences in the ratio of polyribosomes to monosomes under defined conditions can be indicative of defects in either translation initiation or elongation/termination. Examination of the mRNAs present in the polyribosome fractions can reveal whether the cohort of individual mRNAs being translated changes with experimental conditions. In addition, ribosome assembly can be monitored by analysis of the small and large ribosomal subunit peaks which are also separated by the gradient. In this video, we present a method for the preparation of crude ribosomal extracts from yeast cells, separation of the extract by sucrose gradient and interpretation of the results. This procedure is readily adaptable to mammalian cells.
Cellular Biology, Issue 40, translation, ribosome, polyribosome, gradient, fractionation
Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
Institutions: University of Washington, Iowa State University, North Carolina A&T University, Iowa Geological and Water Survey.
Finding the cost-efficient (i.e.
, lowest-cost) ways of targeting conservation practice investments for the achievement of specific water quality goals across the landscape is of primary importance in watershed management. Traditional economics methods of finding the lowest-cost solution in the watershed context (e.g.
) assume that off-site impacts can be accurately described as a proportion of on-site pollution generated. Such approaches are unlikely to be representative of the actual pollution process in a watershed, where the impacts of polluting sources are often determined by complex biophysical processes. The use of modern physically-based, spatially distributed hydrologic simulation models allows for a greater degree of realism in terms of process representation but requires a development of a simulation-optimization framework where the model becomes an integral part of optimization.
Evolutionary algorithms appear to be a particularly useful optimization tool, able to deal with the combinatorial nature of a watershed simulation-optimization problem and allowing the use of the full water quality model. Evolutionary algorithms treat a particular spatial allocation of conservation practices in a watershed as a candidate solution and utilize sets (populations) of candidate solutions iteratively applying stochastic operators of selection, recombination, and mutation to find improvements with respect to the optimization objectives. The optimization objectives in this case are to minimize nonpoint-source pollution in the watershed, simultaneously minimizing the cost of conservation practices. A recent and expanding set of research is attempting to use similar methods and integrates water quality models with broadly defined evolutionary optimization methods3,4,9,10,13-15,17-19,22,23,25
. In this application, we demonstrate a program which follows Rabotyagov et al.'s approach and integrates a modern and commonly used SWAT water quality model7
with a multiobjective evolutionary algorithm SPEA226
, and user-specified set of conservation practices and their costs to search for the complete tradeoff frontiers between costs of conservation practices and user-specified water quality objectives. The frontiers quantify the tradeoffs faced by the watershed managers by presenting the full range of costs associated with various water quality improvement goals. The program allows for a selection of watershed configurations achieving specified water quality improvement goals and a production of maps of optimized placement of conservation practices.
Environmental Sciences, Issue 70, Plant Biology, Civil Engineering, Forest Sciences, Water quality, multiobjective optimization, evolutionary algorithms, cost efficiency, agriculture, development
Predicting the Effectiveness of Population Replacement Strategy Using Mathematical Modeling
Institutions: University of California, Los Angeles.
Charles Taylor and John Marshall explain the utility of mathematical modeling for evaluating the effectiveness of population replacement strategy. Insight is given into how computational models can provide information on the population dynamics of mosquitoes and the spread of transposable elements through A. gambiae subspecies. The ethical considerations of releasing genetically modified mosquitoes into the wild are discussed.
Cellular Biology, Issue 5, mosquito, malaria, popuulation, replacement, modeling, infectious disease
Molecular Evolution of the Tre Recombinase
Institutions: Max Plank Institute for Molecular Cell Biology and Genetics, Dresden.
Here we report the generation of Tre recombinase through directed, molecular evolution. Tre recombinase recognizes a pre-defined target sequence within the LTR sequences of the HIV-1 provirus, resulting in the excision and eradication of the provirus from infected human cells.
We started with Cre, a 38-kDa recombinase, that recognizes a 34-bp double-stranded DNA sequence known as loxP. Because Cre can effectively eliminate genomic sequences, we set out to tailor a recombinase that could remove the sequence between the 5'-LTR and 3'-LTR of an integrated HIV-1 provirus. As a first step we identified sequences within the LTR sites that were similar to loxP and tested for recombination activity. Initially Cre and mutagenized Cre libraries failed to recombine the chosen loxLTR sites of the HIV-1 provirus. As the start of any directed molecular evolution process requires at least residual activity, the original asymmetric loxLTR sequences were split into subsets and tested again for recombination activity. Acting as intermediates, recombination activity was shown with the subsets. Next, recombinase libraries were enriched through reiterative evolution cycles. Subsequently, enriched libraries were shuffled and recombined. The combination of different mutations proved synergistic and recombinases were created that were able to recombine loxLTR1 and loxLTR2. This was evidence that an evolutionary strategy through intermediates can be successful. After a total of 126 evolution cycles individual recombinases were functionally and structurally analyzed. The most active recombinase -- Tre -- had 19 amino acid changes as compared to Cre. Tre recombinase was able to excise the HIV-1 provirus from the genome HIV-1 infected HeLa cells (see "HIV-1 Proviral DNA Excision Using an Evolved Recombinase", Hauber J., Heinrich-Pette-Institute for Experimental Virology and Immunology, Hamburg, Germany). While still in its infancy, directed molecular evolution will allow the creation of custom enzymes that will serve as tools of "molecular surgery" and molecular medicine.
Cell Biology, Issue 15, HIV-1, Tre recombinase, Site-specific recombination, molecular evolution