Many researchers, across incredibly diverse foci, are applying phylogenetics to their research question(s). However, many researchers are new to this topic and so it presents inherent problems. Here we compile a practical introduction to phylogenetics for nonexperts. We outline in a step-by-step manner, a pipeline for generating reliable phylogenies from gene sequence datasets. We begin with a user-guide for similarity search tools via online interfaces as well as local executables. Next, we explore programs for generating multiple sequence alignments followed by protocols for using software to determine best-fit models of evolution. We then outline protocols for reconstructing phylogenetic relationships via maximum likelihood and Bayesian criteria and finally describe tools for visualizing phylogenetic trees. While this is not by any means an exhaustive description of phylogenetic approaches, it does provide the reader with practical starting information on key software applications commonly utilized by phylogeneticists. The vision for this article would be that it could serve as a practical training tool for researchers embarking on phylogenetic studies and also serve as an educational resource that could be incorporated into a classroom or teaching-lab.
20 Related JoVE Articles!
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
Given the ever expanding number of model plant species for which complete genome sequences are available and the abundance of bio-resources such as knockout mutants, wild accessions and advanced breeding populations, there is a rising burden for gene functional annotation. In this protocol, annotation of plant gene function using combined co-expression gene analysis, metabolomics and informatics is provided (Figure 1
). This approach is based on the theory of using target genes of known function to allow the identification of non-annotated genes likely to be involved in a certain metabolic process, with the identification of target compounds via metabolomics. Strategies are put forward for applying this information on populations generated by both forward and reverse genetics approaches in spite of none of these are effortless. By corollary this approach can also be used as an approach to characterise unknown peaks representing new or specific secondary metabolites in the limited tissues, plant species or stress treatment, which is currently the important trial to understanding plant metabolism.
Plant Biology, Issue 64, Genetics, Bioinformatics, Metabolomics, Plant metabolism, Transcriptome analysis, Functional annotation, Computational biology, Plant biology, Theoretical biology, Spectroscopy and structural analysis
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
Institutions: Stony Brook University, Cold Spring Harbor Laboratory, University of Texas at Dallas.
ChIPseq is a widely used technique for investigating protein-DNA interactions. Read density profiles are generated by using next-sequencing of protein-bound DNA and aligning the short reads to a reference genome. Enriched regions are revealed as peaks, which often differ dramatically in shape, depending on the target protein1
. For example, transcription factors often bind in a site- and sequence-specific manner and tend to produce punctate peaks, while histone modifications are more pervasive and are characterized by broad, diffuse islands of enrichment2
. Reliably identifying these regions was the focus of our work.
Algorithms for analyzing ChIPseq data have employed various methodologies, from heuristics3-5
to more rigorous statistical models, e.g.
Hidden Markov Models (HMMs)6-8
. We sought a solution that minimized the necessity for difficult-to-define, ad hoc parameters that often compromise resolution and lessen the intuitive usability of the tool. With respect to HMM-based methods, we aimed to curtail parameter estimation procedures and simple, finite state classifications that are often utilized.
Additionally, conventional ChIPseq data analysis involves categorization of the expected read density profiles as either punctate or diffuse followed by subsequent application of the appropriate tool. We further aimed to replace the need for these two distinct models with a single, more versatile model, which can capably address the entire spectrum of data types.
To meet these objectives, we first constructed a statistical framework that naturally modeled ChIPseq data structures using a cutting edge advance in HMMs9
, which utilizes only explicit formulas-an innovation crucial to its performance advantages. More sophisticated then heuristic models, our HMM accommodates infinite hidden states through a Bayesian model. We applied it to identifying reasonable change points in read density, which further define segments of enrichment. Our analysis revealed how our Bayesian Change Point (BCP) algorithm had a reduced computational complexity-evidenced by an abridged run time and memory footprint. The BCP algorithm was successfully applied to both punctate peak and diffuse island identification with robust accuracy and limited user-defined parameters. This illustrated both its versatility and ease of use. Consequently, we believe it can be implemented readily across broad ranges of data types and end users in a manner that is easily compared and contrasted, making it a great tool for ChIPseq data analysis that can aid in collaboration and corroboration between research groups. Here, we demonstrate the application of BCP to existing transcription factor10,11
and epigenetic data12
to illustrate its usefulness.
Genetics, Issue 70, Bioinformatics, Genomics, Molecular Biology, Cellular Biology, Immunology, Chromatin immunoprecipitation, ChIP-Seq, histone modifications, segmentation, Bayesian, Hidden Markov Models, epigenetics
Qualitative Identification of Carboxylic Acids, Boronic Acids, and Amines Using Cruciform Fluorophores
Institutions: Ruprecht-Karls-Universität Heidelberg, University of Houston.
Molecular cruciforms are X-shaped systems in which two conjugation axes intersect at a central core. If one axis of these molecules is substituted with electron-donors, and the other with electron-acceptors, cruciforms' HOMO will localize along the electron-rich and LUMO along the electron-poor axis. This spatial isolation of cruciforms' frontier molecular orbitals (FMOs) is essential to their use as sensors, since analyte binding to the cruciform invariably changes its HOMO-LUMO gap and the associated optical properties. Using this principle, Bunz and Miljanić groups developed 1,4-distyryl-2,5-bis(arylethynyl)benzene and benzobisoxazole cruciforms, respectively, which act as fluorescent sensors for metal ions, carboxylic acids, boronic acids, phenols, amines, and anions. The emission colors observed when these cruciform are mixed with analytes are highly sensitive to the details of analyte's structure and - because of cruciforms' charge-separated excited states - to the solvent in which emission is observed. Structurally closely related species can be qualitatively distinguished within several analyte classes: (a
) carboxylic acids; (b
) boronic acids, and (c
) metals. Using a hybrid sensing system composed from benzobisoxazole cruciforms and boronic acid additives, we were also able to discern among structurally similar: (d
) small organic and inorganic anions, (e
) amines, and (f
) phenols. The method used for this qualitative distinction is exceedingly simple. Dilute solutions (typically 10-6
M) of cruciforms in several off-the-shelf solvents are placed in UV/Vis vials. Then, analytes of interest are added, either directly as solids or in concentrated solution. Fluorescence changes occur virtually instantaneously and can be recorded through standard digital photography using a semi-professional digital camera in a dark room. With minimal graphic manipulation, representative cut-outs of emission color photographs can be arranged into panels which permit quick naked-eye distinction among analytes. For quantification purposes, Red/Green/Blue values can be extracted from these photographs and the obtained numeric data can be statistically processed.
Chemistry, Issue 78, Chemical Engineering, Organic Chemistry, Amines, analytical chemistry, organic chemistry, spectrophotometry (application), spectroscopic chemical analysis (application), Heterocyclic Compounds, fluorescence, cruciform, benzobisoxazole, alkyne, pharmaceuticals, quality control, imaging
Measurement of Antibody Effects on Cellular Function of Isolated Cardiomyocytes
Institutions: University Medicine Greifswald.
Dilated cardiomyopathy (DCM) is one of the main causes for heart failure in younger adults1
. Although genetic disposition and exposition to toxic substances are known causes for this disease in about one third of the patients, the origin of DCM remains largely unclear. In a substantial number of these patients, autoantibodies against cardiac epitopes have been detected and are suspected to play a pivotal role in the onset and progression of the disease2,3
. The importance of cardiac autoantibodies is underlined by a hemodynamic improvement observed in DCM patients after elimination of autoantibodies by immunoadsorption3-5
. A variety of specific antigens have already been identified2,3
and antibodies against these targets may be detected by immunoassays. However, these assays cannot discriminate between stimulating (and therefore functionally effective) and blocking autoantibodies. There is increasing evidence that this distinction is crucial6,7
. It can also be assumed that the targets for a number of cardiotropic antibodies are still unidentified and therefore cannot be detected by immunoassays. Therefore, we established a method for the detection of functionally active cardiotropic antibodies, independent of their respective antigen. The background for the method is the high homology usually observed for functional regions of cardiac proteins in between mammals8,9
. This suggests that cardiac antibodies directed against human antigens will cross-react with non-human target cells, which allows testing of IgG from DCM patients on adult rat cardiomyocytes. Our method consists of 3 steps: first, IgG is isolated from patient plasma using sepharose coupled anti-IgG antibodies obtained from immunoadsorption columns (PlasmaSelect, Teterow, Germany). Second, adult cardiomyocytes are isolated by collagenase perfusion in a Langendorff perfusion apparatus using a protocol modified from previous works10,11
. The obtained cardiomyocytes are attached to laminin-coated chambered coverglasses and stained with Fura-2, a calcium-selective fluorescent dye which can be easily brought into the cell to observe intracellular calcium (Ca2+
. In the last step, the effect of patient IgG on the cell shortening and Ca2+
transients of field stimulated cardiomyocytes is monitored online using a commercial myocyte calcium and contractility monitoring system (IonOptix, Milton, MA, USA) connected to a standard inverse fluorescent microscope.
Immunology, Issue 73, Medicine, Cellular Biology, Molecular Biology, Biomedical Engineering, Physiology, Anatomy, Cardiology, cardiomyocytes, cell shortening, intracellular Ca2+, Fura-2, antibodies, dilated cardiomyopathy, DCM, IgG, cardiac proteins, Langendorff perfusion, electrode, immunoassay, assay, cell culture, animal model
Soft Lithographic Functionalization and Patterning Oxide-free Silicon and Germanium
Institutions: Duke University , University of Rochester , University of Rochester .
The development of hybrid electronic devices relies in large part on the integration of (bio)organic materials and inorganic semiconductors through a stable interface that permits efficient electron transport and protects underlying substrates from oxidative degradation. Group IV semiconductors can be effectively protected with highly-ordered self-assembled monolayers (SAMs) composed of simple alkyl chains that act as impervious barriers to both organic and aqueous solutions. Simple alkyl SAMs, however, are inert and not amenable to traditional patterning techniques. The motivation for immobilizing organic molecular systems on semiconductors is to impart new functionality to the surface that can provide optical, electronic, and mechanical function, as well as chemical and biological activity.
Microcontact printing (μ
CP) is a soft-lithographic technique for patterning SAMs on myriad surfaces.1-9
Despite its simplicity and versatility, the approach has been largely limited to noble metal surfaces and has not been well developed for pattern transfer to technologically important substrates such as oxide-free silicon and germanium. Furthermore, because this technique relies on the ink diffusion to transfer pattern from the elastomer to substrate, the resolution of such traditional printing is essentially limited to near 1 μ
In contrast to traditional printing, inkless μ
CP patterning relies on a specific reaction between a surface-immobilized substrate and a stamp-bound catalyst. Because the technique does not rely on diffusive SAM formation, it significantly expands the diversity of patternable surfaces. In addition, the inkless technique obviates the feature size limitations imposed by molecular diffusion, facilitating replication of very small (<200 nm) features.17-23
However, up till now, inkless μ
CP has been mainly used for patterning relatively disordered molecular systems, which do not protect underlying surfaces from degradation.
Here, we report a simple, reliable high-throughput method for patterning passivated silicon and germanium with reactive organic monolayers and demonstrate selective functionalization of the patterned substrates with both small molecules and proteins. The technique utilizes a preformed NHS-reactive bilayered system on oxide-free silicon and germanium. The NHS moiety is hydrolyzed in a pattern-specific manner with a sulfonic acid-modified acrylate stamp to produce chemically distinct patterns of NHS-activated and free carboxylic acids. A significant limitation to the resolution of many μ
CP techniques is the use of PDMS material which lacks the mechanical rigidity necessary for high fidelity transfer. To alleviate this limitation we utilized a polyurethane acrylate polymer, a relatively rigid material that can be easily functionalized with different organic moieties. Our patterning approach completely protects both silicon and germanium from chemical oxidation, provides precise control over the shape and size of the patterned features, and gives ready access to chemically discriminated patterns that can be further functionalized with both organic and biological molecules. The approach is general and applicable to other technologically-relevant surfaces.
Bioengineering, Issue 58, Soft lithography, microcontact printing, protein arrays, catalytic printing, oxide-free silicon
A Technique to Screen American Beech for Resistance to the Beech Scale Insect (Cryptococcus fagisuga Lind.)
Institutions: US Forest Service.
Beech bark disease (BBD) results in high levels of initial mortality, leaving behind survivor trees that are greatly weakened and deformed. The disease is initiated by feeding activities of the invasive beech scale insect, Cryptococcus fagisuga
, which creates entry points for infection by one of the Neonectria
species of fungus. Without scale infestation, there is little opportunity for fungal infection. Using scale eggs to artificially infest healthy trees in heavily BBD impacted stands demonstrated that these trees were resistant to the scale insect portion of the disease complex1
. Here we present a protocol that we have developed, based on the artificial infestation technique by Houston2
, which can be used to screen for scale-resistant trees in the field and in smaller potted seedlings and grafts. The identification of scale-resistant trees is an important component of management of BBD through tree improvement programs and silvicultural manipulation.
Environmental Sciences, Issue 87, Forestry, Insects, Disease Resistance, American beech, Fagus grandifolia, beech scale, Cryptococcus fagisuga, resistance, screen, bioassay
Transabdominal Ultrasound for Pregnancy Diagnosis in Reeves' Muntjac Deer
Institutions: Colorado State University.
Reeves' muntjac deer (Muntiacus reevesi
) are a small cervid species native to southeast Asia, and are currently being investigated as a potential model of prion disease transmission and pathogenesis. Vertical transmission is an area of interest among researchers studying infectious diseases, including prion disease, and these investigations require efficient methods for evaluating the effects of maternal infection on reproductive performance. Ultrasonographic examination is a well-established tool for diagnosing pregnancy and assessing fetal health in many animal species1-7
, including several species of farmed cervids8-19
, however this technique has not been described in Reeves' muntjac deer. Here we describe the application of transabdominal ultrasound to detect pregnancy in muntjac does and to evaluate fetal growth and development throughout the gestational period. Using this procedure, pregnant animals were identified as early as 35 days following doe-buck pairing and this was an effective means to safely monitor the pregnancy at regular intervals. Future goals of this work will include establishing normal fetal measurement references for estimation of gestational age, determining sensitivity and specificity of the technique for diagnosing pregnancy at various stages of gestation, and identifying variations in fetal growth and development under different experimental conditions.
Medicine, Issue 83, Ultrasound, Reeves' muntjac deer, Muntiacus reevesi, fetal development, fetal growth, captive cervids
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
Setting Limits on Supersymmetry Using Simplified Models
Institutions: University College London, CERN, Lawrence Berkeley National Laboratories.
Experimental limits on supersymmetry and similar theories are difficult to set because of the enormous available parameter space and difficult to generalize because of the complexity of single points. Therefore, more phenomenological, simplified models are becoming popular for setting experimental limits, as they have clearer physical interpretations. The use of these simplified model limits to set a real limit on a concrete theory has not, however, been demonstrated. This paper recasts simplified model limits into limits on a specific and complete supersymmetry model, minimal supergravity. Limits obtained under various physical assumptions are comparable to those produced by directed searches. A prescription is provided for calculating conservative and aggressive limits on additional theories. Using acceptance and efficiency tables along with the expected and observed numbers of events in various signal regions, LHC experimental results can be recast in this manner into almost any theoretical framework, including nonsupersymmetric theories with supersymmetry-like signatures.
Physics, Issue 81, high energy physics, particle physics, Supersymmetry, LHC, ATLAS, CMS, New Physics Limits, Simplified Models
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
From Voxels to Knowledge: A Practical Guide to the Segmentation of Complex Electron Microscopy 3D-Data
Institutions: Lawrence Berkeley National Laboratory, Lawrence Berkeley National Laboratory, Lawrence Berkeley National Laboratory.
Modern 3D electron microscopy approaches have recently allowed unprecedented insight into the 3D ultrastructural organization of cells and tissues, enabling the visualization of large macromolecular machines, such as adhesion complexes, as well as higher-order structures, such as the cytoskeleton and cellular organelles in their respective cell and tissue context. Given the inherent complexity of cellular volumes, it is essential to first extract the features of interest in order to allow visualization, quantification, and therefore comprehension of their 3D organization. Each data set is defined by distinct characteristics, e.g.
, signal-to-noise ratio, crispness (sharpness) of the data, heterogeneity of its features, crowdedness of features, presence or absence of characteristic shapes that allow for easy identification, and the percentage of the entire volume that a specific region of interest occupies. All these characteristics need to be considered when deciding on which approach to take for segmentation.
The six different 3D ultrastructural data sets presented were obtained by three different imaging approaches: resin embedded stained electron tomography, focused ion beam- and serial block face- scanning electron microscopy (FIB-SEM, SBF-SEM) of mildly stained and heavily stained samples, respectively. For these data sets, four different segmentation approaches have been applied: (1) fully manual model building followed solely by visualization of the model, (2) manual tracing segmentation of the data followed by surface rendering, (3) semi-automated approaches followed by surface rendering, or (4) automated custom-designed segmentation algorithms followed by surface rendering and quantitative analysis. Depending on the combination of data set characteristics, it was found that typically one of these four categorical approaches outperforms the others, but depending on the exact sequence of criteria, more than one approach may be successful. Based on these data, we propose a triage scheme that categorizes both objective data set characteristics and subjective personal criteria for the analysis of the different data sets.
Bioengineering, Issue 90, 3D electron microscopy, feature extraction, segmentation, image analysis, reconstruction, manual tracing, thresholding
Improved In-gel Reductive β-Elimination for Comprehensive O-linked and Sulfo-glycomics by Mass Spectrometry
Institutions: University of Georgia, University of Georgia, Ishikawa Prefectural University.
Separation of proteins by SDS-PAGE followed by in-gel proteolytic digestion of resolved protein bands has produced high-resolution proteomic analysis of biological samples. Similar approaches, that would allow in-depth analysis of the glycans carried by glycoproteins resolved by SDS-PAGE, require special considerations in order to maximize recovery and sensitivity when using mass spectrometry (MS) as the detection method. A major hurdle to be overcome in achieving high-quality data is the removal of gel-derived contaminants that interfere with MS analysis. The sample workflow presented here is robust, efficient, and eliminates the need for in-line HPLC clean-up prior to MS. Gel pieces containing target proteins are washed in acetonitrile, water, and ethyl acetate to remove contaminants, including polymeric acrylamide fragments. O-linked glycans are released from target proteins by in-gel reductive β-elimination and recovered through robust, simple clean-up procedures. An advantage of this workflow is that it improves sensitivity for detecting and characterizing sulfated glycans. These procedures produce an efficient separation of sulfated permethylated glycans from non-sulfated (sialylated and neutral) permethylated glycans by a rapid phase-partition prior to MS analysis, and thereby enhance glycomic and sulfoglycomic analyses of glycoproteins resolved by SDS-PAGE.
Chemistry, Issue 93, glycoprotein, glycosylation, in-gel reductive β-elimination, O-linked glycan, sulfated glycan, mass spectrometry, protein ID, SDS-PAGE, glycomics, sulfoglycomics
Split-and-pool Synthesis and Characterization of Peptide Tertiary Amide Library
Institutions: The Scripps Research Institute.
Peptidomimetics are great sources of protein ligands. The oligomeric nature of these compounds enables us to access large synthetic libraries on solid phase by using combinatorial chemistry. One of the most well studied classes of peptidomimetics is peptoids. Peptoids are easy to synthesize and have been shown to be proteolysis-resistant and cell-permeable. Over the past decade, many useful protein ligands have been identified through screening of peptoid libraries. However, most of the ligands identified from peptoid libraries do not display high affinity, with rare exceptions. This may be due, in part, to the lack of chiral centers and conformational constraints in peptoid molecules. Recently, we described a new synthetic route to access peptide tertiary amides (PTAs). PTAs are a superfamily of peptidomimetics that include but are not limited to peptides, peptoids and N-methylated peptides. With side chains on both α-carbon and main chain nitrogen atoms, the conformation of these molecules are greatly constrained by sterical hindrance and allylic 1,3 strain. (Figure 1
) Our study suggests that these PTA molecules are highly structured in solution and can be used to identify protein ligands. We believe that these molecules can be a future source of high-affinity protein ligands. Here we describe the synthetic method combining the power of both split-and-pool and sub-monomer strategies to synthesize a sample one-bead one-compound (OBOC) library of PTAs.
Chemistry, Issue 88, Split-and-pool synthesis, peptide tertiary amide, PTA, peptoid, high-throughput screening, combinatorial library, solid phase, triphosgene (BTC), one-bead one-compound, OBOC
Solid Phase Synthesis of a Functionalized Bis-Peptide Using "Safety Catch" Methodology
Institutions: Temple University .
In 1962, R.B. Merrifield published the first procedure using solid-phase peptide synthesis as a novel route to efficiently synthesize peptides. This technique quickly proved advantageous over its solution-phase predecessor in both time and labor. Improvements concerning the nature of solid support, the protecting groups employed and the coupling methods employed over the last five decades have only increased the usefulness of Merrifield's original system. Today, use of a Boc-based protection and base/nucleophile cleavable resin strategy or Fmoc-based protection and acidic cleavable resin strategy, pioneered by R.C. Sheppard, are most commonly used for the synthesis of peptides1
Inspired by Merrifield's solid supported strategy, we have developed a Boc/tert-butyl solid-phase synthesis strategy for the assembly of functionalized bis-peptides2
, which is described herein. The use of solid-phase synthesis compared to solution-phase methodology is not only advantageous in both time and labor as described by Merrifield1
, but also allows greater ease in the synthesis of bis-peptide libraries. The synthesis that we demonstrate here incorporates a final cleavage stage that uses a two-step "safety catch" mechanism to release the functionalized bis-peptide from the resin by diketopiperazine formation.
Bis-peptides are rigid, spiro-ladder oligomers of bis-amino acids that are able to position functionality in a predictable and designable way, controlled by the type and stereochemistry of the monomeric units and the connectivity between each monomer. Each bis-amino acid is a stereochemically pure, cyclic scaffold that contains two amino acids (a carboxylic acid with an α-amine)3,4
. Our laboratory is currently investigating the potential of functional bis-peptides across a wide variety of fields including catalysis, protein-protein interactions and nanomaterials.
Chemistry, Issue 63, bis-peptides, solid phase peptide synthesis, bis-amino acids, safety catch, HMBA, DTRA
The Analysis of Purkinje Cell Dendritic Morphology in Organotypic Slice Cultures
Institutions: University of Basel.
Purkinje cells are an attractive model system for studying dendritic development, because they have an impressive dendritic tree which is strictly oriented in the sagittal plane and develops mostly in the postnatal period in small rodents 3
. Furthermore, several antibodies are available which selectively and intensively label Purkinje cells including all processes, with anti-Calbindin D28K being the most widely used. For viewing of dendrites in living cells, mice expressing EGFP selectively in Purkinje cells 11
are available through Jackson labs. Organotypic cerebellar slice cultures cells allow easy experimental manipulation of Purkinje cell dendritic development because most of the dendritic expansion of the Purkinje cell dendritic tree is actually taking place during the culture period 4
. We present here a short, reliable and easy protocol for viewing and analyzing the dendritic morphology of Purkinje cells grown in organotypic cerebellar slice cultures. For many purposes, a quantitative evaluation of the Purkinje cell dendritic tree is desirable. We focus here on two parameters, dendritic tree size and branch point numbers, which can be rapidly and easily determined from anti-calbindin stained cerebellar slice cultures. These two parameters yield a reliable and sensitive measure of changes of the Purkinje cell dendritic tree. Using the example of treatments with the protein kinase C (PKC) activator PMA and the metabotropic glutamate receptor 1 (mGluR1) we demonstrate how differences in the dendritic development are visualized and quantitatively assessed. The combination of the presence of an extensive dendritic tree, selective and intense immunostaining methods, organotypic slice cultures which cover the period of dendritic growth and a mouse model with Purkinje cell specific EGFP expression make Purkinje cells a powerful model system for revealing the mechanisms of dendritic development.
Neuroscience, Issue 61, dendritic development, dendritic branching, cerebellum, Purkinje cells
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Simultaneous Multicolor Imaging of Biological Structures with Fluorescence Photoactivation Localization Microscopy
Institutions: University of Maine.
Localization-based super resolution microscopy can be applied to obtain a spatial map (image) of the distribution of individual fluorescently labeled single molecules within a sample with a spatial resolution of tens of nanometers. Using either photoactivatable (PAFP) or photoswitchable (PSFP) fluorescent proteins fused to proteins of interest, or organic dyes conjugated to antibodies or other molecules of interest, fluorescence photoactivation localization microscopy (FPALM) can simultaneously image multiple species of molecules within single cells. By using the following approach, populations of large numbers (thousands to hundreds of thousands) of individual molecules are imaged in single cells and localized with a precision of ~10-30 nm. Data obtained can be applied to understanding the nanoscale spatial distributions of multiple protein types within a cell. One primary advantage of this technique is the dramatic increase in spatial resolution: while diffraction limits resolution to ~200-250 nm in conventional light microscopy, FPALM can image length scales more than an order of magnitude smaller. As many biological hypotheses concern the spatial relationships among different biomolecules, the improved resolution of FPALM can provide insight into questions of cellular organization which have previously been inaccessible to conventional fluorescence microscopy. In addition to detailing the methods for sample preparation and data acquisition, we here describe the optical setup for FPALM. One additional consideration for researchers wishing to do super-resolution microscopy is cost: in-house setups are significantly cheaper than most commercially available imaging machines. Limitations of this technique include the need for optimizing the labeling of molecules of interest within cell samples, and the need for post-processing software to visualize results. We here describe the use of PAFP and PSFP expression to image two protein species in fixed cells. Extension of the technique to living cells is also described.
Basic Protocol, Issue 82, Microscopy, Super-resolution imaging, Multicolor, single molecule, FPALM, Localization microscopy, fluorescent proteins
An Affordable HIV-1 Drug Resistance Monitoring Method for Resource Limited Settings
Institutions: University of KwaZulu-Natal, Durban, South Africa, Jembi Health Systems, University of Amsterdam, Stanford Medical School.
HIV-1 drug resistance has the potential to seriously compromise the effectiveness and impact of antiretroviral therapy (ART). As ART programs in sub-Saharan Africa continue to expand, individuals on ART should be closely monitored for the emergence of drug resistance. Surveillance of transmitted drug resistance to track transmission of viral strains already resistant to ART is also critical. Unfortunately, drug resistance testing is still not readily accessible in resource limited settings, because genotyping is expensive and requires sophisticated laboratory and data management infrastructure. An open access genotypic drug resistance monitoring method to manage individuals and assess transmitted drug resistance is described. The method uses free open source software for the interpretation of drug resistance patterns and the generation of individual patient reports. The genotyping protocol has an amplification rate of greater than 95% for plasma samples with a viral load >1,000 HIV-1 RNA copies/ml. The sensitivity decreases significantly for viral loads <1,000 HIV-1 RNA copies/ml. The method described here was validated against a method of HIV-1 drug resistance testing approved by the United States Food and Drug Administration (FDA), the Viroseq genotyping method. Limitations of the method described here include the fact that it is not automated and that it also failed to amplify the circulating recombinant form CRF02_AG from a validation panel of samples, although it amplified subtypes A and B from the same panel.
Medicine, Issue 85, Biomedical Technology, HIV-1, HIV Infections, Viremia, Nucleic Acids, genetics, antiretroviral therapy, drug resistance, genotyping, affordable
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1
. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1
. In this article, we utilize a web version of SCOPE2
to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4
and has been used in other studies5-8
The three algorithms that comprise SCOPE are BEAM9
, which finds non-degenerate motifs (ACCGGT), PRISM10
, which finds degenerate motifs (ASCGWT), and SPACER11
, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif