The high-throughput expression analysis technologies available today give scientists an overflow of expression profiles but their resolution in terms of tissue specific expression is limited because of problems in dissecting individual tissues. Expression data needs to be confirmed and complemented with expression patterns using e.g. in situ hybridization, a technique used to localize cell specific mRNA expression. The in situ hybridization method is laborious, time-consuming and often requires extensive optimization depending on species and tissue. In situ experiments are relatively more difficult to perform in woody species such as the conifer Norway spruce (Picea abies). Here we present a modified DIG in situ hybridization protocol, which is fast and applicable on a wide range of plant species including P. abies. With just a few adjustments, including altered RNase treatment and proteinase K concentration, we could use the protocol to study tissue specific expression of homologous genes in male reproductive organs of one gymnosperm and two angiosperm species; P. abies, Arabidopsis thaliana and Brassica napus. The protocol worked equally well for the species and genes studied. AtAP3 and BnAP3 were observed in second and third whorl floral organs in A. thaliana and B. napus and DAL13 in microsporophylls of male cones from P. abies. For P. abies the proteinase K concentration, used to permeablize the tissues, had to be increased to 3 g/ml instead of 1 g/ml, possibly due to more compact tissues and higher levels of phenolics and polysaccharides. For all species the RNase treatment was removed due to reduced signal strength without a corresponding increase in specificity. By comparing tissue specific expression patterns of homologous genes from both flowering plants and a coniferous tree we demonstrate that the DIG in situ protocol presented here, with only minute adjustments, can be applied to a wide range of plant species. Hence, the protocol avoids both extensive species specific optimization and the laborious use of radioactively labeled probes in favor of DIG labeled probes. We have chosen to illustrate the technically demanding steps of the protocol in our film.
Anna Karlgren and Jenny Carlsson contributed equally to this study.
Corresponding authors: Anna Karlgren at Anna.Karlgren@ebc.uu.se and Jens F. Sundström at Jens.Sundstrom@vbsg.slu.se
26 Related JoVE Articles!
Identification of Post-translational Modifications of Plant Protein Complexes
Institutions: University of Warwick, Norwich Research Park, The Australian National University.
Plants adapt quickly to changing environments due to elaborate perception and signaling systems. During pathogen attack, plants rapidly respond to infection via
the recruitment and activation of immune complexes. Activation of immune complexes is associated with post-translational modifications (PTMs) of proteins, such as phosphorylation, glycosylation, or ubiquitination. Understanding how these PTMs are choreographed will lead to a better understanding of how resistance is achieved.
Here we describe a protein purification method for nucleotide-binding leucine-rich repeat (NB-LRR)-interacting proteins and the subsequent identification of their post-translational modifications (PTMs). With small modifications, the protocol can be applied for the purification of other plant protein complexes. The method is based on the expression of an epitope-tagged version of the protein of interest, which is subsequently partially purified by immunoprecipitation and subjected to mass spectrometry for identification of interacting proteins and PTMs.
This protocol demonstrates that: i). Dynamic changes in PTMs such as phosphorylation can be detected by mass spectrometry; ii). It is important to have sufficient quantities of the protein of interest, and this can compensate for the lack of purity of the immunoprecipitate; iii). In order to detect PTMs of a protein of interest, this protein has to be immunoprecipitated to get a sufficient quantity of protein.
Plant Biology, Issue 84, plant-microbe interactions, protein complex purification, mass spectrometry, protein phosphorylation, Prf, Pto, AvrPto, AvrPtoB
Setting-up an In Vitro Model of Rat Blood-brain Barrier (BBB): A Focus on BBB Impermeability and Receptor-mediated Transport
Institutions: VECT-HORUS SAS, CNRS, NICN UMR 7259.
The blood brain barrier (BBB) specifically regulates molecular and cellular flux between the blood and the nervous tissue. Our aim was to develop and characterize a highly reproducible rat syngeneic in vitro
model of the BBB using co-cultures of primary rat brain endothelial cells (RBEC) and astrocytes to study receptors involved in transcytosis across the endothelial cell monolayer. Astrocytes were isolated by mechanical dissection following trypsin digestion and were frozen for later co-culture. RBEC were isolated from 5-week-old rat cortices. The brains were cleaned of meninges and white matter, and mechanically dissociated following enzymatic digestion. Thereafter, the tissue homogenate was centrifuged in bovine serum albumin to separate vessel fragments from nervous tissue. The vessel fragments underwent a second enzymatic digestion to free endothelial cells from their extracellular matrix. The remaining contaminating cells such as pericytes were further eliminated by plating the microvessel fragments in puromycin-containing medium. They were then passaged onto filters for co-culture with astrocytes grown on the bottom of the wells. RBEC expressed high levels of tight junction (TJ) proteins such as occludin, claudin-5 and ZO-1 with a typical localization at the cell borders. The transendothelial electrical resistance (TEER) of brain endothelial monolayers, indicating the tightness of TJs reached 300 ohm·cm2
on average. The endothelial permeability coefficients (Pe) for lucifer yellow (LY) was highly reproducible with an average of 0.26 ± 0.11 x 10-3
cm/min. Brain endothelial cells organized in monolayers expressed the efflux transporter P-glycoprotein (P-gp), showed a polarized transport of rhodamine 123, a ligand for P-gp, and showed specific transport of transferrin-Cy3 and DiILDL across the endothelial cell monolayer. In conclusion, we provide a protocol for setting up an in vitro
BBB model that is highly reproducible due to the quality assurance methods, and that is suitable for research on BBB transporters and receptors.
Medicine, Issue 88, rat brain endothelial cells (RBEC), mouse, spinal cord, tight junction (TJ), receptor-mediated transport (RMT), low density lipoprotein (LDL), LDLR, transferrin, TfR, P-glycoprotein (P-gp), transendothelial electrical resistance (TEER),
Purification of the Cystic Fibrosis Transmembrane Conductance Regulator Protein Expressed in Saccharomyces cerevisiae
Institutions: University of Manchester.
Defects in the cystic fibrosis transmembrane conductance regulator (CFTR) protein cause cystic fibrosis (CF), an autosomal recessive disease that currently limits the average life expectancy of sufferers to <40 years of age. The development of novel drug molecules to restore the activity of CFTR is an important goal in the treatment CF, and the isolation of functionally active CFTR is a useful step towards achieving this goal.
We describe two methods for the purification of CFTR from a eukaryotic heterologous expression system, S. cerevisiae
. Like prokaryotic systems, S. cerevisiae
can be rapidly grown in the lab at low cost, but can also traffic and posttranslationally modify large membrane proteins. The selection of detergents for solubilization and purification is a critical step in the purification of any membrane protein. Having screened for the solubility of CFTR in several detergents, we have chosen two contrasting detergents for use in the purification that allow the final CFTR preparation to be tailored to the subsequently planned experiments.
In this method, we provide comparison of the purification of CFTR in dodecyl-β-D-maltoside (DDM) and 1-tetradecanoyl-sn
-glycerol) (LPG-14). Protein purified in DDM by this method shows ATPase activity in functional assays. Protein purified in LPG-14 shows high purity and yield, can be employed to study post-translational modifications, and can be used for structural methods such as small-angle X-ray scattering and electron microscopy. However it displays significantly lower ATPase activity.
Biochemistry, Issue 87, Membrane protein, cystic fibrosis, CFTR, ABCC7, protein purification, Cystic Fibrosis Foundation, green fluorescent protein
Assessing Species-specific Contributions To Craniofacial Development Using Quail-duck Chimeras
Institutions: University of California at San Francisco.
The generation of chimeric embryos is a widespread and powerful approach to study cell fates, tissue interactions, and species-specific contributions to the histological and morphological development of vertebrate embryos. In particular, the use of chimeric embryos has established the importance of neural crest in directing the species-specific morphology of the craniofacial complex. The method described herein utilizes two avian species, duck and quail, with remarkably different craniofacial morphology. This method greatly facilitates the investigation of molecular and cellular regulation of species-specific pattern in the craniofacial complex. Experiments in quail and duck chimeric embryos have already revealed neural crest-mediated tissue interactions and cell-autonomous behaviors that regulate species-specific pattern in the craniofacial skeleton, musculature, and integument. The great diversity of neural crest derivatives suggests significant potential for future applications of the quail-duck chimeric system to understanding vertebrate development, disease, and evolution.
Developmental Biology, Issue 87, neural crest, quail-duck chimeras, craniofacial development, epithelial-mesenchymal interactions, tissue transplants, evolutionary developmental biology
Analysis of Nephron Composition and Function in the Adult Zebrafish Kidney
Institutions: University of Notre Dame.
The zebrafish model has emerged as a relevant system to study kidney development, regeneration and disease. Both the embryonic and adult zebrafish kidneys are composed of functional units known as nephrons, which are highly conserved with other vertebrates, including mammals. Research in zebrafish has recently demonstrated that two distinctive phenomena transpire after adult nephrons incur damage: first, there is robust regeneration within existing nephrons that replaces the destroyed tubule epithelial cells; second, entirely new nephrons are produced from renal progenitors in a process known as neonephrogenesis. In contrast, humans and other mammals seem to have only a limited ability for nephron epithelial regeneration. To date, the mechanisms responsible for these kidney regeneration phenomena remain poorly understood. Since adult zebrafish kidneys undergo both nephron epithelial regeneration and neonephrogenesis, they provide an outstanding experimental paradigm to study these events. Further, there is a wide range of genetic and pharmacological tools available in the zebrafish model that can be used to delineate the cellular and molecular mechanisms that regulate renal regeneration. One essential aspect of such research is the evaluation of nephron structure and function. This protocol describes a set of labeling techniques that can be used to gauge renal composition and test nephron functionality in the adult zebrafish kidney. Thus, these methods are widely applicable to the future phenotypic characterization of adult zebrafish kidney injury paradigms, which include but are not limited to, nephrotoxicant exposure regimes or genetic methods of targeted cell death such as the nitroreductase mediated cell ablation technique. Further, these methods could be used to study genetic perturbations in adult kidney formation and could also be applied to assess renal status during chronic disease modeling.
Cellular Biology, Issue 90,
zebrafish; kidney; nephron; nephrology; renal; regeneration; proximal tubule; distal tubule; segment; mesonephros; physiology; acute kidney injury (AKI)
DNA-affinity-purified Chip (DAP-chip) Method to Determine Gene Targets for Bacterial Two component Regulatory Systems
Institutions: Lawrence Berkeley National Laboratory.
methods such as ChIP-chip are well-established techniques used to determine global gene targets for transcription factors. However, they are of limited use in exploring bacterial two component regulatory systems with uncharacterized activation conditions. Such systems regulate transcription only when activated in the presence of unique signals. Since these signals are often unknown, the in vitro
microarray based method described in this video article can be used to determine gene targets and binding sites for response regulators. This DNA-affinity-purified-chip method may be used for any purified regulator in any organism with a sequenced genome. The protocol involves allowing the purified tagged protein to bind to sheared genomic DNA and then affinity purifying the protein-bound DNA, followed by fluorescent labeling of the DNA and hybridization to a custom tiling array. Preceding steps that may be used to optimize the assay for specific regulators are also described. The peaks generated by the array data analysis are used to predict binding site motifs, which are then experimentally validated. The motif predictions can be further used to determine gene targets of orthologous response regulators in closely related species. We demonstrate the applicability of this method by determining the gene targets and binding site motifs and thus predicting the function for a sigma54-dependent response regulator DVU3023 in the environmental bacterium Desulfovibrio vulgaris
Genetics, Issue 89, DNA-Affinity-Purified-chip, response regulator, transcription factor binding site, two component system, signal transduction, Desulfovibrio, lactate utilization regulator, ChIP-chip
Modeling Astrocytoma Pathogenesis In Vitro and In Vivo Using Cortical Astrocytes or Neural Stem Cells from Conditional, Genetically Engineered Mice
Institutions: University of North Carolina School of Medicine, University of North Carolina School of Medicine, University of North Carolina School of Medicine, University of North Carolina School of Medicine, University of North Carolina School of Medicine, Emory University School of Medicine, University of North Carolina School of Medicine.
Current astrocytoma models are limited in their ability to define the roles of oncogenic mutations in specific brain cell types during disease pathogenesis and their utility for preclinical drug development. In order to design a better model system for these applications, phenotypically wild-type cortical astrocytes and neural stem cells (NSC) from conditional, genetically engineered mice (GEM) that harbor various combinations of floxed oncogenic alleles were harvested and grown in culture. Genetic recombination was induced in vitro
using adenoviral Cre-mediated recombination, resulting in expression of mutated oncogenes and deletion of tumor suppressor genes. The phenotypic consequences of these mutations were defined by measuring proliferation, transformation, and drug response in vitro
. Orthotopic allograft models, whereby transformed cells are stereotactically injected into the brains of immune-competent, syngeneic littermates, were developed to define the role of oncogenic mutations and cell type on tumorigenesis in vivo
. Unlike most established human glioblastoma cell line xenografts, injection of transformed GEM-derived cortical astrocytes into the brains of immune-competent littermates produced astrocytomas, including the most aggressive subtype, glioblastoma, that recapitulated the histopathological hallmarks of human astrocytomas, including diffuse invasion of normal brain parenchyma. Bioluminescence imaging of orthotopic allografts from transformed astrocytes engineered to express luciferase was utilized to monitor in vivo
tumor growth over time. Thus, astrocytoma models using astrocytes and NSC harvested from GEM with conditional oncogenic alleles provide an integrated system to study the genetics and cell biology of astrocytoma pathogenesis in vitro
and in vivo
and may be useful in preclinical drug development for these devastating diseases.
Neuroscience, Issue 90, astrocytoma, cortical astrocytes, genetically engineered mice, glioblastoma, neural stem cells, orthotopic allograft
An Experimental and Bioinformatics Protocol for RNA-seq Analyses of Photoperiodic Diapause in the Asian Tiger Mosquito, Aedes albopictus
Institutions: Georgetown University, The Ohio State University.
Photoperiodic diapause is an important adaptation that allows individuals to escape harsh seasonal environments via a series of physiological changes, most notably developmental arrest and reduced metabolism. Global gene expression profiling via RNA-Seq can provide important insights into the transcriptional mechanisms of photoperiodic diapause. The Asian tiger mosquito, Aedes albopictus
, is an outstanding organism for studying the transcriptional bases of diapause due to its ease of rearing, easily induced diapause, and the genomic resources available. This manuscript presents a general experimental workflow for identifying diapause-induced transcriptional differences in A. albopictus.
Rearing techniques, conditions necessary to induce diapause and non-diapause development, methods to estimate percent diapause in a population, and RNA extraction and integrity assessment for mosquitoes are documented. A workflow to process RNA-Seq data from Illumina sequencers culminates in a list of differentially expressed genes. The representative results demonstrate that this protocol can be used to effectively identify genes differentially regulated at the transcriptional level in A. albopictus
due to photoperiodic differences. With modest adjustments, this workflow can be readily adapted to study the transcriptional bases of diapause or other important life history traits in other mosquitoes.
Genetics, Issue 93, Aedes albopictus Asian tiger mosquito, photoperiodic diapause, RNA-Seq de novo transcriptome assembly, mosquito husbandry
Inhibitory Synapse Formation in a Co-culture Model Incorporating GABAergic Medium Spiny Neurons and HEK293 Cells Stably Expressing GABAA Receptors
Institutions: University College London.
Inhibitory neurons act in the central nervous system to regulate the dynamics and spatio-temporal co-ordination of neuronal networks. GABA (γ-aminobutyric acid) is the predominant inhibitory neurotransmitter in the brain. It is released from the presynaptic terminals of inhibitory neurons within highly specialized intercellular junctions known as synapses, where it binds to GABAA
Rs) present at the plasma membrane of the synapse-receiving, postsynaptic neurons. Activation of these GABA-gated ion channels leads to influx of chloride resulting in postsynaptic potential changes that decrease the probability that these neurons will generate action potentials.
During development, diverse types of inhibitory neurons with distinct morphological, electrophysiological and neurochemical characteristics have the ability to recognize their target neurons and form synapses which incorporate specific GABAA
Rs subtypes. This principle of selective innervation of neuronal targets raises the question as to how the appropriate synaptic partners identify each other.
To elucidate the underlying molecular mechanisms, a novel in vitro
co-culture model system was established, in which medium spiny GABAergic neurons, a highly homogenous population of neurons isolated from the embryonic striatum, were cultured with stably transfected HEK293 cell lines that express different GABAA
R subtypes. Synapses form rapidly, efficiently and selectively in this system, and are easily accessible for quantification. Our results indicate that various GABAA
R subtypes differ in their ability to promote synapse formation, suggesting that this reduced in vitro
model system can be used to reproduce, at least in part, the in vivo
conditions required for the recognition of the appropriate synaptic partners and formation of specific synapses. Here the protocols for culturing the medium spiny neurons and generating HEK293 cells lines expressing GABAA
Rs are first described, followed by detailed instructions on how to combine these two cell types in co-culture and analyze the formation of synaptic contacts.
Neuroscience, Issue 93, Developmental neuroscience, synaptogenesis, synaptic inhibition, co-culture, stable cell lines, GABAergic, medium spiny neurons, HEK 293 cell line
Forward Genetics Screens Using Macrophages to Identify Toxoplasma gondii Genes Important for Resistance to IFN-γ-Dependent Cell Autonomous Immunity
Institutions: New York Medical College.
the causative agent of toxoplasmosis, is an obligate intracellular protozoan pathogen. The parasite invades and replicates within virtually any warm blooded vertebrate cell type. During parasite invasion of a host cell, the parasite creates a parasitophorous vacuole (PV) that originates from the host cell membrane independent of phagocytosis within which the parasite replicates. While IFN-dependent-innate and cell mediated immunity is important for eventual control of infection, innate immune cells, including neutrophils, monocytes and dendritic cells, can also serve as vehicles for systemic dissemination of the parasite early in infection. An approach is described that utilizes the host innate immune response, in this case macrophages, in a forward genetic screen to identify parasite mutants with a fitness defect in infected macrophages following activation but normal invasion and replication in naïve macrophages. Thus, the screen isolates parasite mutants that have a specific defect in their ability to resist the effects of macrophage activation. The paper describes two broad phenotypes of mutant parasites following activation of infected macrophages: parasite stasis versus parasite degradation, often in amorphous vacuoles. The parasite mutants are then analyzed to identify the responsible parasite genes specifically important for resistance to induced mediators of cell autonomous immunity. The paper presents a general approach for the forward genetics screen that, in theory, can be modified to target parasite genes important for resistance to specific antimicrobial mediators. It also describes an approach to evaluate the specific macrophage antimicrobial mediators to which the parasite mutant is susceptible. Activation of infected macrophages can also promote parasite differentiation from the tachyzoite to bradyzoite stage that maintains chronic infection. Therefore, methodology is presented to evaluate the importance of the identified parasite gene to establishment of chronic infection.
Immunology, Issue 97, Toxoplasma, macrophages, innate immunity, intracellular pathogen, immune evasion, infectious disease, forward genetics, parasite
Phage Phenomics: Physiological Approaches to Characterize Novel Viral Proteins
Institutions: San Diego State University, San Diego State University, San Diego State University, San Diego State University, San Diego State University, Argonne National Laboratory, Broad Institute.
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60 - 95% of all phage sequences share no homology to current annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides bi-product analysis by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.
Immunology, Issue 100, phenomics, phage, viral metagenome, Multi-phenotype Assay Plates (MAPs), continuous culture, metabolomics
Bone Marrow-derived Macrophage Production
Institutions: Aix-Marseille Université, University of Naples "Federico II".
Macrophages are critical components of the innate and adaptive immune responses, and they are the first line of defense against foreign invaders because of their powerful microbicidal activities. Macrophages are widely distributed throughout the body and are present in the lymphoid organs, liver, lungs, gastrointestinal tract, central nervous system, bone, and skin. Because of their repartition, they participate in a wide range of physiological and pathological processes. Macrophages are highly versatile cells that are able to recognize microenvironmental alterations and to maintain tissue homeostasis. Numerous pathogens have evolved mechanisms to use macrophages as Trojan horses to survive, replicate in, and infect both humans and animals and to propagate throughout the body. The recent explosion of interest in evolutionary, genetic, and biochemical aspects of host-pathogen interactions has renewed scientific attention regarding macrophages. Here, we describe a procedure to isolate and cultivate macrophages from murine bone marrow that will provide large numbers of macrophages for studying host-pathogen interactions as well as other processes.
Immunology, Issue 81, biology (general), immunology, Life Sciences (General) macrophages, bone marrow, phagocytosis, phagosomes, lysosomes, endocytosis
Rescue of Recombinant Newcastle Disease Virus from cDNA
Institutions: Icahn School of Medicine at Mount Sinai, Icahn School of Medicine at Mount Sinai, Icahn School of Medicine at Mount Sinai, University of Rochester.
Newcastle disease virus (NDV), the prototype member of the Avulavirus
genus of the family Paramyxoviridae1
, is a non-segmented, negative-sense, single-stranded, enveloped RNA virus (Figure 1)
with potential applications as a vector for vaccination and treatment of human diseases. In-depth exploration of these applications has only become possible after the establishment of reverse genetics techniques to rescue recombinant viruses from plasmids encoding their complete genomes as cDNA2-5
. Viral cDNA can be conveniently modified in vitro
by using standard cloning procedures to alter the genotype of the virus and/or to include new transcriptional units. Rescue of such genetically modified viruses provides a valuable tool to understand factors affecting multiple stages of infection, as well as allows for the development and improvement of vectors for the expression and delivery of antigens for vaccination and therapy. Here we describe a protocol for the rescue of recombinant NDVs.
Immunology, Issue 80, Paramyxoviridae, Vaccines, Oncolytic Virotherapy, Immunity, Innate, Newcastle disease virus (NDV), MVA-T7, reverse genetics techniques, plasmid transfection, recombinant virus, HA assay
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Mosaic Zebrafish Transgenesis for Evaluating Enhancer Sequences
Institutions: University of Pennsylvania .
The completion of the human genome sequence, along with that of many other species, has highlighted the challenge of ascribing specific function to non coding sequences. One prominent function carried out by the non coding fraction of the genome is to regulate gene transcription; however, there are no effective methods to broadly predict cis-regulatory elements from primary DNA sequence. We have developed an efficient protocol to functionally evaluate potential cis-regulatory elements through zebrafish transgenesis. Our approach offers significant advantages over cell-culture based techniques for developmentally important genes, since it provides information on spatial and temporal gene regulation. Conversely, it is faster and less expensive than similar experiments in transgenic mice, and we routinely apply it to sequences isolated from the human genome. Here we demonstrate our approach to selecting elements for testing based on sequence conservation and our protocol for cloning sequences and microinjecting them into zebrafish embryos.
Cellular Biology, Issue 41, zebrafish, transgenesis, microinjection, GFP, enhancers, transposon
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1
. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1
. In this article, we utilize a web version of SCOPE2
to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4
and has been used in other studies5-8
The three algorithms that comprise SCOPE are BEAM9
, which finds non-degenerate motifs (ACCGGT), PRISM10
, which finds degenerate motifs (ASCGWT), and SPACER11
, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif
Rapid Diagnosis of Avian Influenza Virus in Wild Birds: Use of a Portable rRT-PCR and Freeze-dried Reagents in the Field
Institutions: USGS Western Ecological Research Center, University of California, Davis, University of California, Davis, University of Minnesota , Science Applications International Corporation.
Wild birds have been implicated in the spread of highly pathogenic avian influenza (HPAI) of the H5N1 subtype, prompting surveillance along migratory flyways. Sampling of wild birds for avian influenza virus (AIV) is often conducted in remote regions, but results are often delayed because of the need to transport samples to a laboratory equipped for molecular testing. Real-time reverse transcriptase polymerase chain reaction (rRT-PCR) is a molecular technique that offers one of the most accurate and sensitive methods for diagnosis of AIV. The previously strict lab protocols needed for rRT-PCR are now being adapted for the field. Development of freeze-dried (lyophilized) reagents that do not require cold chain, with sensitivity at the level of wet reagents has brought on-site remote testing to a practical goal.
Here we present a method for the rapid diagnosis of AIV in wild birds using an rRT-PCR unit (Ruggedized Advanced Pathogen Identification Device or RAPID, Idaho Technologies, Salt Lake City, UT) that employs lyophilized reagents (Influenza A Target 1 Taqman; ASAY-ASY-0109, Idaho Technologies). The reagents contain all of the necessary components for testing at appropriate concentrations in a single tube: primers, probes, enzymes, buffers and internal positive controls, eliminating errors associated with improper storage or handling of wet reagents. The portable unit performs a screen for Influenza A by targeting the matrix gene and yields results in 2-3 hours. Genetic subtyping is also possible with H5 and H7 primer sets that target the hemagglutinin gene.
The system is suitable for use on cloacal and oropharyngeal samples collected from wild birds, as demonstrated here on the migratory shorebird species, the western sandpiper (Calidrus mauri
) captured in Northern California. Animal handling followed protocols approved by the Animal Care and Use Committee of the U.S. Geological Survey Western Ecological Research Center and permits of the U.S. Geological Survey Bird Banding Laboratory. The primary advantage of this technique is to expedite diagnosis of wild birds, increasing the chances of containing an outbreak in a remote location. On-site diagnosis would also prove useful for identifying and studying infected individuals in wild populations. The opportunity to collect information on host biology (immunological and physiological response to infection) and spatial ecology (migratory performance of infected birds) will provide insights into the extent to which wild birds can act as vectors for AIV over long distances.
Immunology, Issue 54, migratory birds, active surveillance, lyophilized reagents, avian influenza, H5N1
A Protocol for Computer-Based Protein Structure and Function Prediction
Institutions: University of Michigan , University of Kansas.
Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
Biochemistry, Issue 57, On-line server, I-TASSER, protein structure prediction, function prediction
Creating Objects and Object Categories for Studying Perception and Perceptual Learning
Institutions: Georgia Health Sciences University, Georgia Health Sciences University, Georgia Health Sciences University, Palo Alto Research Center, Palo Alto Research Center, University of Minnesota .
In order to quantitatively study object perception, be it perception by biological systems or by machines, one needs to create objects and object categories with precisely definable, preferably naturalistic, properties1
. Furthermore, for studies on perceptual learning, it is useful to create novel objects and object categories (or object classes
) with such properties2
Many innovative and useful methods currently exist for creating novel objects and object categories3-6
(also see refs. 7,8). However, generally speaking, the existing methods have three broad types of shortcomings.
First, shape variations are generally imposed by the experimenter5,9,10
, and may therefore be different from the variability in natural categories, and optimized for a particular recognition algorithm. It would be desirable to have the variations arise independently of the externally imposed constraints.
Second, the existing methods have difficulty capturing the shape complexity of natural objects11-13
. If the goal is to study natural object perception, it is desirable for objects and object categories to be naturalistic, so as to avoid possible confounds and special cases.
Third, it is generally hard to quantitatively measure the available information in the stimuli created by conventional methods. It would be desirable to create objects and object categories where the available information can be precisely measured and, where necessary, systematically manipulated (or 'tuned'). This allows one to formulate the underlying object recognition tasks in quantitative terms.
Here we describe a set of algorithms, or methods, that meet all three of the above criteria. Virtual morphogenesis (VM) creates novel, naturalistic virtual 3-D objects called 'digital embryos' by simulating the biological process of embryogenesis14
. Virtual phylogenesis (VP) creates novel, naturalistic object categories by simulating the evolutionary process of natural selection9,12,13
. Objects and object categories created by these simulations can be further manipulated by various morphing methods to generate systematic variations of shape characteristics15,16
. The VP and morphing methods can also be applied, in principle, to novel virtual objects other than digital embryos, or to virtual versions of real-world objects9,13
. Virtual objects created in this fashion can be rendered as visual images using a conventional graphical toolkit, with desired manipulations of surface texture, illumination, size, viewpoint and background. The virtual objects can also be 'printed' as haptic objects using a conventional 3-D prototyper.
We also describe some implementations of these computational algorithms to help illustrate the potential utility of the algorithms. It is important to distinguish the algorithms from their implementations. The implementations are demonstrations offered solely as a 'proof of principle' of the underlying algorithms. It is important to note that, in general, an implementation of a computational algorithm often has limitations that the algorithm itself does not have.
Together, these methods represent a set of powerful and flexible tools for studying object recognition and perceptual learning by biological and computational systems alike. With appropriate extensions, these methods may also prove useful in the study of morphogenesis and phylogenesis.
Neuroscience, Issue 69, machine learning, brain, classification, category learning, cross-modal perception, 3-D prototyping, inference
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
Avidity-based Extracellular Interaction Screening (AVEXIS) for the Scalable Detection of Low-affinity Extracellular Receptor-Ligand Interactions
Institutions: Wellcome Trust Sanger Institute.
Extracellular protein:protein interactions between secreted or membrane-tethered proteins are critical for both initiating intercellular communication and ensuring cohesion within multicellular organisms. Proteins predicted to form extracellular interactions are encoded by approximately a quarter of human genes1
, but despite their importance and abundance, the majority of these proteins have no documented binding partner. Primarily, this is due to their biochemical intractability: membrane-embedded proteins are difficult to solubilise in their native conformation and contain structurally-important posttranslational modifications. Also, the interaction affinities between receptor proteins are often characterised by extremely low interaction strengths (half-lives < 1 second) precluding their detection with many commonly-used high throughput methods2
Here, we describe an assay, AVEXIS (AVidity-based EXtracellular Interaction Screen) that overcomes these technical challenges enabling the detection of very weak protein interactions (t1/2
≤ 0.1 sec) with a low false positive rate3
. The assay is usually implemented in a high throughput format to enable the systematic screening of many thousands of interactions in a convenient microtitre plate format (Fig. 1). It relies on the production of soluble recombinant protein libraries that contain the ectodomain fragments of cell surface receptors or secreted proteins within which to screen for interactions; therefore, this approach is suitable for type I, type II, GPI-linked cell surface receptors and secreted proteins but not for multipass membrane proteins such as ion channels or transporters.
The recombinant protein libraries are produced using a convenient and high-level mammalian expression system4
, to ensure that important posttranslational modifications such as glycosylation and disulphide bonds are added. Expressed recombinant proteins are secreted into the medium and produced in two forms: a biotinylated bait which can be captured on a streptavidin-coated solid phase suitable for screening, and a pentamerised enzyme-tagged (β-lactamase) prey. The bait and prey proteins are presented to each other in a binary fashion to detect direct interactions between them, similar to a conventional ELISA (Fig. 1). The pentamerisation of the proteins in the prey is achieved through a peptide sequence from the cartilage oligomeric matrix protein (COMP) and increases the local concentration of the ectodomains thereby providing significant avidity gains to enable even very transient interactions to be detected. By normalising the activities of both the bait and prey to predetermined levels prior to screening, we have shown that interactions having monomeric half-lives of 0.1 sec can be detected with low false positive rates3
Molecular Biology, Issue 61, Receptor-ligand pairs, Extracellular protein interactions, AVEXIS, Adhesion receptors, Transient/weak interactions, High throughput screening
Mapping Bacterial Functional Networks and Pathways in Escherichia Coli using Synthetic Genetic Arrays
Institutions: University of Toronto, University of Toronto, University of Regina.
Phenotypes are determined by a complex series of physical (e.g.
protein-protein) and functional (e.g.
gene-gene or genetic) interactions (GI)1
. While physical interactions can indicate which bacterial proteins are associated as complexes, they do not necessarily reveal pathway-level functional relationships1. GI screens, in which the growth of double mutants bearing two deleted or inactivated genes is measured and compared to the corresponding single mutants, can illuminate epistatic dependencies between loci and hence provide a means to query and discover novel functional relationships2
. Large-scale GI maps have been reported for eukaryotic organisms like yeast3-7
, but GI information remains sparse for prokaryotes8
, which hinders the functional annotation of bacterial genomes. To this end, we and others have developed high-throughput quantitative bacterial GI screening methods9, 10
Here, we present the key steps required to perform quantitative E. coli
Synthetic Genetic Array (eSGA) screening procedure on a genome-scale9
, using natural bacterial conjugation and homologous recombination to systemically generate and measure the fitness of large numbers of double mutants in a colony array format.
Briefly, a robot is used to transfer, through conjugation, chloramphenicol (Cm) - marked mutant alleles from engineered Hfr (High frequency of recombination) 'donor strains' into an ordered array of kanamycin (Kan) - marked F- recipient strains. Typically, we use loss-of-function single mutants bearing non-essential gene deletions (e.g.
the 'Keio' collection11
) and essential gene hypomorphic mutations (i.e.
alleles conferring reduced protein expression, stability, or activity9, 12, 13
) to query the functional associations of non-essential and essential genes, respectively. After conjugation and ensuing genetic exchange mediated by homologous recombination, the resulting double mutants are selected on solid medium containing both antibiotics. After outgrowth, the plates are digitally imaged and colony sizes are quantitatively scored using an in-house automated image processing system14
. GIs are revealed when the growth rate of a double mutant is either significantly better or worse than expected9
. Aggravating (or negative) GIs often result between loss-of-function mutations in pairs of genes from compensatory pathways that impinge on the same essential process2
. Here, the loss of a single gene is buffered, such that either single mutant is viable. However, the loss of both pathways is deleterious and results in synthetic lethality or sickness (i.e.
slow growth). Conversely, alleviating (or positive) interactions can occur between genes in the same pathway or protein complex2
as the deletion of either gene alone is often sufficient to perturb the normal function of the pathway or complex such that additional perturbations do not reduce activity, and hence growth, further. Overall, systematically identifying and analyzing GI networks can provide unbiased, global maps of the functional relationships between large numbers of genes, from which pathway-level information missed by other approaches can be inferred9
Genetics, Issue 69, Molecular Biology, Medicine, Biochemistry, Microbiology, Aggravating, alleviating, conjugation, double mutant, Escherichia coli, genetic interaction, Gram-negative bacteria, homologous recombination, network, synthetic lethality or sickness, suppression
Examining BCL-2 Family Function with Large Unilamellar Vesicles
Institutions: Mount Sinai School of Medicine .
The BCL-2 (B cell CLL/Lymphoma) family is comprised of approximately twenty proteins that collaborate to either maintain cell survival or initiate apoptosis1
. Following cellular stress (e.g.,
DNA damage), the pro-apoptotic BCL-2 family effectors BAK (BCL-2 antagonistic killer 1) and/or BAX (BCL-2 associated X protein) become activated and compromise the integrity of the outer mitochondrial membrane (OMM), though the process referred to as mitochondrial outer membrane permeabilization (MOMP)1
. After MOMP occurs, pro-apoptotic proteins (e.g.,
cytochrome c) gain access to the cytoplasm, promote caspase activation, and apoptosis rapidly ensues2
In order for BAK/BAX to induce MOMP, they require transient interactions with members of another pro-apoptotic subset of the BCL-2 family, the BCL-2 homology domain 3 (BH3)-only proteins, such as BID (BH3-interacting domain agonist)3-6
. Anti-apoptotic BCL-2 family proteins (e.g.,
BCL-2 related gene, long isoform, BCL-xL; myeloid cell leukemia 1, MCL-1) regulate cellular survival by tightly controlling the interactions between BAK/BAX and the BH3-only proteins capable of directly inducing BAK/BAX activation7,8
. In addition, anti-apoptotic BCL-2 protein availability is also dictated by sensitizer/de-repressor BH3-only proteins, such as BAD (BCL-2 antagonist of cell death) or PUMA (p53 upregulated modulator of apoptosis), which bind and inhibit anti-apoptotic members7,9
. As most of the anti-apoptotic BCL-2 repertoire is localized to the OMM, the cellular decision to maintain survival or induce MOMP is dictated by multiple BCL-2 family interactions at this membrane.
Large unilamellar vesicles (LUVs) are a biochemical model to explore relationships between BCL-2 family interactions and membrane permeabilization10
. LUVs are comprised of defined lipids that are assembled in ratios identified in lipid composition studies from solvent extracted Xenopus mitochondria (46.5% phosphatidylcholine, 28.5% phosphatidylethanoloamine, 9% phosphatidylinositol, 9% phosphatidylserine, and 7% cardiolipin)10
. This is a convenient model system to directly explore BCL-2 family function because the protein and lipid components are completely defined and tractable, which is not always the case with primary mitochondria. While cardiolipin is not usually this high throughout the OMM, this model does faithfully mimic the OMM to promote BCL-2 family function. Furthermore, a more recent modification of the above protocol allows for kinetic analyses of protein interactions and real-time measurements of membrane permeabilization, which is based on LUVs containing a polyanionic dye (ANTS: 8-aminonaphthalene-1,3,6-trisulfonic acid) and cationic quencher (DPX: p
. As the LUVs permeabilize, ANTS and DPX diffuse apart, and a gain in fluorescence is detected. Here, commonly used recombinant BCL-2 family protein combinations and controls using the LUVs containing ANTS/DPX are described.
Cancer Biology, Issue 68, Genetics, Molecular Biology, Apoptosis, BAX, BCL-2 family, large unilamellar vesicles, MOMP, outer mitochondrial membrane
A High-content Imaging Workflow to Study Grb2 Signaling Complexes by Expression Cloning
Institutions: University College London, Massachusetts General Hospital.
Signal transduction by growth factor receptors is essential for cells to maintain proliferation and differentiation and requires tight control. Signal transduction is initiated by binding of an external ligand to a transmembrane receptor and activation of downstream signaling cascades. A key regulator of mitogenic signaling is Grb2, a modular protein composed of an internal SH2 (Src Homology 2) domain flanked by two SH3 domains that lacks enzymatic activity. Grb2 is constitutively associated with the GTPase Son-Of-Sevenless (SOS) via its N-terminal SH3 domain. The SH2 domain of Grb2 binds to growth factor receptors at phosphorylated tyrosine residues thus coupling receptor activation to the SOS-Ras-MAP kinase signaling cascade. In addition, other roles for Grb2 as a positive or negative regulator of signaling and receptor endocytosis have been described. The modular composition of Grb2 suggests that it can dock to a variety of receptors and transduce signals along a multitude of different pathways1-3
Described here is a simple microscopy assay that monitors recruitment of Grb2 to the plasma membrane. It is adapted from an assay that measures changes in sub-cellular localization of green-fluorescent protein (GFP)-tagged Grb2 in response to a stimulus4-6
. Plasma membrane receptors that bind Grb2 such as activated Epidermal Growth Factor Receptor (EGFR) recruit GFP-Grb2 to the plasma membrane upon cDNA expression and subsequently relocate to endosomal compartments in the cell. In order to identify in vivo
protein complexes of Grb2, this technique can be used to perform a genome-wide high-content screen based on changes in Grb2 sub-cellular localization. The preparation of cDNA expression clones, transfection and image acquisition are described in detail below. Compared to other genomic methods used to identify protein interaction partners, such as yeast-two-hybrid, this technique allows the visualization of protein complexes in mammalian cells at the sub-cellular site of interaction by a simple microscopy-based assay. Hence both qualitative features, such as patterns of localization can be assessed, as well as the quantitative strength of the interaction.
Molecular Biology, Issue 68, Grb2, cDNA preparation, high-throughput, high-content screening, signal transduction, expression cloning, 96-well
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
Institutions: The Research Institute at Nationwide Children's Hospital.
Protein alignments are commonly used to evaluate the similarity of protein residues, and the derived consensus sequence used for identifying functional units (e.g.,
domains). Traditional consensus-building models fail to account for interpositional dependencies – functionally required covariation of residues that tend to appear simultaneously throughout evolution and across the phylogentic tree. These relationships can reveal important clues about the processes of protein folding, thermostability, and the formation of functional sites, which in turn can be used to inform the engineering of synthetic proteins. Unfortunately, these relationships essentially form sub-motifs which cannot be predicted by simple “majority rule” or even HMM-based consensus models, and the result can be a biologically invalid “consensus” which is not only never seen in nature but is less viable than any extant protein. We have developed a visual analytics tool, StickWRLD, which creates an interactive 3D representation of a protein alignment and clearly displays covarying residues. The user has the ability to pan and zoom, as well as dynamically change the statistical threshold underlying the identification of covariants. StickWRLD has previously been successfully used to identify functionally-required covarying residues in proteins such as Adenylate Kinase and in DNA sequences such as endonuclease target sites.
Chemistry, Issue 101, protein engineering, covariation, codependent residues, visualization