Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
23 Related JoVE Articles!
DNA-affinity-purified Chip (DAP-chip) Method to Determine Gene Targets for Bacterial Two component Regulatory Systems
Institutions: Lawrence Berkeley National Laboratory.
methods such as ChIP-chip are well-established techniques used to determine global gene targets for transcription factors. However, they are of limited use in exploring bacterial two component regulatory systems with uncharacterized activation conditions. Such systems regulate transcription only when activated in the presence of unique signals. Since these signals are often unknown, the in vitro
microarray based method described in this video article can be used to determine gene targets and binding sites for response regulators. This DNA-affinity-purified-chip method may be used for any purified regulator in any organism with a sequenced genome. The protocol involves allowing the purified tagged protein to bind to sheared genomic DNA and then affinity purifying the protein-bound DNA, followed by fluorescent labeling of the DNA and hybridization to a custom tiling array. Preceding steps that may be used to optimize the assay for specific regulators are also described. The peaks generated by the array data analysis are used to predict binding site motifs, which are then experimentally validated. The motif predictions can be further used to determine gene targets of orthologous response regulators in closely related species. We demonstrate the applicability of this method by determining the gene targets and binding site motifs and thus predicting the function for a sigma54-dependent response regulator DVU3023 in the environmental bacterium Desulfovibrio vulgaris
Genetics, Issue 89, DNA-Affinity-Purified-chip, response regulator, transcription factor binding site, two component system, signal transduction, Desulfovibrio, lactate utilization regulator, ChIP-chip
Rapid Synthesis and Screening of Chemically Activated Transcription Factors with GFP-based Reporters
Institutions: Princeton University, Princeton University, California Institute of Technology.
Synthetic biology aims to rationally design and build synthetic circuits with desired quantitative properties, as well as provide tools to interrogate the structure of native control circuits. In both cases, the ability to program gene expression in a rapid and tunable fashion, with no off-target effects, can be useful. We have constructed yeast strains containing the ACT1
promoter upstream of a URA3
cassette followed by the ligand-binding domain of the human estrogen receptor and VP16. By transforming this strain with a linear PCR product containing a DNA binding domain and selecting against the presence of URA3
, a constitutively expressed artificial transcription factor (ATF) can be generated by homologous recombination. ATFs engineered in this fashion can activate a unique target gene in the presence of inducer, thereby eliminating both the off-target activation and nonphysiological growth conditions found with commonly used conditional gene expression systems. A simple method for the rapid construction of GFP reporter plasmids that respond specifically to a native or artificial transcription factor of interest is also provided.
Genetics, Issue 81, transcription, transcription factors, artificial transcription factors, zinc fingers, Zif268, synthetic biology
Efficient Production and Purification of Recombinant Murine Kindlin-3 from Insect Cells for Biophysical Studies
Institutions: University of Oxford.
Kindlins are essential coactivators, with talin, of the cell surface receptors integrins and also participate in integrin outside-in signalling, and the control of gene transcription in the cell nucleus. The kindlins are ~75 kDa multidomain proteins and bind to an NPxY motif and upstream T/S cluster of the integrin β-subunit cytoplasmic tail. The hematopoietically-important kindlin isoform, kindlin-3, is critical for platelet aggregation during thrombus formation, leukocyte rolling in response to infection and inflammation and osteoclast podocyte formation in bone resorption. Kindlin-3's role in these processes has resulted in extensive cellular and physiological studies. However, there is a need for an efficient method of acquiring high quality milligram quantities of the protein for further studies. We have developed a protocol, here described, for the efficient expression and purification of recombinant murine kindlin-3 by use of a baculovirus-driven expression system in Sf9 cells yielding sufficient amounts of high purity full-length protein to allow its biophysical characterization. The same approach could be taken in the study of the other mammalian kindlin isoforms.
Virology, Issue 85, Heterologous protein expression, insect cells, Spodoptera frugiperda, baculovirus, protein purification, kindlin, cell adhesion
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Flat Mount Preparation for Observation and Analysis of Zebrafish Embryo Specimens Stained by Whole Mount In situ Hybridization
Institutions: University of Notre Dame.
The zebrafish embryo is now commonly used for basic and biomedical research to investigate the genetic control of developmental processes and to model congenital abnormalities. During the first day of life, the zebrafish embryo progresses through many developmental stages including fertilization, cleavage, gastrulation, segmentation, and the organogenesis of structures such as the kidney, heart, and central nervous system. The anatomy of a young zebrafish embryo presents several challenges for the visualization and analysis of the tissues involved in many of these events because the embryo develops in association with a round yolk mass. Thus, for accurate analysis and imaging of experimental phenotypes in fixed embryonic specimens between the tailbud and 20 somite stage (10 and 19 hours post fertilization (hpf), respectively), such as those stained using whole mount in situ
hybridization (WISH), it is often desirable to remove the embryo from the yolk ball and to position it flat on a glass slide. However, performing a flat mount procedure can be tedious. Therefore, successful and efficient flat mount preparation is greatly facilitated through the visual demonstration of the dissection technique, and also helped by using reagents that assist in optimal tissue handling. Here, we provide our WISH protocol for one or two-color detection of gene expression in the zebrafish embryo, and demonstrate how the flat mounting procedure can be performed on this example of a stained fixed specimen. This flat mounting protocol is broadly applicable to the study of many embryonic structures that emerge during early zebrafish development, and can be implemented in conjunction with other staining methods performed on fixed embryo samples.
Developmental Biology, Issue 89, animals, vertebrates, fishes, zebrafish, growth and development, morphogenesis, embryonic and fetal development, organogenesis, natural science disciplines, embryo, whole mount in situ hybridization, flat mount, deyolking, imaging
Specificity Analysis of Protein Lysine Methyltransferases Using SPOT Peptide Arrays
Institutions: Stuttgart University.
Lysine methylation is an emerging post-translation modification and it has been identified on several histone and non-histone proteins, where it plays crucial roles in cell development and many diseases. Approximately 5,000 lysine methylation sites were identified on different proteins, which are set by few dozens of protein lysine methyltransferases. This suggests that each PKMT methylates multiple proteins, however till now only one or two substrates have been identified for several of these enzymes. To approach this problem, we have introduced peptide array based substrate specificity analyses of PKMTs. Peptide arrays are powerful tools to characterize the specificity of PKMTs because methylation of several substrates with different sequences can be tested on one array. We synthesized peptide arrays on cellulose membrane using an Intavis SPOT synthesizer and analyzed the specificity of various PKMTs. Based on the results, for several of these enzymes, novel substrates could be identified. For example, for NSD1 by employing peptide arrays, we showed that it methylates K44 of H4 instead of the reported H4K20 and in addition H1.5K168 is the highly preferred substrate over the previously known H3K36. Hence, peptide arrays are powerful tools to biochemically characterize the PKMTs.
Biochemistry, Issue 93, Peptide arrays, solid phase peptide synthesis, SPOT synthesis, protein lysine methyltransferases, substrate specificity profile analysis, lysine methylation
Comprehensive Analysis of Transcription Dynamics from Brain Samples Following Behavioral Experience
Institutions: The Hebrew University of Jerusalem.
The encoding of experiences in the brain and the consolidation of long-term memories depend on gene transcription. Identifying the function of specific genes in encoding experience is one of the main objectives of molecular neuroscience. Furthermore, the functional association of defined genes with specific behaviors has implications for understanding the basis of neuropsychiatric disorders. Induction of robust transcription programs has been observed in the brains of mice following various behavioral manipulations. While some genetic elements are utilized recurrently following different behavioral manipulations and in different brain nuclei, transcriptional programs are overall unique to the inducing stimuli and the structure in which they are studied1,2
In this publication, a protocol is described for robust and comprehensive transcriptional profiling from brain nuclei of mice in response to behavioral manipulation. The protocol is demonstrated in the context of analysis of gene expression dynamics in the nucleus accumbens following acute cocaine experience. Subsequent to a defined in vivo
experience, the target neural tissue is dissected; followed by RNA purification, reverse transcription and utilization of microfluidic arrays for comprehensive qPCR analysis of multiple target genes. This protocol is geared towards comprehensive analysis (addressing 50-500 genes) of limiting quantities of starting material, such as small brain samples or even single cells.
The protocol is most advantageous for parallel analysis of multiple samples (e.g.
single cells, dynamic analysis following pharmaceutical, viral or behavioral perturbations). However, the protocol could also serve for the characterization and quality assurance of samples prior to whole-genome studies by microarrays or RNAseq, as well as validation of data obtained from whole-genome studies.
Behavior, Issue 90,
Brain, behavior, RNA, transcription, nucleus accumbens, cocaine, high-throughput qPCR, experience-dependent plasticity, gene regulatory networks, microdissection
Analysis of Nephron Composition and Function in the Adult Zebrafish Kidney
Institutions: University of Notre Dame.
The zebrafish model has emerged as a relevant system to study kidney development, regeneration and disease. Both the embryonic and adult zebrafish kidneys are composed of functional units known as nephrons, which are highly conserved with other vertebrates, including mammals. Research in zebrafish has recently demonstrated that two distinctive phenomena transpire after adult nephrons incur damage: first, there is robust regeneration within existing nephrons that replaces the destroyed tubule epithelial cells; second, entirely new nephrons are produced from renal progenitors in a process known as neonephrogenesis. In contrast, humans and other mammals seem to have only a limited ability for nephron epithelial regeneration. To date, the mechanisms responsible for these kidney regeneration phenomena remain poorly understood. Since adult zebrafish kidneys undergo both nephron epithelial regeneration and neonephrogenesis, they provide an outstanding experimental paradigm to study these events. Further, there is a wide range of genetic and pharmacological tools available in the zebrafish model that can be used to delineate the cellular and molecular mechanisms that regulate renal regeneration. One essential aspect of such research is the evaluation of nephron structure and function. This protocol describes a set of labeling techniques that can be used to gauge renal composition and test nephron functionality in the adult zebrafish kidney. Thus, these methods are widely applicable to the future phenotypic characterization of adult zebrafish kidney injury paradigms, which include but are not limited to, nephrotoxicant exposure regimes or genetic methods of targeted cell death such as the nitroreductase mediated cell ablation technique. Further, these methods could be used to study genetic perturbations in adult kidney formation and could also be applied to assess renal status during chronic disease modeling.
Cellular Biology, Issue 90,
zebrafish; kidney; nephron; nephrology; renal; regeneration; proximal tubule; distal tubule; segment; mesonephros; physiology; acute kidney injury (AKI)
Generation and Purification of Human INO80 Chromatin Remodeling Complexes and Subcomplexes
Institutions: Stowers Institute for Medical Research, Kansas University Medical Center.
INO80 chromatin remodeling complexes regulate nucleosome dynamics and DNA accessibility by catalyzing ATP-dependent nucleosome remodeling. Human INO80 complexes consist of 14 protein subunits including Ino80, a SNF2-like ATPase, which serves both as the catalytic subunit and the scaffold for assembly of the complexes. Functions of the other subunits and the mechanisms by which they contribute to the INO80 complex's chromatin remodeling activity remain poorly understood, in part due to the challenge of generating INO80 subassemblies in human cells or heterologous expression systems. This JOVE protocol describes a procedure that allows purification of human INO80 chromatin remodeling subcomplexes that are lacking a subunit or a subset of subunits. N-terminally FLAG epitope tagged Ino80 cDNA are stably introduced into human embryonic kidney (HEK) 293 cell lines using Flp-mediated recombination. In the event that a subset of subunits of the INO80 complex is to be deleted, one expresses instead mutant Ino80 proteins that lack the platform needed for assembly of those subunits. In the event an individual subunit is to be depleted, one transfects siRNAs targeting this subunit into an HEK 293 cell line stably expressing FLAG tagged Ino80 ATPase. Nuclear extracts are prepared, and FLAG immunoprecipitation is performed to enrich protein fractions containing Ino80 derivatives. The compositions of purified INO80 subcomplexes can then be analyzed using methods such as immunoblotting, silver staining, and mass spectrometry. The INO80 and INO80 subcomplexes generated according to this protocol can be further analyzed using various biochemical assays, which are described in the accompanying JOVE protocol. The methods described here can be adapted for studies of the structural and functional properties of any mammalian multi-subunit chromatin remodeling and modifying complexes.
Biochemistry, Issue 92, chromatin remodeling, INO80, SNF2 family ATPase, structure-function, enzyme purification
Combining Magnetic Sorting of Mother Cells and Fluctuation Tests to Analyze Genome Instability During Mitotic Cell Aging in Saccharomyces cerevisiae
Institutions: Rensselaer Polytechnic Institute.
has been an excellent model system for examining mechanisms and consequences of genome instability. Information gained from this yeast model is relevant to many organisms, including humans, since DNA repair and DNA damage response factors are well conserved across diverse species. However, S. cerevisiae
has not yet been used to fully address whether the rate of accumulating mutations changes with increasing replicative (mitotic) age due to technical constraints. For instance, measurements of yeast replicative lifespan through micromanipulation involve very small populations of cells, which prohibit detection of rare mutations. Genetic methods to enrich for mother cells in populations by inducing death of daughter cells have been developed, but population sizes are still limited by the frequency with which random mutations that compromise the selection systems occur. The current protocol takes advantage of magnetic sorting of surface-labeled yeast mother cells to obtain large enough populations of aging mother cells to quantify rare mutations through phenotypic selections. Mutation rates, measured through fluctuation tests, and mutation frequencies are first established for young cells and used to predict the frequency of mutations in mother cells of various replicative ages. Mutation frequencies are then determined for sorted mother cells, and the age of the mother cells is determined using flow cytometry by staining with a fluorescent reagent that detects bud scars formed on their cell surfaces during cell division. Comparison of predicted mutation frequencies based on the number of cell divisions to the frequencies experimentally observed for mother cells of a given replicative age can then identify whether there are age-related changes in the rate of accumulating mutations. Variations of this basic protocol provide the means to investigate the influence of alterations in specific gene functions or specific environmental conditions on mutation accumulation to address mechanisms underlying genome instability during replicative aging.
Microbiology, Issue 92, Aging, mutations, genome instability, Saccharomyces cerevisiae, fluctuation test, magnetic sorting, mother cell, replicative aging
Analyzing Protein Dynamics Using Hydrogen Exchange Mass Spectrometry
Institutions: University of Heidelberg.
All cellular processes depend on the functionality of proteins. Although the functionality of a given protein is the direct consequence of its unique amino acid sequence, it is only realized by the folding of the polypeptide chain into a single defined three-dimensional arrangement or more commonly into an ensemble of interconverting conformations. Investigating the connection between protein conformation and its function is therefore essential for a complete understanding of how proteins are able to fulfill their great variety of tasks. One possibility to study conformational changes a protein undergoes while progressing through its functional cycle is hydrogen-1
H-exchange in combination with high-resolution mass spectrometry (HX-MS). HX-MS is a versatile and robust method that adds a new dimension to structural information obtained by e.g.
crystallography. It is used to study protein folding and unfolding, binding of small molecule ligands, protein-protein interactions, conformational changes linked to enzyme catalysis, and allostery. In addition, HX-MS is often used when the amount of protein is very limited or crystallization of the protein is not feasible. Here we provide a general protocol for studying protein dynamics with HX-MS and describe as an example how to reveal the interaction interface of two proteins in a complex.
Chemistry, Issue 81, Molecular Chaperones, mass spectrometers, Amino Acids, Peptides, Proteins, Enzymes, Coenzymes, Protein dynamics, conformational changes, allostery, protein folding, secondary structure, mass spectrometry
Isolation and Chemical Characterization of Lipid A from Gram-negative Bacteria
Institutions: The University of Texas at Austin, The University of Texas at Austin, The University of Texas at Austin.
Lipopolysaccharide (LPS) is the major cell surface molecule of gram-negative bacteria, deposited on the outer leaflet of the outer membrane bilayer. LPS can be subdivided into three domains: the distal O-polysaccharide, a core oligosaccharide, and the lipid A domain consisting of a lipid A molecular species and 3-deoxy-D-manno-oct-2-ulosonic acid residues (Kdo). The lipid A domain is the only component essential for bacterial cell survival. Following its synthesis, lipid A is chemically modified in response to environmental stresses such as pH or temperature, to promote resistance to antibiotic compounds, and to evade recognition by mediators of the host innate immune response. The following protocol details the small- and large-scale isolation of lipid A from gram-negative bacteria. Isolated material is then chemically characterized by thin layer chromatography (TLC) or mass-spectrometry (MS). In addition to matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) MS, we also describe tandem MS protocols for analyzing lipid A molecular species using electrospray ionization (ESI) coupled to collision induced dissociation (CID) and newly employed ultraviolet photodissociation (UVPD) methods. Our MS protocols allow for unequivocal determination of chemical structure, paramount to characterization of lipid A molecules that contain unique or novel chemical modifications. We also describe the radioisotopic labeling, and subsequent isolation, of lipid A from bacterial cells for analysis by TLC. Relative to MS-based protocols, TLC provides a more economical and rapid characterization method, but cannot be used to unambiguously assign lipid A chemical structures without the use of standards of known chemical structure. Over the last two decades isolation and characterization of lipid A has led to numerous exciting discoveries that have improved our understanding of the physiology of gram-negative bacteria, mechanisms of antibiotic resistance, the human innate immune response, and have provided many new targets in the development of antibacterial compounds.
Chemistry, Issue 79, Membrane Lipids, Toll-Like Receptors, Endotoxins, Glycolipids, Lipopolysaccharides, Lipid A, Microbiology, Lipids, lipid A, Bligh-Dyer, thin layer chromatography (TLC), lipopolysaccharide, mass spectrometry, Collision Induced Dissociation (CID), Photodissociation (PD)
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Peptide-based Identification of Functional Motifs and their Binding Partners
Institutions: Morehouse School of Medicine, Institute for Systems Biology, Universiti Sains Malaysia.
Specific short peptides derived from motifs found in full-length proteins, in our case HIV-1 Nef, not only retain their biological function, but can also competitively inhibit the function of the full-length protein. A set of 20 Nef scanning peptides, 20 amino acids in length with each overlapping 10 amino acids of its neighbor, were used to identify motifs in Nef responsible for its induction of apoptosis. Peptides containing these apoptotic motifs induced apoptosis at levels comparable to the full-length Nef protein. A second peptide, derived from the Secretion Modification Region (SMR) of Nef, retained the ability to interact with cellular proteins involved in Nef's secretion in exosomes (exNef). This SMRwt peptide was used as the "bait" protein in co-immunoprecipitation experiments to isolate cellular proteins that bind specifically to Nef's SMR motif. Protein transfection and antibody inhibition was used to physically disrupt the interaction between Nef and mortalin, one of the isolated SMR-binding proteins, and the effect was measured with a fluorescent-based exNef secretion assay. The SMRwt peptide's ability to outcompete full-length Nef for cellular proteins that bind the SMR motif, make it the first inhibitor of exNef secretion. Thus, by employing the techniques described here, which utilize the unique properties of specific short peptides derived from motifs found in full-length proteins, one may accelerate the identification of functional motifs in proteins and the development of peptide-based inhibitors of pathogenic functions.
Virology, Issue 76, Biochemistry, Immunology, Infection, Infectious Diseases, Molecular Biology, Medicine, Genetics, Microbiology, Genomics, Proteins, Exosomes, HIV, Peptides, Exocytosis, protein trafficking, secretion, HIV-1, Nef, Secretion Modification Region, SMR, peptide, AIDS, assay
In Vitro Analysis of PDZ-dependent CFTR Macromolecular Signaling Complexes
Institutions: Wayne State University School of Medicine, Wayne State University School of Medicine, Wayne State University School of Medicine.
Cystic fibrosis transmembrane conductance regulator (CFTR), a chloride channel located primarily at the apical membranes of epithelial cells, plays a crucial role in transepithelial fluid homeostasis1-3
. CFTR has been implicated in two major diseases: cystic fibrosis (CF)4
and secretory diarrhea5
. In CF, the synthesis or functional activity of the CFTR Cl- channel is reduced. This disorder affects approximately 1 in 2,500 Caucasians in the United States6
. Excessive CFTR activity has also been implicated in cases of toxin-induced secretory diarrhea (e.g.
, by cholera toxin and heat stable E. coli
enterotoxin) that stimulates cAMP or cGMP production in the gut7
Accumulating evidence suggest the existence of physical and functional interactions between CFTR and a growing number of other proteins, including transporters, ion channels, receptors, kinases, phosphatases, signaling molecules, and cytoskeletal elements, and these interactions between CFTR and its binding proteins have been shown to be critically involved in regulating CFTR-mediated transepithelial ion transport in vitro
and also in vivo8-19
. In this protocol, we focus only on the methods that aid in the study of the interactions between CFTR carboxyl terminal tail, which possesses a protein-binding motif [referred to as PSD95/Dlg1/ZO-1 (PDZ) motif], and a group of scaffold proteins, which contain a specific binding module referred to as PDZ domains. So far, several different PDZ scaffold proteins have been reported to bind to the carboxyl terminal tail of CFTR with various affinities, such as NHERF1, NHERF2, PDZK1, PDZK2, CAL (CFTR-associated ligand), Shank2, and GRASP20-27
. The PDZ motif within CFTR that is recognized by PDZ scaffold proteins is the last four amino acids at the C terminus (i.e.
, 1477-DTRL-1480 in human CFTR)20
. Interestingly, CFTR can bind more than one PDZ domain of both NHERFs and PDZK1, albeit with varying affinities22
. This multivalency with respect to CFTR binding has been shown to be of functional significance, suggesting that PDZ scaffold proteins may facilitate formation of CFTR macromolecular signaling complexes for specific/selective and efficient signaling in cells16-18
Multiple biochemical assays have been developed to study CFTR-involving protein interactions, such as co-immunoprecipitation, pull-down assay, pair-wise binding assay, colorimetric pair-wise binding assay, and macromolecular complex assembly assay16-19,28,29
. Here we focus on the detailed procedures of assembling a PDZ motif-dependent CFTR-containing macromolecular complex in vitro
, which is used extensively by our laboratory to study protein-protein or domain-domain interactions involving CFTR16-19,28,29
Biochemistry, Issue 66, Molecular Biology, Chemistry, CFTR, macromolecular complex, protein interaction, PDZ scaffold protein, epithelial cell, cystic fibrosis
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
In Situ Hybridization for the Precise Localization of Transcripts in Plants
Institutions: Cold Spring Harbor Laboratory.
With the advances in genomics research of the past decade, plant biology has seen numerous studies presenting large-scale quantitative analyses of gene expression. Microarray and next generation sequencing approaches are being used to investigate developmental, physiological and stress response processes, dissect epigenetic and small RNA pathways, and build large gene regulatory networks1-3
. While these techniques facilitate the simultaneous analysis of large gene sets, they typically provide a very limited spatiotemporal resolution of gene expression changes. This limitation can be partially overcome by using either profiling method in conjunction with lasermicrodissection or fluorescence-activated cell sorting4-7
. However, to fully understand the biological role of a gene, knowledge of its spatiotemporal pattern of expression at a cellular resolution is essential. Particularly, when studying development or the effects of environmental stimuli and mutants can the detailed analysis of a gene's expression pattern become essential. For instance, subtle quantitative differences in the expression levels of key regulatory genes can lead to dramatic phenotypes when associated with the loss or gain of expression in specific cell types.
Several methods are routinely used for the detailed examination of gene expression patterns. One is through analysis of transgenic reporter lines. Such analysis can, however, become time-consuming when analyzing multiple genes or working in plants recalcitrant to transformation. Moreover, an independent validation to ensure that the transgene expression pattern mimics that of the endogenous gene is typically required. Immunohistochemical protein localization or mRNA in situ
hybridization present relatively fast alternatives for the direct visualization of gene expression within cells and tissues. The latter has the distinct advantage that it can be readily used on any gene of interest. In situ
hybridization allows detection of target mRNAs in cells by hybridization with a labeled anti-sense RNA probe obtained by in vitro
transcription of the gene of interest.
Here we outline a protocol for the in situ
localization of gene expression in plants that is highly sensitivity and specific. It is optimized for use with paraformaldehyde fixed, paraffin-embedded sections, which give excellent preservation of histology, and DIG-labeled probes that are visualized by immuno-detection and alkaline-phosphatase colorimetric reaction. This protocol has been successfully applied to a number of tissues from a wide range of plant species, and can be used to analyze expression of mRNAs as well as small RNAs8-14
Plant Biology, Issue 57, In Situ hybridization, RNA localization, expression analysis, plant, DIG-labeled probe
Non-radioactive in situ Hybridization Protocol Applicable for Norway Spruce and a Range of Plant Species
Institutions: Uppsala University, Swedish University of Agricultural Sciences.
The high-throughput expression analysis technologies available today give scientists an overflow of expression profiles but their resolution in terms of tissue specific expression is limited because of problems in dissecting individual tissues. Expression data needs to be confirmed and complemented with expression patterns using e.g. in situ
hybridization, a technique used to localize cell specific mRNA expression. The in situ
hybridization method is laborious, time-consuming and often requires extensive optimization depending on species and tissue. In situ
experiments are relatively more difficult to perform in woody species such as the conifer Norway spruce (Picea abies
). Here we present a modified DIG in situ
hybridization protocol, which is fast and applicable on a wide range of plant species including P. abies
. With just a few adjustments, including altered RNase treatment and proteinase K concentration, we could use the protocol to study tissue specific expression of homologous genes in male reproductive organs of one gymnosperm and two angiosperm species; P. abies, Arabidopsis thaliana
and Brassica napus
. The protocol worked equally well for the species and genes studied. AtAP3
were observed in second and third whorl floral organs in A. thaliana
and B. napus
and DAL13 in microsporophylls of male cones from P. abies
. For P. abies
the proteinase K concentration, used to permeablize the tissues, had to be increased to 3 g/ml instead of 1 g/ml, possibly due to more compact tissues and higher levels of phenolics and polysaccharides. For all species the RNase treatment was removed due to reduced signal strength without a corresponding increase in specificity. By comparing tissue specific expression patterns of homologous genes from both flowering plants and a coniferous tree we demonstrate that the DIG in situ
protocol presented here, with only minute adjustments, can be applied to a wide range of plant species. Hence, the protocol avoids both extensive species specific optimization and the laborious use of radioactively labeled probes in favor of DIG labeled probes. We have chosen to illustrate the technically demanding steps of the protocol in our film.
Anna Karlgren and Jenny Carlsson contributed equally to this study.
Corresponding authors: Anna Karlgren at Anna.Karlgren@ebc.uu.se and Jens F. Sundström at Jens.Sundstrom@vbsg.slu.se
Plant Biology, Issue 26, RNA, expression analysis, Norway spruce, Arabidopsis, rapeseed, conifers
In vivo and in vitro Studies of Adaptor-clathrin Interaction
Institutions: Colorado State University.
A major endocytic pathway initiates with the formation of clathrin-coated vesicles (CCVs) that transport cargo from the cell surface to endosomes1-6
. CCVs are distinguished by a polyhedral lattice of clathrin that coats the vesicle membrane and serves as a mechanical scaffold. Clathrin coats are assembled during vesicle formation from individual clathrin triskelia , the soluble form of clathrin composed of three heavy and three light chain subunits7,8
. Because the triskelion does not have the ability to bind to the membrane directly, clathrin-binding adaptors are critical to link the forming clathrin lattice to the membrane through association with lipids and/or membrane proteins9
. Adaptors also package transmembrane protein cargo, such as receptors, and can interact with each other and with other components of the CCV formation machinery9
Over twenty clathrin adaptors have been described, several are involved in clathrin mediated endocytosis and others localize to the trans Golgi network or endosomes9
. With the exception of HIP1R (yeast Sla2p), all known clathrin adaptors bind to the N-terminal -propeller domain of the clathrin heavy chain9
. Clathrin adaptors are modular proteins consisting of folded domains connected by unstructured flexible linkers. Within these linker regions, short binding motifs mediate interactions with the clathrin N-terminal domain or other components of the vesicle formation machinery9
. Two distinct clathrin-binding motifs have been defined: the clathrin-box and the W-box9
. The consensus clathrin-box sequence was originally defined as L[L/I][D/E/N][L/F][D/E]10
but variants have been subsequently discovered11
. The W-box conforms to the sequence PWxxW (where x is any residue).
Sla1p (Synthetic Lethal with Actin binding protein-1) was originally identified as an actin associated protein and is necessary for normal actin cytoskeleton structure and dynamics at endocytic sites in yeast cells12
. Sla1p also binds the NPFxD endocytic sorting signal and is critical for endocytosis of cargo bearing the NPFxD signal13,14
. More recently, Sla1p was demonstrated to bind clathrin through a motif similar to the clathrin box, LLDLQ, termed a variant clathrin-box (vCB), and to function as an endocytic clathrin adaptor15
. In addition, Sla1p has become a widely used marker for the endocytic coat in live cell fluorescence microscopy studies16
. Here we use Sla1p as a model to describe approaches for adaptor-clathrin interaction studies. We focus on live cell fluorescence microscopy, GST-pull down, and co-immunoprecipitation methods.
Cell Biology, Issue 47, clathrin, adaptor, Sla1p, pull down, immunoprecipitation, GFP, fluorescence microscopy
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Polarized Translocation of Fluorescent Proteins in Xenopus Ectoderm in Response to Wnt Signaling
Institutions: Mount Sinai School of Medicine .
Cell polarity is a fundamental property of eukaryotic cells that is dynamically regulated by both intrinsic and extrinsic factors during embryonic development 1, 2
. One of the signaling pathways involved in this regulation is the Wnt pathway, which is used many times during embryogenesis and critical for human disease3, 4, 5
. Multiple molecular components of this pathway coordinately regulate signaling in a spatially-restricted manner, but the underlying mechanisms are not fully understood. Xenopus
embryonic epithelial cells is an excellent system to study subcellular localization of various signaling proteins. Fluorescent fusion proteins are expressed in Xenopus
embryos by RNA microinjection, ectodermal explants are prepared and protein localization is evaluated by epifluorescence. In this experimental protocol we describe how subcellular localization of Diversin, a cytoplasmic protein that has been implicated in signaling and cell polarity determination6, 7
is visualized in Xenopus
ectodermal cells to study Wnt signal transduction8
. Coexpression of a Wnt ligand or a Frizzled receptor alters the distribution of Diversin fused with red fluorescent protein, RFP, and recruits it to the cell membrane in a polarized fashion 8, 9
. This ex vivo
protocol should be a useful addition to in vitro
studies of cultured mammalian cells, in which spatial control of signaling differs from that of the intact tissue and is much more difficult to analyze.
Developmental Biology, Issue 51, Xenopus embryo, ectoderm, Diversin, Frizzled, membrane recruitment, polarity, Wnt
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1
. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1
. In this article, we utilize a web version of SCOPE2
to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4
and has been used in other studies5-8
The three algorithms that comprise SCOPE are BEAM9
, which finds non-degenerate motifs (ACCGGT), PRISM10
, which finds degenerate motifs (ASCGWT), and SPACER11
, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif
Propagation of Human Embryonic Stem (ES) Cells
Institutions: MGH - Massachusetts General Hospital.
Cellular Biology, Issue 1, ES, embryonic stem cells, tissue culture