Genome sequencing projects have ciphered millions of protein sequence, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which tells how accurate the predictions are without knowing the experimental data. To facilitate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any proteins as template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidences or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best programs for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
26 Related JoVE Articles!
Genome-wide Gene Deletions in Streptococcus sanguinis by High Throughput PCR
Institutions: Virginia Commonwealth University.
Transposon mutagenesis and single-gene deletion are two methods applied in genome-wide gene knockout in bacteria 1,2
. Although transposon mutagenesis is less time consuming, less costly, and does not require completed genome information, there are two weaknesses in this method: (1) the possibility of a disparate mutants in the mixed mutant library that counter-selects mutants with decreased competition; and (2) the possibility of partial gene inactivation whereby genes do not entirely lose their function following the insertion of a transposon. Single-gene deletion analysis may compensate for the drawbacks associated with transposon mutagenesis. To improve the efficiency of genome-wide single gene deletion, we attempt to establish a high-throughput technique for genome-wide single gene deletion using Streptococcus sanguinis
as a model organism. Each gene deletion construct in S. sanguinis
genome is designed to comprise 1-kb upstream of the targeted gene, the aphA-3
gene, encoding kanamycin resistance protein, and 1-kb downstream of the targeted gene. Three sets of primers F1/R1, F2/R2, and F3/R3, respectively, are designed and synthesized in a 96-well plate format for PCR-amplifications of those three components of each deletion construct. Primers R1 and F3 contain 25-bp sequences that are complementary to regions of the aphA-3
gene at their 5' end. A large scale PCR amplification of the aphA-3
gene is performed once for creating all single-gene deletion constructs. The promoter of aphA-3
gene is initially excluded to minimize the potential polar effect of kanamycin cassette. To create the gene deletion constructs, high-throughput PCR amplification and purification are performed in a 96-well plate format. A linear recombinant PCR amplicon for each gene deletion will be made up through four PCR reactions using high-fidelity DNA polymerase. The initial exponential growth phase of S. sanguinis
cultured in Todd Hewitt broth supplemented with 2.5% inactivated horse serum is used to increase competence for the transformation of PCR-recombinant constructs. Under this condition, up to 20% of S. sanguinis
cells can be transformed using ~50 ng of DNA. Based on this approach, 2,048 mutants with single-gene deletion were ultimately obtained from the 2,270 genes in S. sanguinis
excluding four gene ORFs contained entirely within other ORFs in S. sanguinis
SK36 and 218 potential essential genes. The technique on creating gene deletion constructs is high throughput and could be easy to use in genome-wide single gene deletions for any transformable bacteria.
Genetics, Issue 69, Microbiology, Molecular Biology, Biomedical Engineering, Genomics, Streptococcus sanguinis, Streptococcus, Genome-wide gene deletions, genes, High-throughput, PCR
Preparation of the Mgm101 Recombination Protein by MBP-based Tagging Strategy
Institutions: State University of New York Upstate Medical University.
gene was identified 20 years ago for its role in the maintenance of mitochondrial DNA. Studies from several groups have suggested that the Mgm101 protein is involved in the recombinational repair of mitochondrial DNA. Recent investigations have indicated that Mgm101 is related to the Rad52-type recombination protein family. These proteins form large oligomeric rings and promote the annealing of homologous single stranded DNA molecules. However, the characterization of Mgm101 has been hindered by the difficulty in producing the recombinant protein. Here, a reliable procedure for the preparation of recombinant Mgm101 is described. Maltose Binding Protein (MBP)-tagged Mgm101 is first expressed in Escherichia coli
. The fusion protein is initially purified by amylose affinity chromatography. After being released by proteolytic cleavage, Mgm101 is separated from MBP by cationic exchange chromatography. Monodispersed Mgm101 is then obtained by size exclusion chromatography. A yield of ~0.87 mg of Mgm101 per liter of bacterial culture can be routinely obtained. The recombinant Mgm101 has minimal contamination of DNA. The prepared samples are successfully used for biochemical, structural and single particle image analyses of Mgm101. This protocol may also be used for the preparation of other large oligomeric DNA-binding proteins that may be misfolded and toxic to bacterial cells.
Biochemistry, Issue 76, Genetics, Molecular Biology, Cellular Biology, Microbiology, Bacteria, Proteins, Mgm101, Rad52, mitochondria, recombination, mtDNA, maltose-binding protein, MBP, E. coli., yeast, Saccharomyces cerevisiae, chromatography, electron microscopy, cell culture
Sample Preparation of Mycobacterium tuberculosis Extracts for Nuclear Magnetic Resonance Metabolomic Studies
Institutions: University of Nebraska-Lincoln, University of Nebraska-Lincoln.
is a major cause of mortality in human beings on a global scale. The emergence of both multi- (MDR) and extensively-(XDR) drug-resistant strains threatens to derail current disease control efforts. Thus, there is an urgent need to develop drugs and vaccines that are more effective than those currently available. The genome of M. tuberculosis
has been known for more than 10 years, yet there are important gaps in our knowledge of gene function and essentiality. Many studies have since used gene expression analysis at both the transcriptomic and proteomic levels to determine the effects of drugs, oxidants, and growth conditions on the global patterns of gene expression. Ultimately, the final response of these changes is reflected in the metabolic composition of the bacterium including a few thousand small molecular weight chemicals. Comparing the metabolic profiles of wild type and mutant strains, either untreated or treated with a particular drug, can effectively allow target identification and may lead to the development of novel inhibitors with anti-tubercular activity. Likewise, the effects of two or more conditions on the metabolome can also be assessed. Nuclear magnetic resonance (NMR) is a powerful technology that is used to identify and quantify metabolic intermediates. In this protocol, procedures for the preparation of M. tuberculosis
cell extracts for NMR metabolomic analysis are described. Cell cultures are grown under appropriate conditions and required Biosafety Level 3 containment,1
harvested, and subjected to mechanical lysis while maintaining cold temperatures to maximize preservation of metabolites. Cell lysates are recovered, filtered sterilized, and stored at ultra-low temperatures. Aliquots from these cell extracts are plated on Middlebrook 7H9 agar for colony-forming units to verify absence of viable cells. Upon two months of incubation at 37 °C, if no viable colonies are observed, samples are removed from the containment facility for downstream processing. Extracts are lyophilized, resuspended in deuterated buffer and injected in the NMR instrument, capturing spectroscopic data that is then subjected to statistical analysis. The procedures described can be applied for both one-dimensional (1D) 1
H NMR and two-dimensional (2D) 1
C NMR analyses. This methodology provides more reliable small molecular weight metabolite identification and more reliable and sensitive quantitative analyses of cell extract metabolic compositions than chromatographic methods. Variations of the procedure described following the cell lysis step can also be adapted for parallel proteomic analysis.
Infection, Issue 67, Mycobacterium tuberculosis, NMR, Metabolomics, homogenizer, lysis, cell extracts, sample preparation
Demonstrating a Multi-drug Resistant Mycobacterium tuberculosis Amplification Microarray
Institutions: Akonni Biosystems, Inc..
Simplifying microarray workflow is a necessary first step for creating MDR-TB microarray-based diagnostics that can be routinely used in lower-resource environments. An amplification microarray combines asymmetric PCR amplification, target size selection, target labeling, and microarray hybridization within a single solution and into a single microfluidic chamber. A batch processing method is demonstrated with a 9-plex asymmetric master mix and low-density gel element microarray for genotyping multi-drug resistant Mycobacterium tuberculosis
(MDR-TB). The protocol described here can be completed in 6 hr and provide correct genotyping with at least 1,000 cell equivalents of genomic DNA. Incorporating on-chip wash steps is feasible, which will result in an entirely closed amplicon method and system. The extent of multiplexing with an amplification microarray is ultimately constrained by the number of primer pairs that can be combined into a single master mix and still achieve desired sensitivity and specificity performance metrics, rather than the number of probes that are immobilized on the array. Likewise, the total analysis time can be shortened or lengthened depending on the specific intended use, research question, and desired limits of detection. Nevertheless, the general approach significantly streamlines microarray workflow for the end user by reducing the number of manually intensive and time-consuming processing steps, and provides a simplified biochemical and microfluidic path for translating microarray-based diagnostics into routine clinical practice.
Immunology, Issue 86, MDR-TB, gel element microarray, closed amplicon, drug resistance, rifampin, isoniazid, streptomycin, ethambutol
A Microscopic Phenotypic Assay for the Quantification of Intracellular Mycobacteria Adapted for High-throughput/High-content Screening
Institutions: Université de Lille.
Despite the availability of therapy and vaccine, tuberculosis (TB) remains one of the most deadly and widespread bacterial infections in the world. Since several decades, the sudden burst of multi- and extensively-drug resistant strains is a serious threat for the control of tuberculosis. Therefore, it is essential to identify new targets and pathways critical for the causative agent of the tuberculosis, Mycobacterium tuberculosis
) and to search for novel chemicals that could become TB drugs. One approach is to set up methods suitable for the genetic and chemical screens of large scale libraries enabling the search of a needle in a haystack. To this end, we developed a phenotypic assay relying on the detection of fluorescently labeled Mtb
within fluorescently labeled host cells using automated confocal microscopy. This in vitro
assay allows an image based quantification of the colonization process of Mtb
into the host and was optimized for the 384-well microplate format, which is proper for screens of siRNA-, chemical compound- or Mtb
mutant-libraries. The images are then processed for multiparametric analysis, which provides read out inferring on the pathogenesis of Mtb
within host cells.
Infection, Issue 83, Mycobacterium tuberculosis, High-content/High-throughput screening, chemogenomics, Drug Discovery, siRNA library, automated confocal microscopy, image-based analysis
Single Cell Measurements of Vacuolar Rupture Caused by Intracellular Pathogens
Institutions: Institut Pasteur, Paris, France, Institut Pasteur, Paris, France, Institut Pasteur, Paris, France.
are pathogenic bacteria that invade host cells entering into an endocytic vacuole. Subsequently, the rupture of this membrane-enclosed compartment allows bacteria to move within the cytosol, proliferate and further invade neighboring cells. Mycobacterium tuberculosis
is phagocytosed by immune cells, and has recently been shown to rupture phagosomal membrane in macrophages. We developed a robust assay for tracking phagosomal membrane disruption after host cell entry of Shigella flexneri
or Mycobacterium tuberculosis
. The approach makes use of CCF4, a FRET reporter sensitive to β-lactamase that equilibrates in the cytosol of host cells. Upon invasion of host cells by bacterial pathogens, the probe remains intact as long as the bacteria reside in membrane-enclosed compartments. After disruption of the vacuole, β-lactamase activity on the surface of the intracellular pathogen cleaves CCF4 instantly leading to a loss of FRET signal and switching its emission spectrum. This robust ratiometric assay yields accurate information about the timing of vacuolar rupture induced by the invading bacteria, and it can be coupled to automated microscopy and image processing by specialized algorithms for the detection of the emission signals of the FRET donor and acceptor. Further, it allows investigating the dynamics of vacuolar disruption elicited by intracellular bacteria in real time in single cells. Finally, it is perfectly suited for high-throughput analysis with a spatio-temporal resolution exceeding previous methods. Here, we provide the experimental details of exemplary protocols for the CCF4 vacuolar rupture assay on HeLa cells and THP-1 macrophages for time-lapse experiments or end points experiments using Shigella flexneri
as well as multiple mycobacterial strains such as Mycobacterium marinum
, Mycobacterium bovis,
and Mycobacterium tuberculosis
Infection, Issue 76, Infectious Diseases, Immunology, Medicine, Microbiology, Biochemistry, Cellular Biology, Molecular Biology, Pathology, Bacteria, biology (general), life sciences, CCF4-AM, Shigella flexneri, Mycobacterium tuberculosis, vacuolar rupture, fluorescence microscopy, confocal microscopy, pathogens, cell culture
Hydrogel Nanoparticle Harvesting of Plasma or Urine for Detecting Low Abundance Proteins
Institutions: George Mason University, Ceres Nanosciences.
Novel biomarker discovery plays a crucial role in providing more sensitive and specific disease detection. Unfortunately many low-abundance biomarkers that exist in biological fluids cannot be easily detected with mass spectrometry or immunoassays because they are present in very low concentration, are labile, and are often masked by high-abundance proteins such as albumin or immunoglobulin. Bait containing poly(N-isopropylacrylamide) (NIPAm) based nanoparticles are able to overcome these physiological barriers. In one step they are able to capture, concentrate and preserve biomarkers from body fluids. Low-molecular weight analytes enter the core of the nanoparticle and are captured by different organic chemical dyes, which act as high affinity protein baits. The nanoparticles are able to concentrate the proteins of interest by several orders of magnitude. This concentration factor is sufficient to increase the protein level such that the proteins are within the detection limit of current mass spectrometers, western blotting, and immunoassays. Nanoparticles can be incubated with a plethora of biological fluids and they are able to greatly enrich the concentration of low-molecular weight proteins and peptides while excluding albumin and other high-molecular weight proteins. Our data show that a 10,000 fold amplification in the concentration of a particular analyte can be achieved, enabling mass spectrometry and immunoassays to detect previously undetectable biomarkers.
Bioengineering, Issue 90, biomarker, hydrogel, low abundance, mass spectrometry, nanoparticle, plasma, protein, urine
An Experimental Model to Study Tuberculosis-Malaria Coinfection upon Natural Transmission of Mycobacterium tuberculosis and Plasmodium berghei
Institutions: University Hospital Heidelberg, Research Center Borstel.
Coinfections naturally occur due to the geographic overlap of distinct types of pathogenic organisms. Concurrent infections most likely modulate the respective immune response to each single pathogen and may thereby affect pathogenesis and disease outcome. Coinfected patients may also respond differentially to anti-infective interventions. Coinfection between tuberculosis as caused by mycobacteria and the malaria parasite Plasmodium
, both of which are coendemic in many parts of sub-Saharan Africa, has not been studied in detail. In order to approach the challenging but scientifically and clinically highly relevant question how malaria-tuberculosis coinfection modulate host immunity and the course of each disease, we established an experimental mouse model that allows us to dissect the elicited immune responses to both pathogens in the coinfected host. Of note, in order to most precisely mimic naturally acquired human infections, we perform experimental infections of mice with both pathogens by their natural routes of infection, i.e.
aerosol and mosquito bite, respectively.
Infectious Diseases, Issue 84, coinfection, mouse, Tuberculosis, Malaria, Plasmodium berghei, Mycobacterium tuberculosis, natural transmission
A Novel Microdissection Approach to Recovering Mycobacterium tuberculosis Specific Transcripts from Formalin Fixed Paraffin Embedded Lung Granulomas
Institutions: Tulane National Primate Research Center, Tulane National Primate Research Center.
Microdissection has been used for the examination of tissues at DNA, RNA, and protein levels for over a decade. Laser capture microscopy (LCM) is the most common microdissection technique used today. In this technique, a laser is used to focally melt a thermoplastic membrane that overlies a dehydrated tissue section1
. The tissue section composite is then lifted and separated from the membrane. Although this technique can be used successfully for tissue examination, it is time consuming and expensive. Furthermore, the successful completion of procedures using this technique requires the use of a laser, thus limiting its use. A new more affordable and practical microdissection approach called mesodissection is a possible solution to the pitfalls of LCM. This technique employs the MESO-1/MeSectr system to mill the desired tissue from a slide mounted tissue sample while concurrently dispensing and aspirating fluid to recover the desired tissue sample into a consumable mill bit. Before the dissection process begins, the user aligns the formalin fixed paraffin embedded (FFPE) slide with a hematoxylin and eosin stained (H&E) reference slide. Thereafter, the operator annotates the desired dissection area and proceeds to dissect the appropriate segment. The program generates an archived image of the dissection. The main advantage of mesodissection is the short duration needed to dissect a slide, taking an average of ten minutes from set up to sample generation in this experiment. Additionally, the system is significantly more cost effective and user friendly. A slight disadvantage is that it is not as precise as laser capture microscopy. In this article we demonstrate how mesodissection can be used to extract RNA from slides from FFPE granulomas caused by Mycobacterium tuberculosis (Mtb)
Immunology, Issue 88, Microdissection, mesodissection, formalin fixed paraffin embedded, Mtb, LCM, TB, Mycobacterium tuberculosis
FtsZ Polymerization Assays: Simple Protocols and Considerations
Institutions: University of Groningen.
During bacterial cell division, the essential protein FtsZ assembles in the middle of the cell to form the so-called Z-ring. FtsZ polymerizes into long filaments in the presence of GTP in vitro
, and polymerization is regulated by several accessory proteins. FtsZ polymerization has been extensively studied in vitro
using basic methods including light scattering, sedimentation, GTP hydrolysis assays and electron microscopy. Buffer conditions influence both the polymerization properties of FtsZ, and the ability of FtsZ to interact with regulatory proteins. Here, we describe protocols for FtsZ polymerization studies and validate conditions and controls using Escherichia coli
and Bacillus subtilis
FtsZ as model proteins. A low speed sedimentation assay is introduced that allows the study of the interaction of FtsZ with proteins that bundle or tubulate FtsZ polymers. An improved GTPase assay protocol is described that allows testing of GTP hydrolysis over time using various conditions in a 96-well plate setup, with standardized incubation times that abolish variation in color development in the phosphate detection reaction. The preparation of samples for light scattering studies and electron microscopy is described. Several buffers are used to establish suitable buffer pH and salt concentration for FtsZ polymerization studies. A high concentration of KCl is the best for most of the experiments. Our methods provide a starting point for the in vitro
characterization of FtsZ, not only from E. coli
and B. subtilis
but from any other bacterium. As such, the methods can be used for studies of the interaction of FtsZ with regulatory proteins or the testing of antibacterial drugs which may affect FtsZ polymerization.
Basic Protocols, Issue 81, FtsZ, protein polymerization, cell division, GTPase, sedimentation assay, light scattering
Growth of Mycobacterium tuberculosis Biofilms
Institutions: University of Pittsburgh, University of Pittsburgh.
, the etiologic agent of human tuberculosis, has an extraordinary ability to survive against environmental stresses including antibiotics. Although stress tolerance of M. tuberculosis
is one of the likely contributors to the 6-month long chemotherapy of tuberculosis 1
, the molecular mechanisms underlying this characteristic phenotype of the pathogen remain unclear. Many microbial species have evolved to survive in stressful environments by self-assembling in highly organized, surface attached, and matrix encapsulated structures called biofilms 2-4
. Growth in communities appears to be a preferred survival strategy of microbes, and is achieved through genetic components that regulate surface attachment, intercellular communications, and synthesis of extracellular polymeric substances (EPS) 5,6
. The tolerance to environmental stress is likely facilitated by EPS, and perhaps by the physiological adaptation of individual bacilli to heterogeneous microenvironments within the complex architecture of biofilms 7
In a series of recent papers we established that M. tuberculosis
and Mycobacterium smegmatis
have a strong propensity to grow in organized multicellular structures, called biofilms, which can tolerate more than 50 times the minimal inhibitory concentrations of the anti-tuberculosis drugs isoniazid and rifampicin 8-10
. M. tuberculosis,
however, intriguingly requires specific conditions to form mature biofilms, in particular 9:1 ratio of headspace: media as well as limited exchange of air with the atmosphere 9
. Requirements of specialized environmental conditions could possibly be linked to the fact that M. tuberculosis
is an obligate human pathogen and thus has adapted to tissue environments. In this publication we demonstrate methods for culturing M. tuberculosis
biofilms in a bottle and a 12-well plate format, which is convenient for bacteriological as well as genetic studies. We have described the protocol for an attenuated strain of M. tuberculosis
, with deletion in the two loci, panCD
that are critical for in vivo
growth of the pathogen 9
. This strain can be safely used in a BSL-2 containment for understanding the basic biology of the tuberculosis pathogen thus avoiding the requirement of an expensive BSL-3 facility. The method can be extended, with appropriate modification in media, to grow biofilm of other culturable mycobacterial species.
Overall, a uniform protocol of culturing mycobacterial biofilms will help the investigators interested in studying the basic resilient characteristics of mycobacteria. In addition, a clear and concise method of growing mycobacterial biofilms will also help the clinical and pharmaceutical investigators to test the efficacy of a potential drug.
Immunology, Issue 60, Mycobacterium tuberculosis, tuberculosis, drug tolerance, biofilms
The MODS method for diagnosis of tuberculosis and multidrug resistant tuberculosis
Institutions: The Warren Alpert Medical School of Brown University, Universidad Peruana Cayetano Heredia, Johns Hopkins Bloomberg School of Public Health, Imperial College London .
Patients with active pulmonary tuberculosis (TB) infect 10-15 other persons per year, making diagnosing active TB essential to both curing the patient and preventing new infections. Furthermore, the emergence of multidrug resistant tuberculosis (MDRTB) means that detection of drug resistance is necessary for stopping the spread of drug-resistant strains. The microscopic-observation drug-susceptibility (MODS) assay is a low-cost, low-tech tool for high-performance detection of TB and MDRTB. The MODS assay is based on three principles: 1) mycobacterium tuberculosis (MTB) grows faster in liquid media than on solid media 2) microscopic MTB growth can be detected earlier in liquid media than waiting for the macroscopic appearance of colonies on solid media, and that growth is characteristic of MTB, allowing it to be distinguished from atypical mycobacteria or fungal or bacterial contamination 3) the drugs isoniazid and rifampicin can be incorporated into the MODS assay to allow for simultaneous direct detection of MDRTB, obviating the need for subculture to perform an indirect drug susceptibility test. Competing current diagnostics are hampered by low sensitivity with sputum smear, long delays until diagnosis with solid media culture, prohibitively high cost with existing liquid media culture methods, and the need to do subculture for indirect drug susceptibility testing to detect MDRTB. In contrast, the non-proprietary MODS method has a high sensitivity for TB and MDRTB, is a relatively rapid culture method, provides simultaneous drug susceptibility testing for MDRTB, and is accessible to resource-limited settings at just under $3 for testing for TB and MDRTB.
Microbiology, Issue 18, tuberculosis, TB, multidrug resistant tuberculosis, MDRTB, culture, diagnostic
Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
Given the ever expanding number of model plant species for which complete genome sequences are available and the abundance of bio-resources such as knockout mutants, wild accessions and advanced breeding populations, there is a rising burden for gene functional annotation. In this protocol, annotation of plant gene function using combined co-expression gene analysis, metabolomics and informatics is provided (Figure 1
). This approach is based on the theory of using target genes of known function to allow the identification of non-annotated genes likely to be involved in a certain metabolic process, with the identification of target compounds via metabolomics. Strategies are put forward for applying this information on populations generated by both forward and reverse genetics approaches in spite of none of these are effortless. By corollary this approach can also be used as an approach to characterise unknown peaks representing new or specific secondary metabolites in the limited tissues, plant species or stress treatment, which is currently the important trial to understanding plant metabolism.
Plant Biology, Issue 64, Genetics, Bioinformatics, Metabolomics, Plant metabolism, Transcriptome analysis, Functional annotation, Computational biology, Plant biology, Theoretical biology, Spectroscopy and structural analysis
DNA-affinity-purified Chip (DAP-chip) Method to Determine Gene Targets for Bacterial Two component Regulatory Systems
Institutions: Lawrence Berkeley National Laboratory.
methods such as ChIP-chip are well-established techniques used to determine global gene targets for transcription factors. However, they are of limited use in exploring bacterial two component regulatory systems with uncharacterized activation conditions. Such systems regulate transcription only when activated in the presence of unique signals. Since these signals are often unknown, the in vitro
microarray based method described in this video article can be used to determine gene targets and binding sites for response regulators. This DNA-affinity-purified-chip method may be used for any purified regulator in any organism with a sequenced genome. The protocol involves allowing the purified tagged protein to bind to sheared genomic DNA and then affinity purifying the protein-bound DNA, followed by fluorescent labeling of the DNA and hybridization to a custom tiling array. Preceding steps that may be used to optimize the assay for specific regulators are also described. The peaks generated by the array data analysis are used to predict binding site motifs, which are then experimentally validated. The motif predictions can be further used to determine gene targets of orthologous response regulators in closely related species. We demonstrate the applicability of this method by determining the gene targets and binding site motifs and thus predicting the function for a sigma54-dependent response regulator DVU3023 in the environmental bacterium Desulfovibrio vulgaris
Genetics, Issue 89, DNA-Affinity-Purified-chip, response regulator, transcription factor binding site, two component system, signal transduction, Desulfovibrio, lactate utilization regulator, ChIP-chip
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Determination of Protein-ligand Interactions Using Differential Scanning Fluorimetry
Institutions: University of Exeter.
A wide range of methods are currently available for determining the dissociation constant between a protein and interacting small molecules. However, most of these require access to specialist equipment, and often require a degree of expertise to effectively establish reliable experiments and analyze data. Differential scanning fluorimetry (DSF) is being increasingly used as a robust method for initial screening of proteins for interacting small molecules, either for identifying physiological partners or for hit discovery. This technique has the advantage that it requires only a PCR machine suitable for quantitative PCR, and so suitable instrumentation is available in most institutions; an excellent range of protocols are already available; and there are strong precedents in the literature for multiple uses of the method. Past work has proposed several means of calculating dissociation constants from DSF data, but these are mathematically demanding. Here, we demonstrate a method for estimating dissociation constants from a moderate amount of DSF experimental data. These data can typically be collected and analyzed within a single day. We demonstrate how different models can be used to fit data collected from simple binding events, and where cooperative binding or independent binding sites are present. Finally, we present an example of data analysis in a case where standard models do not apply. These methods are illustrated with data collected on commercially available control proteins, and two proteins from our research program. Overall, our method provides a straightforward way for researchers to rapidly gain further insight into protein-ligand interactions using DSF.
Biophysics, Issue 91, differential scanning fluorimetry, dissociation constant, protein-ligand interactions, StepOne, cooperativity, WcbI.
The ChroP Approach Combines ChIP and Mass Spectrometry to Dissect Locus-specific Proteomic Landscapes of Chromatin
Institutions: European Institute of Oncology.
Chromatin is a highly dynamic nucleoprotein complex made of DNA and proteins that controls various DNA-dependent processes. Chromatin structure and function at specific regions is regulated by the local enrichment of histone post-translational modifications (hPTMs) and variants, chromatin-binding proteins, including transcription factors, and DNA methylation. The proteomic characterization of chromatin composition at distinct functional regions has been so far hampered by the lack of efficient protocols to enrich such domains at the appropriate purity and amount for the subsequent in-depth analysis by Mass Spectrometry (MS). We describe here a newly designed chromatin proteomics strategy, named ChroP (Chromatin Proteomics
), whereby a preparative chromatin immunoprecipitation is used to isolate distinct chromatin regions whose features, in terms of hPTMs, variants and co-associated non-histonic proteins, are analyzed by MS. We illustrate here the setting up of ChroP for the enrichment and analysis of transcriptionally silent heterochromatic regions, marked by the presence of tri-methylation of lysine 9 on histone H3. The results achieved demonstrate the potential of ChroP
in thoroughly characterizing the heterochromatin proteome and prove it as a powerful analytical strategy for understanding how the distinct protein determinants of chromatin interact and synergize to establish locus-specific structural and functional configurations.
Biochemistry, Issue 86, chromatin, histone post-translational modifications (hPTMs), epigenetics, mass spectrometry, proteomics, SILAC, chromatin immunoprecipitation , histone variants, chromatome, hPTMs cross-talks
High Throughput Quantitative Expression Screening and Purification Applied to Recombinant Disulfide-rich Venom Proteins Produced in E. coli
Institutions: Aix-Marseille Université, Commissariat à l'énergie atomique et aux énergies alternatives (CEA) Saclay, France.
Escherichia coli (E. coli)
is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli
cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
Bioengineering, Issue 89, E. coli, expression, recombinant, high throughput (HTP), purification, auto-induction, immobilized metal affinity chromatography (IMAC), tobacco etch virus protease (TEV) cleavage, disulfide bond isomerase C (DsbC) fusion, disulfide bonds, animal venom proteins/peptides
Mapping Bacterial Functional Networks and Pathways in Escherichia Coli using Synthetic Genetic Arrays
Institutions: University of Toronto, University of Toronto, University of Regina.
Phenotypes are determined by a complex series of physical (e.g.
protein-protein) and functional (e.g.
gene-gene or genetic) interactions (GI)1
. While physical interactions can indicate which bacterial proteins are associated as complexes, they do not necessarily reveal pathway-level functional relationships1. GI screens, in which the growth of double mutants bearing two deleted or inactivated genes is measured and compared to the corresponding single mutants, can illuminate epistatic dependencies between loci and hence provide a means to query and discover novel functional relationships2
. Large-scale GI maps have been reported for eukaryotic organisms like yeast3-7
, but GI information remains sparse for prokaryotes8
, which hinders the functional annotation of bacterial genomes. To this end, we and others have developed high-throughput quantitative bacterial GI screening methods9, 10
Here, we present the key steps required to perform quantitative E. coli
Synthetic Genetic Array (eSGA) screening procedure on a genome-scale9
, using natural bacterial conjugation and homologous recombination to systemically generate and measure the fitness of large numbers of double mutants in a colony array format.
Briefly, a robot is used to transfer, through conjugation, chloramphenicol (Cm) - marked mutant alleles from engineered Hfr (High frequency of recombination) 'donor strains' into an ordered array of kanamycin (Kan) - marked F- recipient strains. Typically, we use loss-of-function single mutants bearing non-essential gene deletions (e.g.
the 'Keio' collection11
) and essential gene hypomorphic mutations (i.e.
alleles conferring reduced protein expression, stability, or activity9, 12, 13
) to query the functional associations of non-essential and essential genes, respectively. After conjugation and ensuing genetic exchange mediated by homologous recombination, the resulting double mutants are selected on solid medium containing both antibiotics. After outgrowth, the plates are digitally imaged and colony sizes are quantitatively scored using an in-house automated image processing system14
. GIs are revealed when the growth rate of a double mutant is either significantly better or worse than expected9
. Aggravating (or negative) GIs often result between loss-of-function mutations in pairs of genes from compensatory pathways that impinge on the same essential process2
. Here, the loss of a single gene is buffered, such that either single mutant is viable. However, the loss of both pathways is deleterious and results in synthetic lethality or sickness (i.e.
slow growth). Conversely, alleviating (or positive) interactions can occur between genes in the same pathway or protein complex2
as the deletion of either gene alone is often sufficient to perturb the normal function of the pathway or complex such that additional perturbations do not reduce activity, and hence growth, further. Overall, systematically identifying and analyzing GI networks can provide unbiased, global maps of the functional relationships between large numbers of genes, from which pathway-level information missed by other approaches can be inferred9
Genetics, Issue 69, Molecular Biology, Medicine, Biochemistry, Microbiology, Aggravating, alleviating, conjugation, double mutant, Escherichia coli, genetic interaction, Gram-negative bacteria, homologous recombination, network, synthetic lethality or sickness, suppression
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSCs self-renewal and differentiation is important for application of HSCs for research and clinical uses. However, it is not possible to obtain large quantity of HSCs due to their inability to proliferate in vitro
. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro
and in vivo
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
Analysis of Nephron Composition and Function in the Adult Zebrafish Kidney
Institutions: University of Notre Dame.
The zebrafish model has emerged as a relevant system to study kidney development, regeneration and disease. Both the embryonic and adult zebrafish kidneys are composed of functional units known as nephrons, which are highly conserved with other vertebrates, including mammals. Research in zebrafish has recently demonstrated that two distinctive phenomena transpire after adult nephrons incur damage: first, there is robust regeneration within existing nephrons that replaces the destroyed tubule epithelial cells; second, entirely new nephrons are produced from renal progenitors in a process known as neonephrogenesis. In contrast, humans and other mammals seem to have only a limited ability for nephron epithelial regeneration. To date, the mechanisms responsible for these kidney regeneration phenomena remain poorly understood. Since adult zebrafish kidneys undergo both nephron epithelial regeneration and neonephrogenesis, they provide an outstanding experimental paradigm to study these events. Further, there is a wide range of genetic and pharmacological tools available in the zebrafish model that can be used to delineate the cellular and molecular mechanisms that regulate renal regeneration. One essential aspect of such research is the evaluation of nephron structure and function. This protocol describes a set of labeling techniques that can be used to gauge renal composition and test nephron functionality in the adult zebrafish kidney. Thus, these methods are widely applicable to the future phenotypic characterization of adult zebrafish kidney injury paradigms, which include but are not limited to, nephrotoxicant exposure regimes or genetic methods of targeted cell death such as the nitroreductase mediated cell ablation technique. Further, these methods could be used to study genetic perturbations in adult kidney formation and could also be applied to assess renal status during chronic disease modeling.
Cellular Biology, Issue 90,
zebrafish; kidney; nephron; nephrology; renal; regeneration; proximal tubule; distal tubule; segment; mesonephros; physiology; acute kidney injury (AKI)
Isolation of Fidelity Variants of RNA Viruses and Characterization of Virus Mutation Frequency
Institutions: Institut Pasteur .
RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3
. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7
Immunology, Issue 52, Polymerase fidelity, RNA virus, mutation frequency, mutagen, RNA polymerase, viral evolution
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1
. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1
. In this article, we utilize a web version of SCOPE2
to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4
and has been used in other studies5-8
The three algorithms that comprise SCOPE are BEAM9
, which finds non-degenerate motifs (ACCGGT), PRISM10
, which finds degenerate motifs (ASCGWT), and SPACER11
, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Interview: HIV-1 Proviral DNA Excision Using an Evolved Recombinase
Institutions: Heinrich-Pette-Institute for Experimental Virology and Immunology, University of Hamburg.
HIV-1 integrates into the host chromosome of infected cells and persists as a provirus flanked by long terminal repeats. Current treatment strategies primarily target virus enzymes or virus-cell fusion, suppressing the viral life cycle without eradicating the infection. Since the integrated provirus is not targeted by these approaches, new resistant strains of HIV-1 may emerge. Here, we report that the engineered recombinase Tre (see Molecular evolution of the Tre recombinase , Buchholz, F., Max Planck Institute for Cell Biology and Genetics, Dresden) efficiently excises integrated HIV-1 proviral DNA from the genome of infected cells. We produced loxLTR containing viral pseudotypes and infected HeLa cells to examine whether Tre recombinase can excise the provirus from the genome of HIV-1 infected human cells. A virus particle-releasing cell line was cloned and transfected with a plasmid expressing Tre or with a parental control vector. Recombinase activity and virus production were monitored. All assays demonstrated the efficient deletion of the provirus from infected cells without visible cytotoxic effects. These results serve as proof of principle that it is possible to evolve a recombinase to specifically target an HIV-1 LTR and that this recombinase is capable of excising the HIV-1 provirus from the genome of HIV-1-infected human cells.
Before an engineered recombinase could enter the therapeutic arena, however, significant obstacles need to be overcome. Among the most critical issues, that we face, are an efficient and safe delivery to targeted cells and the absence of side effects.
Medicine, Issue 16, HIV, Cell Biology, Recombinase, provirus, HeLa Cells