Many researchers, across incredibly diverse foci, are applying phylogenetics to their research question(s). However, many researchers are new to this topic and so it presents inherent problems. Here we compile a practical introduction to phylogenetics for nonexperts. We outline in a step-by-step manner, a pipeline for generating reliable phylogenies from gene sequence datasets. We begin with a user-guide for similarity search tools via online interfaces as well as local executables. Next, we explore programs for generating multiple sequence alignments followed by protocols for using software to determine best-fit models of evolution. We then outline protocols for reconstructing phylogenetic relationships via maximum likelihood and Bayesian criteria and finally describe tools for visualizing phylogenetic trees. While this is not by any means an exhaustive description of phylogenetic approaches, it does provide the reader with practical starting information on key software applications commonly utilized by phylogeneticists. The vision for this article would be that it could serve as a practical training tool for researchers embarking on phylogenetic studies and also serve as an educational resource that could be incorporated into a classroom or teaching-lab.
21 Related JoVE Articles!
A PCR-based Genotyping Method to Distinguish Between Wild-type and Ornamental Varieties of Imperata cylindrica
Institutions: The University of Alabama, Huntsville, Center for Plant Health Science and Technology.
Wild-type I. cylindrica
(cogongrass) is one of the top ten worst invasive plants in the world, negatively impacting agricultural and natural resources in 73 different countries throughout Africa, Asia, Europe, New Zealand, Oceania and the Americas1-2
. Cogongrass forms rapidly-spreading, monodominant stands that displace a large variety of native plant species and in turn threaten the native animals that depend on the displaced native plant species for forage and shelter. To add to the problem, an ornamental variety [I. cylindrica
(Retzius)] is widely marketed under the names of Imperata cylindrica
'Rubra', Red Baron, and Japanese blood grass (JBG). This variety is putatively sterile and noninvasive and is considered a desirable ornamental for its red-colored leaves. However, under the correct conditions, JBG can produce viable seed (Carol Holko, 2009 personal communication) and can revert to a green invasive form that is often indistinguishable from cogongrass as it takes on the distinguishing characteristics of the wild-type invasive variety4
). This makes identification using morphology a difficult task even for well-trained plant taxonomists. Reversion of JBG to an aggressive green phenotype is also not a rare occurrence. Using sequence comparisons of coding and variable regions in both nuclear and chloroplast DNA, we have confirmed that JBG has reverted to the green invasive within the states of Maryland, South Carolina, and Missouri. JBG has been sold and planted in just about every state in the continental U.S. where there is not an active cogongrass infestation. The extent of the revert problem in not well understood because reverted plants are undocumented and often destroyed.
Application of this molecular protocol provides a method to identify JBG reverts and can help keep these varieties from co-occurring and possibly hybridizing. Cogongrass is an obligate outcrosser and, when crossed with a different genotype, can produce viable wind-dispersed seeds that spread cogongrass over wide distances5-7
. JBG has a slightly different genotype than cogongrass and may be able to form viable hybrids with cogongrass. To add to the problem, JBG is more cold and shade tolerant than cogongrass8-10
, and gene flow between these two varieties is likely to generate hybrids that are more aggressive, shade tolerant, and cold hardy than wild-type cogongrass. While wild-type cogongrass currently infests over 490 million hectares worldwide, in the Southeast U.S. it infests over 500,000 hectares and is capable of occupying most of the U.S. as it rapidly spreads northward due to its broad niche and geographic potential3,7,11
. The potential of a genetic crossing is a serious concern for the USDA-APHIS Federal Noxious Week Program. Currently, the USDA-APHIS prohibits JBG in states where there are major cogongrass infestations (e.g., Florida, Alabama, Mississippi). However, preventing the two varieties from combining can prove more difficult as cogongrass and JBG expand their distributions. Furthermore, the distribution of the JBG revert is currently unknown and without the ability to identify these varieties through morphology, some cogongrass infestations may be the result of JBG reverts. Unfortunately, current molecular methods of identification typically rely on AFLP (Amplified Fragment Length Polymorphisms) and DNA sequencing, both of which are time consuming and costly. Here, we present the first cost-effective and reliable PCR-based molecular genotyping method to accurately distinguish between cogongrass and JBG revert.
Molecular Biology, Issue 60, Molecular genotyping, Japanese blood grass, Red Baron, cogongrass, invasive plants
Simultaneous Quantification of T-Cell Receptor Excision Circles (TRECs) and K-Deleting Recombination Excision Circles (KRECs) by Real-time PCR
Institutions: Spedali Civili di Brescia.
T-cell receptor excision circles (TRECs) and K-deleting recombination excision circles (KRECs) are circularized DNA elements formed during recombination process that creates T- and B-cell receptors. Because TRECs and KRECs are unable to replicate, they are diluted after each cell division, and therefore persist in the cell. Their quantity in peripheral blood can be considered as an estimation of thymic and bone marrow output. By combining well established and commonly used TREC assay with a modified version of KREC assay, we have developed a duplex quantitative real-time PCR that allows quantification of both newly-produced T and B lymphocytes in a single assay. The number of TRECs and KRECs are obtained using a standard curve prepared by serially diluting TREC and KREC signal joints cloned in a bacterial plasmid, together with a fragment of T-cell receptor alpha constant gene that serves as reference gene. Results are reported as number of TRECs and KRECs/106
cells or per ml of blood. The quantification of these DNA fragments have been proven useful for monitoring immune reconstitution following bone marrow transplantation in both children and adults, for improved characterization of immune deficiencies, or for better understanding of certain immunomodulating drug activity.
Immunology, Issue 94, B lymphocytes, primary immunodeficiency, real-time PCR, immune recovery, T-cell homeostasis, T lymphocytes, thymic output, bone marrow output
Mouse Genome Engineering Using Designer Nucleases
Institutions: University of Zurich, University of Minnesota.
Transgenic mice carrying site-specific genome modifications (knockout, knock-in) are of vital importance for dissecting complex biological systems as well as for modeling human diseases and testing therapeutic strategies. Recent advances in the use of designer nucleases such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) 9 system for site-specific genome engineering open the possibility to perform rapid targeted genome modification in virtually any laboratory species without the need to rely on embryonic stem (ES) cell technology. A genome editing experiment typically starts with identification of designer nuclease target sites within a gene of interest followed by construction of custom DNA-binding domains to direct nuclease activity to the investigator-defined genomic locus. Designer nuclease plasmids are in vitro
transcribed to generate mRNA for microinjection of fertilized mouse oocytes. Here, we provide a protocol for achieving targeted genome modification by direct injection of TALEN mRNA into fertilized mouse oocytes.
Genetics, Issue 86, Oocyte microinjection, Designer nucleases, ZFN, TALEN, Genome Engineering
Detection of the Genome and Transcripts of a Persistent DNA Virus in Neuronal Tissues by Fluorescent In situ Hybridization Combined with Immunostaining
Institutions: CNRS UMR 5534, Université de Lyon 1, LabEX DEVweCAN, CNRS UPR 3296, CNRS UMR 5286.
Single cell codetection of a gene, its RNA product and cellular regulatory proteins is critical to study gene expression regulation. This is a challenge in the field of virology; in particular for nuclear-replicating persistent DNA viruses that involve animal models for their study. Herpes simplex virus type 1 (HSV-1) establishes a life-long latent infection in peripheral neurons. Latent virus serves as reservoir, from which it reactivates and induces a new herpetic episode. The cell biology of HSV-1 latency remains poorly understood, in part due to the lack of methods to detect HSV-1 genomes in situ
in animal models. We describe a DNA-fluorescent in situ
hybridization (FISH) approach efficiently detecting low-copy viral genomes within sections of neuronal tissues from infected animal models. The method relies on heat-based antigen unmasking, and directly labeled home-made DNA probes, or commercially available probes. We developed a triple staining approach, combining DNA-FISH with RNA-FISH and immunofluorescence, using peroxidase based signal amplification to accommodate each staining requirement. A major improvement is the ability to obtain, within 10 µm tissue sections, low-background signals that can be imaged at high resolution by confocal microscopy and wide-field conventional epifluorescence. Additionally, the triple staining worked with a wide range of antibodies directed against cellular and viral proteins. The complete protocol takes 2.5 days to accommodate antibody and probe penetration within the tissue.
Neuroscience, Issue 83, Life Sciences (General), Virology, Herpes Simplex Virus (HSV), Latency, In situ hybridization, Nuclear organization, Gene expression, Microscopy
A New Approach for the Comparative Analysis of Multiprotein Complexes Based on 15N Metabolic Labeling and Quantitative Mass Spectrometry
Institutions: University of Münster, Carnegie Institution for Science.
The introduced protocol provides a tool for the analysis of multiprotein complexes in the thylakoid membrane, by revealing insights into complex composition under different conditions. In this protocol the approach is demonstrated by comparing the composition of the protein complex responsible for cyclic electron flow (CEF) in Chlamydomonas reinhardtii
, isolated from genetically different strains. The procedure comprises the isolation of thylakoid membranes, followed by their separation into multiprotein complexes by sucrose density gradient centrifugation, SDS-PAGE, immunodetection and comparative, quantitative mass spectrometry (MS) based on differential metabolic labeling (14
N) of the analyzed strains. Detergent solubilized thylakoid membranes are loaded on sucrose density gradients at equal chlorophyll concentration. After ultracentrifugation, the gradients are separated into fractions, which are analyzed by mass-spectrometry based on equal volume. This approach allows the investigation of the composition within the gradient fractions and moreover to analyze the migration behavior of different proteins, especially focusing on ANR1, CAS, and PGRL1. Furthermore, this method is demonstrated by confirming the results with immunoblotting and additionally by supporting the findings from previous studies (the identification and PSI-dependent migration of proteins that were previously described to be part of the CEF-supercomplex such as PGRL1, FNR, and cyt f
). Notably, this approach is applicable to address a broad range of questions for which this protocol can be adopted and e.g.
used for comparative analyses of multiprotein complex composition isolated from distinct environmental conditions.
Microbiology, Issue 85, Sucrose density gradients, Chlamydomonas, multiprotein complexes, 15N metabolic labeling, thylakoids
Thin-layer Chromatographic (TLC) Separations and Bioassays of Plant Extracts to Identify Antimicrobial Compounds
Institutions: United States Department of Agriculture.
A common screen for plant antimicrobial compounds consists of separating plant extracts by paper or thin-layer chromatography (PC or TLC), exposing the chromatograms to microbial suspensions (e.g.
fungi or bacteria in broth or agar), allowing time for the microbes to grow in a humid environment, and visualizing zones with no microbial growth. The effectiveness of this screening method, known as bioautography, depends on both the quality of the chromatographic separation and the care taken with microbial culture conditions. This paper describes standard protocols for TLC and contact bioautography with a novel application to amino acid-fermenting bacteria. The extract is separated on flexible (aluminum-backed) silica TLC plates, and bands are visualized under ultraviolet (UV) light. Zones are cut out and incubated face down onto agar inoculated with the test microorganism. Inhibitory bands are visualized by staining the agar plates with tetrazolium red. The method is applied to the separation of red clover (Trifolium pratense
cv. Kenland) phenolic compounds and their screening for activity against Clostridium sticklandii
, a hyper ammonia-producing bacterium (HAB) that is native to the bovine rumen. The TLC methods apply to many types of plant extracts and other bacterial species (aerobic or anaerobic), as well as fungi, can be used as test organisms if culture conditions are modified to fit the growth requirements of the species.
Chemistry, Issue 85, Thin-layer chromatography, bioautography, anaerobic bacteria, tetrazolium red, phenolic compounds, plant
Affinity-based Isolation of Tagged Nuclei from Drosophila Tissues for Gene Expression Analysis
Institutions: Purdue University.
embryonic and larval tissues often contain a highly heterogeneous mixture of cell types, which can complicate the analysis of gene expression in these tissues. Thus, to analyze cell-specific gene expression profiles from Drosophila
tissues, it may be necessary to isolate specific cell types with high purity and at sufficient yields for downstream applications such as transcriptional profiling and chromatin immunoprecipitation. However, the irregular cellular morphology in tissues such as the central nervous system, coupled with the rare population of specific cell types in these tissues, can pose challenges for traditional methods of cell isolation such as laser microdissection and fluorescence-activated cell sorting (FACS). Here, an alternative approach to characterizing cell-specific gene expression profiles using affinity-based isolation of tagged nuclei, rather than whole cells, is described. Nuclei in the specific cell type of interest are genetically labeled with a nuclear envelope-localized EGFP tag using the Gal4/UAS binary expression system. These EGFP-tagged nuclei can be isolated using antibodies against GFP that are coupled to magnetic beads. The approach described in this protocol enables consistent isolation of nuclei from specific cell types in the Drosophila
larval central nervous system at high purity and at sufficient levels for expression analysis, even when these cell types comprise less than 2% of the total cell population in the tissue. This approach can be used to isolate nuclei from a wide variety of Drosophila
embryonic and larval cell types using specific Gal4 drivers, and may be useful for isolating nuclei from cell types that are not suitable for FACS or laser microdissection.
Biochemistry, Issue 85, Gene Expression, nuclei isolation, Drosophila, KASH, GFP, cell-type specific
Combined DNA-RNA Fluorescent In situ Hybridization (FISH) to Study X Chromosome Inactivation in Differentiated Female Mouse Embryonic Stem Cells
Institutions: Erasmus MC - University Medical Center.
Fluorescent in situ
hybridization (FISH) is a molecular technique which enables the detection of nucleic acids in cells. DNA FISH is often used in cytogenetics and cancer diagnostics, and can detect aberrations of the genome, which often has important clinical implications. RNA FISH can be used to detect RNA molecules in cells and has provided important insights in regulation of gene expression. Combining DNA and RNA FISH within the same cell is technically challenging, as conditions suitable for DNA FISH might be too harsh for fragile, single stranded RNA molecules. We here present an easily applicable protocol which enables the combined, simultaneous detection of Xist
RNA and DNA encoded by the X chromosomes. This combined DNA-RNA FISH protocol can likely be applied to other systems where both RNA and DNA need to be detected.
Biochemistry, Issue 88, Fluorescent in situ hybridization (FISH), combined DNA-RNA FISH, ES cell, cytogenetics, single cell analysis, X chromosome inactivation (XCI), Xist, Bacterial artificial chromosome (BAC), DNA-probe, Rnf12
Combining Magnetic Sorting of Mother Cells and Fluctuation Tests to Analyze Genome Instability During Mitotic Cell Aging in Saccharomyces cerevisiae
Institutions: Rensselaer Polytechnic Institute.
has been an excellent model system for examining mechanisms and consequences of genome instability. Information gained from this yeast model is relevant to many organisms, including humans, since DNA repair and DNA damage response factors are well conserved across diverse species. However, S. cerevisiae
has not yet been used to fully address whether the rate of accumulating mutations changes with increasing replicative (mitotic) age due to technical constraints. For instance, measurements of yeast replicative lifespan through micromanipulation involve very small populations of cells, which prohibit detection of rare mutations. Genetic methods to enrich for mother cells in populations by inducing death of daughter cells have been developed, but population sizes are still limited by the frequency with which random mutations that compromise the selection systems occur. The current protocol takes advantage of magnetic sorting of surface-labeled yeast mother cells to obtain large enough populations of aging mother cells to quantify rare mutations through phenotypic selections. Mutation rates, measured through fluctuation tests, and mutation frequencies are first established for young cells and used to predict the frequency of mutations in mother cells of various replicative ages. Mutation frequencies are then determined for sorted mother cells, and the age of the mother cells is determined using flow cytometry by staining with a fluorescent reagent that detects bud scars formed on their cell surfaces during cell division. Comparison of predicted mutation frequencies based on the number of cell divisions to the frequencies experimentally observed for mother cells of a given replicative age can then identify whether there are age-related changes in the rate of accumulating mutations. Variations of this basic protocol provide the means to investigate the influence of alterations in specific gene functions or specific environmental conditions on mutation accumulation to address mechanisms underlying genome instability during replicative aging.
Microbiology, Issue 92, Aging, mutations, genome instability, Saccharomyces cerevisiae, fluctuation test, magnetic sorting, mother cell, replicative aging
A Rapid and Efficient Method for Assessing Pathogenicity of Ustilago maydis on Maize and Teosinte Lines
Institutions: University of Georgia.
Maize is a major cereal crop worldwide. However, susceptibility to biotrophic pathogens is the primary constraint to increasing productivity. U. maydis
is a biotrophic fungal pathogen and the causal agent of corn smut on maize. This disease is responsible for significant yield losses of approximately $1.0 billion annually in the U.S.1
Several methods including crop rotation, fungicide application and seed treatments are currently used to control corn smut2
. However, host resistance is the only practical method for managing corn smut. Identification of crop plants including maize, wheat, and rice that are resistant to various biotrophic pathogens has significantly decreased yield losses annually3-5
. Therefore, the use of a pathogen inoculation method that efficiently and reproducibly delivers the pathogen in between the plant leaves, would facilitate the rapid identification of maize lines that are resistant to U. maydis
. As, a first step toward indentifying maize lines that are resistant to U. maydis
, a needle injection inoculation method and a resistance reaction screening method was utilized to inoculate maize, teosinte, and maize x teosinte introgression lines with a U. maydis
strain and to select resistant plants.
Maize, teosinte and maize x teosinte introgression lines, consisting of about 700 plants, were planted, inoculated with a strain of U. maydis
, and screened for resistance. The inoculation and screening methods successfully identified three teosinte lines resistant to U. maydis
. Here a detailed needle injection inoculation and resistance reaction screening protocol for maize, teosinte, and maize x teosinte introgression lines is presented. This study demonstrates that needle injection inoculation is an invaluable tool in agriculture that can efficiently deliver U. maydis
in between the plant leaves and has provided plant lines that are resistant to U. maydis
that can now be combined and tested in breeding programs for improved disease resistance.
Environmental Sciences, Issue 83, Bacterial Infections, Signs and Symptoms, Eukaryota, Plant Physiological Phenomena, Ustilago maydis, needle injection inoculation, disease rating scale, plant-pathogen interactions
Simultaneous Multicolor Imaging of Biological Structures with Fluorescence Photoactivation Localization Microscopy
Institutions: University of Maine.
Localization-based super resolution microscopy can be applied to obtain a spatial map (image) of the distribution of individual fluorescently labeled single molecules within a sample with a spatial resolution of tens of nanometers. Using either photoactivatable (PAFP) or photoswitchable (PSFP) fluorescent proteins fused to proteins of interest, or organic dyes conjugated to antibodies or other molecules of interest, fluorescence photoactivation localization microscopy (FPALM) can simultaneously image multiple species of molecules within single cells. By using the following approach, populations of large numbers (thousands to hundreds of thousands) of individual molecules are imaged in single cells and localized with a precision of ~10-30 nm. Data obtained can be applied to understanding the nanoscale spatial distributions of multiple protein types within a cell. One primary advantage of this technique is the dramatic increase in spatial resolution: while diffraction limits resolution to ~200-250 nm in conventional light microscopy, FPALM can image length scales more than an order of magnitude smaller. As many biological hypotheses concern the spatial relationships among different biomolecules, the improved resolution of FPALM can provide insight into questions of cellular organization which have previously been inaccessible to conventional fluorescence microscopy. In addition to detailing the methods for sample preparation and data acquisition, we here describe the optical setup for FPALM. One additional consideration for researchers wishing to do super-resolution microscopy is cost: in-house setups are significantly cheaper than most commercially available imaging machines. Limitations of this technique include the need for optimizing the labeling of molecules of interest within cell samples, and the need for post-processing software to visualize results. We here describe the use of PAFP and PSFP expression to image two protein species in fixed cells. Extension of the technique to living cells is also described.
Basic Protocol, Issue 82, Microscopy, Super-resolution imaging, Multicolor, single molecule, FPALM, Localization microscopy, fluorescent proteins
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Non-radioactive in situ Hybridization Protocol Applicable for Norway Spruce and a Range of Plant Species
Institutions: Uppsala University, Swedish University of Agricultural Sciences.
The high-throughput expression analysis technologies available today give scientists an overflow of expression profiles but their resolution in terms of tissue specific expression is limited because of problems in dissecting individual tissues. Expression data needs to be confirmed and complemented with expression patterns using e.g. in situ
hybridization, a technique used to localize cell specific mRNA expression. The in situ
hybridization method is laborious, time-consuming and often requires extensive optimization depending on species and tissue. In situ
experiments are relatively more difficult to perform in woody species such as the conifer Norway spruce (Picea abies
). Here we present a modified DIG in situ
hybridization protocol, which is fast and applicable on a wide range of plant species including P. abies
. With just a few adjustments, including altered RNase treatment and proteinase K concentration, we could use the protocol to study tissue specific expression of homologous genes in male reproductive organs of one gymnosperm and two angiosperm species; P. abies, Arabidopsis thaliana
and Brassica napus
. The protocol worked equally well for the species and genes studied. AtAP3
were observed in second and third whorl floral organs in A. thaliana
and B. napus
and DAL13 in microsporophylls of male cones from P. abies
. For P. abies
the proteinase K concentration, used to permeablize the tissues, had to be increased to 3 g/ml instead of 1 g/ml, possibly due to more compact tissues and higher levels of phenolics and polysaccharides. For all species the RNase treatment was removed due to reduced signal strength without a corresponding increase in specificity. By comparing tissue specific expression patterns of homologous genes from both flowering plants and a coniferous tree we demonstrate that the DIG in situ
protocol presented here, with only minute adjustments, can be applied to a wide range of plant species. Hence, the protocol avoids both extensive species specific optimization and the laborious use of radioactively labeled probes in favor of DIG labeled probes. We have chosen to illustrate the technically demanding steps of the protocol in our film.
Anna Karlgren and Jenny Carlsson contributed equally to this study.
Corresponding authors: Anna Karlgren at Anna.Karlgren@ebc.uu.se and Jens F. Sundström at Jens.Sundstrom@vbsg.slu.se
Plant Biology, Issue 26, RNA, expression analysis, Norway spruce, Arabidopsis, rapeseed, conifers
The Production of C. elegans Transgenes via Recombineering with the galK Selectable Marker
Institutions: Beth Israel Deaconess Medical Center, Harvard Medical School, University of Pittsburgh.
The creation of transgenic animals is widely utilized in C. elegans
research including the use of GFP fusion proteins to study the regulation and expression pattern of genes of interest or generation of tandem affinity purification (TAP) tagged versions of specific genes to facilitate their purification. Typically transgenes are generated by placing a promoter upstream of a GFP reporter gene or cDNA of interest, and this often produces a representative expression pattern. However, critical elements of gene regulation, such as control elements in the 3' untranslated region or alternative promoters, could be missed by this approach. Further only a single splice variant can be usually studied by this means. In contrast, the use of worm genomic DNA carried by fosmid DNA clones likely includes most if not all elements involved in gene regulation in vivo
which permits the greater ability to capture the genuine expression pattern and timing. To facilitate the generation of transgenes using fosmid DNA, we describe an E. coli
based recombineering procedure to insert GFP, a TAP-tag, or other sequences of interest into any location in the gene. The procedure uses the galK
gene as the selection marker for both the positive and negative selection steps in recombineering which results in obtaining the desired modification with high efficiency. Further, plasmids containing the galK
gene flanked by homology arms to commonly used GFP and TAP fusion genes are available which reduce the cost of oligos by 50% when generating a GFP or TAP fusion protein. These plasmids use the R6K replication origin which precludes the need for extensive PCR product purification. Finally, we also demonstrate a technique to integrate the unc-119
marker on to the fosmid backbone which allows the fosmid to be directly injected or bombarded into worms to generate transgenic animals. This video demonstrates the procedures involved in generating a transgene via recombineering using this method.
Genetics, Issue 47, C. elegans, transgenes, fosmid clone, galK, recombineering, homologous recombination, E. coli
Quantitative Imaging of Lineage-specific Toll-like Receptor-mediated Signaling in Monocytes and Dendritic Cells from Small Samples of Human Blood
Institutions: Yale University School of Medicine .
Individual variations in immune status determine responses to infection and contribute to disease severity and outcome. Aging is associated with an increased susceptibility to viral and bacterial infections and decreased responsiveness to vaccines with a well-documented decline in humoral as well as cell-mediated immune responses1,2
. We have recently assessed the effects of aging on Toll-like receptors (TLRs), key components of the innate immune system that detect microbial infection and trigger antimicrobial host defense responses3
. In a large cohort of healthy human donors, we showed that peripheral blood monocytes from the elderly have decreased expression and function of certain TLRs4
and similar reduced TLR levels and signaling responses in dendritic cells (DCs), antigen-presenting cells that are pivotal in the linkage between innate and adaptive immunity5
. We have shown dysregulation of TLR3 in macrophages and lower production of IFN by DCs from elderly donors in response to infection with West Nile virus6,7
Paramount to our understanding of immunosenescence and to therapeutic intervention is a detailed understanding of specific cell types responding and the mechanism(s) of signal transduction. Traditional studies of immune responses through imaging of primary cells and surveying cell markers by FACS or immunoblot have advanced our understanding significantly, however, these studies are generally limited technically by the small sample volume available from patients and the inability to conduct complex laboratory techniques on multiple human samples. ImageStream combines quantitative flow cytometry with simultaneous high-resolution digital imaging and thus facilitates investigation in multiple cell populations contemporaneously for an efficient capture of patient susceptibility. Here we demonstrate the use of ImageStream in DCs to assess TLR7/8 activation-mediated increases in phosphorylation and nuclear translocation of a key transcription factor, NF-κB, which initiates transcription of numerous genes that are critical for immune responses8
. Using this technology, we have also recently demonstrated a previously unrecognized alteration of TLR5 signaling and the NF-κB pathway in monocytes from older donors that may contribute to altered immune responsiveness in aging9
Immunology, Issue 62, monocyte, dendritic cells, Toll-like receptors, fluorescent imaging, signaling, FACS, aging
The ITS2 Database
Institutions: University of Würzburg, University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1
and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8
The ITS2 Database9
presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11
. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12
(direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13
. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14
search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16
for multiple sequence-structure alignment calculation and Neighbor Joining18
tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
Institutions: Stony Brook University, Cold Spring Harbor Laboratory, University of Texas at Dallas.
ChIPseq is a widely used technique for investigating protein-DNA interactions. Read density profiles are generated by using next-sequencing of protein-bound DNA and aligning the short reads to a reference genome. Enriched regions are revealed as peaks, which often differ dramatically in shape, depending on the target protein1
. For example, transcription factors often bind in a site- and sequence-specific manner and tend to produce punctate peaks, while histone modifications are more pervasive and are characterized by broad, diffuse islands of enrichment2
. Reliably identifying these regions was the focus of our work.
Algorithms for analyzing ChIPseq data have employed various methodologies, from heuristics3-5
to more rigorous statistical models, e.g.
Hidden Markov Models (HMMs)6-8
. We sought a solution that minimized the necessity for difficult-to-define, ad hoc parameters that often compromise resolution and lessen the intuitive usability of the tool. With respect to HMM-based methods, we aimed to curtail parameter estimation procedures and simple, finite state classifications that are often utilized.
Additionally, conventional ChIPseq data analysis involves categorization of the expected read density profiles as either punctate or diffuse followed by subsequent application of the appropriate tool. We further aimed to replace the need for these two distinct models with a single, more versatile model, which can capably address the entire spectrum of data types.
To meet these objectives, we first constructed a statistical framework that naturally modeled ChIPseq data structures using a cutting edge advance in HMMs9
, which utilizes only explicit formulas-an innovation crucial to its performance advantages. More sophisticated then heuristic models, our HMM accommodates infinite hidden states through a Bayesian model. We applied it to identifying reasonable change points in read density, which further define segments of enrichment. Our analysis revealed how our Bayesian Change Point (BCP) algorithm had a reduced computational complexity-evidenced by an abridged run time and memory footprint. The BCP algorithm was successfully applied to both punctate peak and diffuse island identification with robust accuracy and limited user-defined parameters. This illustrated both its versatility and ease of use. Consequently, we believe it can be implemented readily across broad ranges of data types and end users in a manner that is easily compared and contrasted, making it a great tool for ChIPseq data analysis that can aid in collaboration and corroboration between research groups. Here, we demonstrate the application of BCP to existing transcription factor10,11
and epigenetic data12
to illustrate its usefulness.
Genetics, Issue 70, Bioinformatics, Genomics, Molecular Biology, Cellular Biology, Immunology, Chromatin immunoprecipitation, ChIP-Seq, histone modifications, segmentation, Bayesian, Hidden Markov Models, epigenetics
Purification of Transcripts and Metabolites from Drosophila Heads
Institutions: University of Florida , University of Florida , University of Florida , University of Florida .
For the last decade, we have tried to understand the molecular and cellular mechanisms of neuronal degeneration using Drosophila
as a model organism. Although fruit flies provide obvious experimental advantages, research on neurodegenerative diseases has mostly relied on traditional techniques, including genetic interaction, histology, immunofluorescence, and protein biochemistry. These techniques are effective for mechanistic, hypothesis-driven studies, which lead to a detailed understanding of the role of single genes in well-defined biological problems. However, neurodegenerative diseases are highly complex and affect multiple cellular organelles and processes over time. The advent of new technologies and the omics age provides a unique opportunity to understand the global cellular perturbations underlying complex diseases. Flexible model organisms such as Drosophila
are ideal for adapting these new technologies because of their strong annotation and high tractability. One challenge with these small animals, though, is the purification of enough informational molecules (DNA, mRNA, protein, metabolites) from highly relevant tissues such as fly brains. Other challenges consist of collecting large numbers of flies for experimental replicates (critical for statistical robustness) and developing consistent procedures for the purification of high-quality biological material. Here, we describe the procedures for collecting thousands of fly heads and the extraction of transcripts and metabolites to understand how global changes in gene expression and metabolism contribute to neurodegenerative diseases. These procedures are easily scalable and can be applied to the study of proteomic and epigenomic contributions to disease.
Genetics, Issue 73, Biochemistry, Molecular Biology, Neurobiology, Neuroscience, Bioengineering, Cellular Biology, Anatomy, Neurodegenerative Diseases, Biological Assay, Drosophila, fruit fly, head separation, purification, mRNA, RNA, cDNA, DNA, transcripts, metabolites, replicates, SCA3, neurodegeneration, NMR, gene expression, animal model
Establishing Fungal Entomopathogens as Endophytes: Towards Endophytic Biological Control
Institutions: International Center for Tropical Agriculture (CIAT), Cali, Colombia , United States Department of Agriculture, Beltsville, Maryland, USA.
is a fungal entomopathogen with the ability to colonize plants endophytically. As an endophyte, B. bassiana
may play a role in protecting plants from herbivory and disease. This protocol demonstrates two inoculation methods to establish B. bassiana
endophytically in the common bean (Phaseolus vulgaris
), in preparation for subsequent evaluations of endophytic biological control. Plants are grown from surface-sterilized seeds for two weeks before receiving a B. bassiana
treatment of 108
conidia/ml (or water) applied either as a foliar spray or a soil drench. Two weeks later, the plants are harvested and their leaves, stems and roots are sampled to evaluate endophytic fungal colonization. For this, samples are individually surface sterilized, cut into multiple sections, and incubated in potato dextrose agar media for 20 days. The media is inspected every 2-3 days to observe fungal growth associated with plant sections and record the occurrence of B. bassiana
to estimate the extent of its endophytic colonization. Analyses of inoculation success compare the occurrence of B. bassiana
within a given plant part (i.e.
leaves, stems or roots) across treatments and controls. In addition to the inoculation method, the specific outcome of the experiment may depend on the target crop species or variety, the fungal entomopathogen species strain or isolate used, and the plant's growing conditions.
Bioengineering, Issue 74, Plant Biology, Microbiology, Infection, Environmental Sciences, Molecular Biology, Mycology, Entomology, Botany, Pathology, Agriculture, Pest Control, Fungi, Entomopathogen, Endophyte, Pest, Pathogen, Phaseolus vulgaris, Beauveria bassiana, Sustainable Agriculture, hemocytometer, inoculation, fungus
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Electroporation of Mycobacteria
Institutions: Barts and the London School of Medicine and Dentistry, Barts and the London School of Medicine and Dentistry.
High efficiency transformation is a major limitation in the study of mycobacteria. The genus Mycobacterium can be difficult to transform; this is mainly caused by the thick and waxy cell wall, but is compounded by the fact that most molecular techniques have been developed for distantly-related species such as Escherichia coli and Bacillus subtilis. In spite of these obstacles, mycobacterial plasmids have been identified and DNA transformation of many mycobacterial species have now been described. The most successful method for introducing DNA into mycobacteria is electroporation. Many parameters contribute to successful transformation; these include the species/strain, the nature of the transforming DNA, the selectable marker used, the growth medium, and the conditions for the electroporation pulse. Optimized methods for the transformation of both slow- and fast-grower are detailed here. Transformation efficiencies for different mycobacterial species and with various selectable markers are reported.
Microbiology, Issue 15, Springer Protocols, Mycobacteria, Electroporation, Bacterial Transformation, Transformation Efficiency, Bacteria, Tuberculosis, M. Smegmatis, Springer Protocols