Phenotypes are determined by a complex series of physical (e.g. protein-protein) and functional (e.g. gene-gene or genetic) interactions (GI)1. While physical interactions can indicate which bacterial proteins are associated as complexes, they do not necessarily reveal pathway-level functional relationships1. GI screens, in which the growth of double mutants bearing two deleted or inactivated genes is measured and compared to the corresponding single mutants, can illuminate epistatic dependencies between loci and hence provide a means to query and discover novel functional relationships2. Large-scale GI maps have been reported for eukaryotic organisms like yeast3-7, but GI information remains sparse for prokaryotes8, which hinders the functional annotation of bacterial genomes. To this end, we and others have developed high-throughput quantitative bacterial GI screening methods9, 10.
Here, we present the key steps required to perform a quantitative E. coli Synthetic Genetic Array (eSGA) screening procedure on a genome scale9, using natural bacterial conjugation and homologous recombination to systematically generate and measure the fitness of large numbers of double mutants in a colony array format. Briefly, a robot is used to transfer, through conjugation, chloramphenicol (Cm)-marked mutant alleles from engineered Hfr (High frequency of recombination) 'donor strains' into an ordered array of kanamycin (Kan)-marked F- recipient strains. Typically, we use loss-of-function single mutants bearing non-essential gene deletions (e.g. the 'Keio' collection11) and essential gene hypomorphic mutations (i.e. alleles conferring reduced protein expression, stability, or activity9, 12, 13) to query the functional associations of non-essential and essential genes, respectively. After conjugation and ensuing genetic exchange mediated by homologous recombination, the resulting double mutants are selected on solid medium containing both antibiotics. After outgrowth, the plates are digitally imaged and colony sizes are quantitatively scored using an in-house automated image processing system14. GIs are revealed when the growth rate of a double mutant is either significantly better or worse than expected9. Aggravating (or negative) GIs often result between loss-of-function mutations in pairs of genes from compensatory pathways that impinge on the same essential process2. Here, the loss of a single gene is buffered, such that either single mutant is viable. However, the loss of both pathways is deleterious and results in synthetic lethality or sickness (i.e. slow growth).
Conversely, alleviating (or positive) interactions can occur between genes in the same pathway or protein complex2 as the deletion of either gene alone is often sufficient to perturb the normal function of the pathway or complex such that additional perturbations do not reduce activity, and hence growth, further. Overall, systematically identifying and analyzing GI networks can provide unbiased, global maps of the functional relationships between large numbers of genes, from which pathway-level information missed by other approaches can be inferred9.
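The idea of comparing observed and expected double-mutant growth can be illustrated with a short Python sketch. It assumes a multiplicative null model (expected double-mutant fitness equals the product of the two single-mutant fitnesses), which is a common convention in GI studies but is not stated in the abstract; the function names, the example fitness values, and the 0.1 threshold are all hypothetical and do not reproduce the in-house scoring system.

```python
def gi_score(fitness_a, fitness_b, fitness_ab):
    """Observed double-mutant fitness minus the multiplicative expectation.

    Assumption: fitness values are normalized colony sizes (wild type = 1.0)
    and the expected double-mutant fitness is fitness_a * fitness_b.
    """
    expected = fitness_a * fitness_b
    return fitness_ab - expected


def classify_interaction(score, threshold=0.1):
    """Label a score as aggravating, alleviating, or neutral.

    The threshold is an arbitrary illustrative cutoff, not a published value.
    """
    if score <= -threshold:
        return "aggravating"
    if score >= threshold:
        return "alleviating"
    return "neutral"


if __name__ == "__main__":
    # Two mildly sick single mutants whose double mutant barely grows.
    print(classify_interaction(gi_score(0.9, 0.8, 0.2)))
    # Two mutants in the same pathway: the double mutant is no sicker.
    print(classify_interaction(gi_score(0.6, 0.6, 0.6)))
```

In a real screen the cutoff would be replaced by a statistical test over replicate colonies, since the abstract speaks of growth that is "significantly" better or worse than expected.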
26 Related JoVE Articles!
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
Institutions: Stony Brook University, Cold Spring Harbor Laboratory, University of Texas at Dallas.
ChIPseq is a widely used technique for investigating protein-DNA interactions. Read density profiles are generated by next-generation sequencing of protein-bound DNA and aligning the short reads to a reference genome. Enriched regions are revealed as peaks, which often differ dramatically in shape, depending on the target protein1. For example, transcription factors often bind in a site- and sequence-specific manner and tend to produce punctate peaks, while histone modifications are more pervasive and are characterized by broad, diffuse islands of enrichment2. Reliably identifying these regions was the focus of our work.
Algorithms for analyzing ChIPseq data have employed various methodologies, from heuristics3-5 to more rigorous statistical models, e.g. Hidden Markov Models (HMMs)6-8. We sought a solution that minimized the necessity for difficult-to-define, ad hoc parameters that often compromise resolution and lessen the intuitive usability of the tool. With respect to HMM-based methods, we aimed to curtail parameter estimation procedures and simple, finite state classifications that are often utilized.
Additionally, conventional ChIPseq data analysis involves categorization of the expected read density profiles as either punctate or diffuse followed by subsequent application of the appropriate tool. We further aimed to replace the need for these two distinct models with a single, more versatile model, which can capably address the entire spectrum of data types.
To meet these objectives, we first constructed a statistical framework that naturally modeled ChIPseq data structures using a cutting edge advance in HMMs9, which utilizes only explicit formulas, an innovation crucial to its performance advantages. More sophisticated than heuristic models, our HMM accommodates infinite hidden states through a Bayesian model. We applied it to identifying reasonable change points in read density, which further define segments of enrichment. Our analysis revealed that our Bayesian Change Point (BCP) algorithm had reduced computational complexity, evidenced by an abridged run time and memory footprint. The BCP algorithm was successfully applied to both punctate peak and diffuse island identification with robust accuracy and limited user-defined parameters. This illustrated both its versatility and ease of use. Consequently, we believe it can be implemented readily across broad ranges of data types and end users in a manner that is easily compared and contrasted, making it a great tool for ChIPseq data analysis that can aid in collaboration and corroboration between research groups. Here, we demonstrate the application of BCP to existing transcription factor10,11 and epigenetic data12 to illustrate its usefulness.
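To make the notion of a change point in read density concrete, the following Python sketch finds the single best split of a list of binned read counts by maximizing a Poisson log-likelihood. This is a deliberately simplified illustration and not the BCP algorithm: it assumes exactly one change point, Poisson-distributed counts, and no Bayesian prior or infinite-state HMM. The function names and the example counts are hypothetical.

```python
import math


def poisson_loglik(counts):
    """Maximized Poisson log-likelihood of one segment.

    The log(k!) terms are dropped because they are identical for every
    candidate split and therefore do not affect the comparison.
    """
    total = sum(counts)
    if total == 0:
        return 0.0
    rate = total / len(counts)  # maximum-likelihood rate for the segment
    return total * math.log(rate) - total


def best_change_point(counts):
    """Return the index i that best splits counts into counts[:i] and counts[i:]."""
    best_index, best_ll = None, -math.inf
    for i in range(1, len(counts)):
        ll = poisson_loglik(counts[:i]) + poisson_loglik(counts[i:])
        if ll > best_ll:
            best_index, best_ll = i, ll
    return best_index


if __name__ == "__main__":
    # Background bins followed by an enriched region (made-up numbers).
    binned_reads = [2, 1, 3, 2, 20, 22, 19, 21]
    print(best_change_point(binned_reads))
```

A full segmentation would need to handle many change points and decide how many are supported by the data, which is the part the Bayesian model described above is designed to address.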
Genetics, Issue 70, Bioinformatics, Genomics, Molecular Biology, Cellular Biology, Immunology, Chromatin immunoprecipitation, ChIP-Seq, histone modifications, segmentation, Bayesian, Hidden Markov Models, epigenetics
Determining Genetic Expression Profiles in C. elegans Using Microarray and Real-time PCR
Institutions: Southwestern Oklahoma State University.
Synapses are composed of a presynaptic active zone in the signaling cell and a postsynaptic terminal in the target cell. In the case of chemical synapses, messages are carried by neurotransmitters released from presynaptic terminals and received by receptors on postsynaptic cells. Our previous research in Caenorhabditis elegans has shown that VSM-1 negatively regulates exocytosis. Additionally, analysis of synapses in vsm-1 mutants showed that animals lacking a fully functional VSM-1 have increased synaptic connectivity. Based on these preliminary findings, we hypothesized that C. elegans VSM-1 may play a crucial role in synaptogenesis. To test this hypothesis, double-labeled microarray analysis was performed, and gene expression profiles were determined. First, total RNA was isolated, reverse transcribed to cDNA, and hybridized to the DNA microarrays. Then, in silico analysis of fluorescent probe hybridization revealed significant induction of many genes coding for members of the major sperm protein (MSP) family in mutants with enhanced synaptogenesis. MSPs are the major component of sperm in C. elegans and appear to signal nematode oocyte maturation and ovulation. In fruit flies, Chai and colleagues1 demonstrated that MSP-like molecules regulate presynaptic bouton number and size at the neuromuscular junction. Moreover, analysis performed by Tsuda and coworkers2 suggested that MSPs may act as ligands for Eph receptors and trigger receptor tyrosine kinase signaling cascades. Lastly, real-time PCR analysis corroborated that the gene coding for MSP-32 is induced in vsm-1(ok1468) mutants. Taken together, research performed by our laboratory has shown that vsm-1 mutants have a significant increase in synaptic density, which could be mediated by MSP-32 signaling.
Molecular Biology, Issue 53, microarray, C. elegans, real-time PCR, neuroscience
Isolation of mRNAs Associated with Yeast Mitochondria to Study Mechanisms of Localized Translation
Institutions: Technion - Israel Institute of Technology.
Most mitochondrial proteins are encoded in the nucleus and need to be imported into the organelle. Import may occur while the protein is synthesized near the mitochondria. Support for this possibility is derived from recent studies, in which many mRNAs encoding mitochondrial proteins were shown to be localized to the vicinity of mitochondria. Together with earlier demonstrations of ribosomes' association with the outer membrane, these results suggest a localized translation process. Such localized translation may improve import efficiency, provide unique regulation sites and minimize cases of ectopic expression. Diverse methods have been used to characterize the factors and elements that mediate localized translation. Standard among these is subcellular fractionation by differential centrifugation. This protocol has the advantage of isolating mRNAs, ribosomes and proteins in a single procedure. These can then be characterized by various molecular and biochemical methods. Furthermore, transcriptomics and proteomics methods can be applied to the resulting material, thereby allowing genome-wide insights. The utilization of yeast as a model organism for such studies has the advantages of speed, cost and simplicity. Furthermore, the advanced genetic tools and available deletion strains facilitate verification of candidate factors.
Biochemistry, Issue 85, mitochondria, mRNA localization, Yeast, S. cerevisiae, microarray, localized translation, biochemical fractionation
Methods to Assess Subcellular Compartments of Muscle in C. elegans
Institutions: University of Nottingham.
Muscle is a dynamic tissue that responds to changes in nutrition, exercise, and disease state. The loss of muscle mass and function with disease and age is a significant public health burden. We currently understand little about the genetic regulation of muscle health with disease or age. The nematode C. elegans is an established model for understanding the genomic regulation of biological processes of interest. This worm's body wall muscles display a large degree of homology with the muscles of higher metazoan species. Since C. elegans is a transparent organism, the localization of GFP to mitochondria and sarcomeres allows visualization of these structures in vivo. Similarly, feeding animals cationic dyes, which accumulate based on the existence of a mitochondrial membrane potential, allows the assessment of mitochondrial function in vivo. These methods, as well as assessment of muscle protein homeostasis, are combined with assessment of whole animal muscle function, in the form of movement assays, to allow correlation of sub-cellular defects with functional measures of muscle performance. Thus, C. elegans provides a powerful platform with which to assess the impact of mutations, gene knockdown, and/or chemical compounds upon muscle structure and function. Lastly, as GFP, cationic dyes, and movement assays are assessed non-invasively, prospective studies of muscle structure and function can be conducted across the whole life course, which at present cannot easily be done in vivo in any other organism.
Developmental Biology, Issue 93, Physiology, C. elegans, muscle, mitochondria, sarcomeres, ageing
Aplysia Ganglia Preparation for Electrophysiological and Molecular Analyses of Single Neurons
Institutions: The Scripps Research Institute, Florida.
A major challenge in neurobiology is to understand the molecular underpinnings of the neural circuitry that governs a specific behavior. Once the specific molecular mechanisms are identified, new therapeutic strategies can be developed to treat abnormalities in specific behaviors caused by degenerative diseases or aging of the nervous system. The marine snail Aplysia californica is well suited for investigating the cellular and molecular basis of behavior because the neural circuitry underlying a specific behavior can be easily determined and the individual components of the circuitry can be easily manipulated. These advantages of Aplysia have led to several fundamental discoveries in the neurobiology of learning and memory. Here we describe a preparation of the Aplysia nervous system for the electrophysiological and molecular analyses of individual neurons. Briefly, a ganglion dissected from the nervous system is exposed to protease to remove the ganglion sheath such that neurons are exposed but retain neuronal activity as in the intact animal. This preparation is used to carry out electrophysiological measurements of single or multiple neurons. Importantly, following the recording, the neurons can be isolated directly from the ganglia using a simple methodology for gene expression analysis. These protocols were used to carry out simultaneous electrophysiological recordings from L7 and R15 neurons, to study their response to acetylcholine, and to quantify expression of the CREB1 gene in isolated single L7, L11, R15, and R2 neurons of Aplysia.
Neurobiology, Issue 83, intracellular recording, identified neuron, neural circuitry, gene expression, action potential, CREB, Aplysia californica, genomics
Detection of Alternative Splicing During Epithelial-Mesenchymal Transition
Institutions: Northwestern University Feinberg School of Medicine.
Alternative splicing plays a critical role in the epithelial-mesenchymal transition (EMT), an essential cellular program that occurs in various physiological and pathological processes. Here we describe a strategy to detect alternative splicing during EMT using an inducible EMT model by expressing the transcription repressor Twist. EMT is monitored by changes in cell morphology, loss of E-cadherin localization at cell-cell junctions, and the switched expression of EMT markers, such as loss of epithelial markers E-cadherin and γ-catenin and gain of mesenchymal markers N-cadherin and vimentin. Using isoform-specific primer sets, the alternative splicing of mRNAs of interest is analyzed by quantitative RT-PCR. The production of corresponding protein isoforms is validated by immunoblotting assays. The method of detecting splice isoforms described here is also suitable for the study of alternative splicing in other biological processes.
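Quantitative RT-PCR readouts for isoform-specific primer sets are often summarized as a fold change between conditions. The Python sketch below uses the widely used 2^-ΔΔCt calculation as an illustration; the abstract does not state which quantification method the authors use, so this is an assumption, as are the function name, the argument names, and the example Ct values. The calculation also assumes roughly 100% amplification efficiency for both primer sets.

```python
def fold_change_ddct(ct_isoform_induced, ct_reference_induced,
                     ct_isoform_control, ct_reference_control):
    """Relative expression of one splice isoform by the 2^-ddCt method.

    Each Ct is a threshold cycle; the reference is a housekeeping gene
    (or a constitutive exon) measured in the same sample.
    """
    delta_induced = ct_isoform_induced - ct_reference_induced
    delta_control = ct_isoform_control - ct_reference_control
    delta_delta = delta_induced - delta_control
    return 2 ** -delta_delta


if __name__ == "__main__":
    # Hypothetical values: the isoform crosses threshold 3 cycles earlier
    # (relative to the reference) after EMT induction.
    print(fold_change_ddct(22.0, 18.0, 25.0, 18.0))
```

Running the same calculation for each isoform-specific primer set gives a per-isoform fold change, from which a shift in isoform usage during EMT can be read off.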
Cellular Biology, Issue 92, alternative splicing, EMT, RNA, primer design, real time PCR, splice isoforms
Microarray-based Identification of Individual HERV Loci Expression: Application to Biomarker Discovery in Prostate Cancer
Institutions: Joint Unit Hospices de Lyon-bioMérieux, BioMérieux, Hospices Civils de Lyon, Lyon 1 University.
The prostate-specific antigen (PSA) is the main diagnostic biomarker for prostate cancer in clinical use, but it lacks specificity and sensitivity, particularly in low dosage values1. 'How to use PSA' remains an open question, both for diagnosis, where a gray zone at serum concentrations of 2.5-10 ng/ml does not allow a clear distinction between cancer and noncancer2, and for patient follow-up, where analysis of post-operative PSA kinetic parameters can pose considerable challenges in practice3,4. Alternatively, noncoding RNAs (ncRNAs) are emerging as key molecules in human cancer, with the potential to serve as novel markers of disease, e.g. PCA3 in prostate cancer5,6, and to reveal uncharacterized aspects of tumor biology. Moreover, data from the ENCODE project published in 2012 showed that different RNA types cover about 62% of the genome. It also appears that the number of transcriptional regulatory motifs is at least 4.5x higher than the number corresponding to protein-coding exons. Thus, long terminal repeats (LTRs) of human endogenous retroviruses (HERVs) constitute a wide range of putative/candidate transcriptional regulatory sequences, as it is their primary function in infectious retroviruses. HERVs, which are spread throughout the human genome, originate from ancestral and independent infections within the germ line, followed by copy-paste propagation processes and leading to multicopy families occupying 8% of the human genome (note that exons span 2% of our genome). Some HERV loci still express proteins that have been associated with several pathologies including cancer7-10. We have designed a high-density microarray, in Affymetrix format, aiming to optimally characterize individual HERV loci expression, in order to better understand whether they can be active, if they drive ncRNA transcription or modulate coding gene expression. This tool has been applied in the prostate cancer field (Figure 1).
Medicine, Issue 81, Cancer Biology, Genetics, Molecular Biology, Prostate, Retroviridae, Biomarkers, Pharmacological, Tumor Markers, Biological, Prostatectomy, Microarray Analysis, Gene Expression, Diagnosis, Human Endogenous Retroviruses, HERV, microarray, Transcriptome, prostate cancer, Affymetrix
Assessment of Mitochondrial Functions and Cell Viability in Renal Cells Overexpressing Protein Kinase C Isozymes
Institutions: University of Arkansas for Medical Sciences.
The protein kinase C (PKC) family of isozymes is involved in numerous physiological and pathological processes. Our recent data demonstrate that PKC regulates mitochondrial function and cellular energy status. Numerous reports demonstrated that the activation of PKC-α and PKC-ε improves mitochondrial function in the ischemic heart and mediates cardioprotection. In contrast, we have demonstrated that PKC-α and PKC-ε are involved in nephrotoxicant-induced mitochondrial dysfunction and cell death in kidney cells. Therefore, the goal of this study was to develop an in vitro model of renal cells maintaining active mitochondrial functions in which PKC isozymes could be selectively activated or inhibited to determine their role in regulation of oxidative phosphorylation and cell survival. Primary cultures of renal proximal tubular cells (RPTC) were cultured in improved conditions resulting in mitochondrial respiration and activity of mitochondrial enzymes similar to those in RPTC in vivo. Because traditional transfection techniques (Lipofectamine, electroporation) are inefficient in primary cultures and have adverse effects on mitochondrial function, PKC-ε mutant cDNAs were delivered to RPTC through adenoviral vectors. This approach results in transfection of over 90% of cultured RPTC.
Here, we present methods for assessing the role of PKC-ε in: 1. regulation of mitochondrial morphology and functions associated with ATP synthesis, and 2. survival of RPTC in primary culture. PKC-ε is activated by overexpressing the constitutively active PKC-ε mutant. PKC-ε is inhibited by overexpressing the inactive mutant of PKC-ε. Mitochondrial function is assessed by examining respiration, integrity of the respiratory chain, activities of respiratory complexes and F0F1-ATPase, ATP production rate, and ATP content. Respiration is assessed in digitonin-permeabilized RPTC as state 3 (maximum respiration in the presence of excess substrates and ADP) and uncoupled respirations. Integrity of the respiratory chain is assessed by measuring activities of all four complexes of the respiratory chain in isolated mitochondria. Capacity of oxidative phosphorylation is evaluated by measuring the mitochondrial membrane potential, ATP production rate, and activity of F0F1-ATPase. Energy status of RPTC is assessed by determining the intracellular ATP content. Mitochondrial morphology in live cells is visualized using MitoTracker Red 580, a fluorescent dye that specifically accumulates in mitochondria, and live monolayers are examined under a fluorescent microscope. RPTC viability is assessed using annexin V/propidium iodide staining followed by flow cytometry to determine apoptosis and oncosis.
These methods allow for a selective activation/inhibition of individual PKC isozymes to assess their role in cellular functions in a variety of physiological and pathological conditions that can be reproduced in vitro.
Cellular Biology, Issue 71, Biochemistry, Molecular Biology, Genetics, Pharmacology, Physiology, Medicine, Protein, Mitochondrial dysfunction, mitochondria, protein kinase C, renal proximal tubular cells, reactive oxygen species, oxygen consumption, electron transport chain, respiratory complexes, ATP, adenovirus, primary culture, ischemia, cells, flow cytometry
Genomic MRI - a Public Resource for Studying Sequence Patterns within Genomic DNA
Institutions: University of Toledo Health Science Campus.
Non-coding genomic regions in complex eukaryotes, including intergenic areas, introns, and untranslated segments of exons, are profoundly non-random in their nucleotide composition and consist of a complex mosaic of sequence patterns. These patterns include so-called Mid-Range Inhomogeneity (MRI) regions -- sequences 30-10000 nucleotides in length that are enriched by a particular base or combination of bases (e.g. (G+T)-rich, purine-rich, etc.). MRI regions are associated with unusual (non-B-form) DNA structures that are often involved in regulation of gene expression, recombination, and other genetic processes (Fedorova & Fedorov 2010). The existence of a strong fixation bias within MRI regions against mutations that tend to reduce their sequence inhomogeneity additionally supports the functionality and importance of these genomic sequences (Prakash et al.
Here we demonstrate a freely available Internet resource -- the Genomic MRI program package -- designed for computational analysis of genomic sequences in order to find and characterize various MRI patterns within them (Bechtel et al. 2008). This package also allows generation of randomized sequences with various properties and level of correspondence to the natural input DNA sequences. The main goal of this resource is to facilitate examination of vast regions of non-coding DNA that are still scarcely investigated and await thorough exploration and recognition.
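As a rough illustration of what searching for base-enriched regions involves, the Python sketch below slides a fixed-length window along a sequence and reports windows in which a chosen set of bases exceeds a content threshold. It is not the Genomic MRI package's algorithm: the function name, the default window of 30 nucleotides (the lower bound of the MRI length range quoted above), and the 0.8 threshold are assumptions chosen for the example.

```python
def enriched_windows(sequence, bases="GT", window=30, threshold=0.8):
    """Return (start, fraction) for every window enriched in the given bases.

    start is a 0-based offset; fraction is the share of window positions
    occupied by any base in `bases`.
    """
    sequence = sequence.upper()
    targets = set(bases.upper())
    hits = []
    for start in range(len(sequence) - window + 1):
        segment = sequence[start:start + window]
        fraction = sum(1 for base in segment if base in targets) / window
        if fraction >= threshold:
            hits.append((start, fraction))
    return hits


if __name__ == "__main__":
    # A made-up sequence: a 30-nt (G+T)-only stretch flanked by AC repeats.
    toy = "ACAC" * 5 + "GT" * 15 + "ACAC" * 5
    print(enriched_windows(toy, bases="GT", window=30, threshold=1.0))
```

Comparing the number of hits in a natural sequence with the number found in randomized sequences of matching composition is one way to judge whether an enrichment is unusual, which is the role the randomized-sequence generator plays in the package described above.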
Genetics, Issue 51, bioinformatics, computational biology, genomics, non-randomness, signals, gene regulation, DNA conformation
The ITS2 Database
Institutions: University of Würzburg.
The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1 and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8.
The ITS2 Database9 presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12 (direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence that could not be folded correctly in a direct fold.
The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14 search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16 for multiple sequence-structure alignment calculation and Neighbor Joining18 tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure.
In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.
Genetics, Issue 61, alignment, internal transcribed spacer 2, molecular systematics, secondary structure, ribosomal RNA, phylogenetic tree, homology modeling, phylogeny
Effect of Male Accessory Gland Products on Egg Laying in Gastropod Molluscs
Institutions: VU University.
In internally fertilizing animals, seminal fluid is usually added to the spermatozoa, together forming the semen or ejaculate. Besides nourishing and activating sperm, the components in the seminal fluid can also influence female physiology to augment fertilization success of the sperm donor. While many studies have reported such effects in species with separate sexes, few studies have addressed this in simultaneously hermaphroditic animals. This video protocol presents a method to study effects of seminal fluid in gastropods, using a simultaneously hermaphroditic freshwater snail, the great pond snail Lymnaea stagnalis, as model organism. While the procedure is shown using complete prostate gland extracts, individual components (i.e., proteins, peptides, and other compounds) of the seminal fluid can be tested in the same way. Effects of the receipt of ejaculate components on egg laying can be quantified in terms of frequency of egg laying and more subtle estimates of female reproductive performance such as egg numbers within each egg mass. Results show that seminal fluid proteins affect female reproductive output in this simultaneous hermaphrodite, highlighting their importance for sexual selection.
Physiology, Issue 88, Allohormone, Fresh-water snail, Gastropod, Lymnaea stagnalis, Mollusc, Pond snail, Prostate, Semen, Seminal fluid, Sexual selection, Sperm
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro replication of HIV-1 as influenced by the gag gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro replication of chronically derived gag-pro sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
A Practical Guide to Phylogenetics for Nonexperts
Institutions: The George Washington University.
Many researchers, across incredibly diverse foci, are applying phylogenetics to their research question(s). However, many researchers are new to this topic and so it presents inherent problems. Here we compile a practical introduction to phylogenetics for nonexperts. We outline in a step-by-step manner, a pipeline for generating reliable phylogenies from gene sequence datasets. We begin with a user-guide for similarity search tools via online interfaces as well as local executables. Next, we explore programs for generating multiple sequence alignments followed by protocols for using software to determine best-fit models of evolution. We then outline protocols for reconstructing phylogenetic relationships via maximum likelihood and Bayesian criteria and finally describe tools for visualizing phylogenetic trees. While this is not by any means an exhaustive description of phylogenetic approaches, it does provide the reader with practical starting information on key software applications commonly utilized by phylogeneticists. The vision for this article would be that it could serve as a practical training tool for researchers embarking on phylogenetic studies and also serve as an educational resource that could be incorporated into a classroom or teaching-lab.
Basic Protocol, Issue 84, phylogenetics, multiple sequence alignments, phylogenetic tree, BLAST executables, basic local alignment search tool, Bayesian models
Isolation of Fidelity Variants of RNA Viruses and Characterization of Virus Mutation Frequency
Institutions: Institut Pasteur.
RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7.
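A mutation frequency from sequenced clones is, at its simplest, a count of mutations divided by the number of nucleotides sequenced. The Python sketch below shows that arithmetic, expressed per 10,000 nucleotides. The scaling factor, the function name, and the example numbers are assumptions for illustration; the abstract does not specify how the authors normalize or test their data.

```python
def mutation_frequency(mutations_per_clone, clone_length_nt, per=10_000):
    """Mutations per `per` nucleotides sequenced.

    mutations_per_clone: one integer per sequenced clone.
    clone_length_nt: length of the sequenced region, assumed equal for all clones.
    """
    if not mutations_per_clone or clone_length_nt <= 0:
        raise ValueError("need at least one clone and a positive length")
    total_mutations = sum(mutations_per_clone)
    total_nucleotides = len(mutations_per_clone) * clone_length_nt
    return total_mutations / total_nucleotides * per


if __name__ == "__main__":
    # Hypothetical data: 100 clones of 800 nt, 16 of which carry one mutation.
    clones = [1] * 16 + [0] * 84
    print(mutation_frequency(clones, 800))
```

Comparing this value between a wild-type virus and a candidate fidelity variant, with enough clones per virus for a statistical test, is the kind of comparison the abstract refers to when it mentions data of statistical significance.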
Immunology, Issue 52, Polymerase fidelity, RNA virus, mutation frequency, mutagen, RNA polymerase, viral evolution
Metabolic Labeling of Newly Transcribed RNA for High Resolution Gene Expression Profiling of RNA Synthesis, Processing and Decay in Cell Culture
Institutions: Max von Pettenkofer Institute, University of Cambridge, Ludwig-Maximilians-University Munich.
The development of whole-transcriptome microarrays and next-generation sequencing has revolutionized our understanding of the complexity of cellular gene expression. Along with a better understanding of the involved molecular mechanisms, precise measurements of the underlying kinetics have become increasingly important. Here, these powerful methodologies face major limitations due to intrinsic properties of the template samples they study, i.e. total cellular RNA. In many cases changes in total cellular RNA occur either too slowly or too quickly to represent the underlying molecular events and their kinetics with sufficient resolution. In addition, the contributions of alterations in RNA synthesis, processing, and decay are not readily differentiated.
We recently developed high-resolution gene expression profiling to overcome these limitations. Our approach is based on metabolic labeling of newly transcribed RNA with 4-thiouridine (thus also referred to as 4sU-tagging) followed by rigorous purification of newly transcribed RNA using thiol-specific biotinylation and streptavidin-coated magnetic beads. It is applicable to a broad range of organisms including vertebrates, Drosophila, and yeast. We successfully applied 4sU-tagging to study real-time kinetics of transcription factor activities, provide precise measurements of RNA half-lives, and obtain novel insights into the kinetics of RNA processing. Finally, computational modeling can be employed to generate an integrated, comprehensive analysis of the underlying molecular mechanisms.
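To make the half-life measurement concrete, here is a minimal sketch under simplifying assumptions that the abstract does not state: transcript levels are at steady state, decay is first order, and cell growth during labeling is ignored. Under those assumptions the fraction R of a transcript that is newly transcribed after a labeling time t satisfies R = 1 - exp(-kt), so the half-life is -t ln 2 / ln(1 - R). The function and parameter names are hypothetical.

```python
import math


def rna_half_life(new_to_total_ratio, labeling_time):
    """Half-life (same units as labeling_time) from the newly transcribed/total RNA ratio.

    Assumes steady state and first-order decay: R = 1 - exp(-k * t).
    """
    if not 0 < new_to_total_ratio < 1:
        raise ValueError("ratio must lie strictly between 0 and 1")
    decay_rate = -math.log(1 - new_to_total_ratio) / labeling_time
    return math.log(2) / decay_rate


# If half of a transcript is newly made after 60 min of labeling,
# its half-life under these assumptions is also 60 min.
print(rna_half_life(0.5, 60))
```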
Genetics, Issue 78, Cellular Biology, Molecular Biology, Microbiology, Biochemistry, Eukaryota, Investigative Techniques, Biological Phenomena, Gene expression profiling, RNA synthesis, RNA processing, RNA decay, 4-thiouridine, 4sU-tagging, microarray analysis, RNA-seq, RNA, DNA, PCR, sequencing
Polymerase Chain Reaction: Basic Protocol Plus Troubleshooting and Optimization Strategies
Institutions: University of California, Los Angeles.
In the biological sciences there have been technological advances that catapult the discipline into golden ages of discovery. For example, the field of microbiology was transformed with the advent of Anton van Leeuwenhoek's microscope, which allowed scientists to visualize prokaryotes for the first time. The development of the polymerase chain reaction (PCR) is one of those innovations that changed the course of molecular science with its impact spanning countless subdisciplines in biology. The theoretical process was outlined by Kleppe and coworkers in 1971; however, it was another 14 years until the complete PCR procedure was described and experimentally applied by Kary Mullis while at Cetus Corporation in 1985. Automation and refinement of this technique progressed with the introduction of a thermostable DNA polymerase from the bacterium Thermus aquaticus, hence the name Taq DNA polymerase.
PCR is a powerful amplification technique that can generate an ample supply of a specific segment of DNA (i.e., an amplicon) from only a small amount of starting material (i.e., DNA template or target sequence). While straightforward and generally trouble-free, PCR has pitfalls that can complicate the reaction and produce spurious results. When PCR fails it can lead to many non-specific DNA products of varying sizes that appear as a ladder or smear of bands on agarose gels. Sometimes no products form at all. Another potential problem occurs when mutations are unintentionally introduced in the amplicons, resulting in a heterogeneous population of PCR products. PCR failures can become frustrating unless patience and careful troubleshooting are employed to sort out and solve the problem(s). This protocol outlines the basic principles of PCR, provides a methodology that will result in amplification of most target sequences, and presents strategies for optimizing a reaction. By following this PCR guide, students should be able to:
● Set up reactions and thermal cycling conditions for a conventional PCR experiment
● Understand the function of various reaction components and their overall effect on a PCR experiment
● Design and optimize a PCR experiment for any DNA template
● Troubleshoot failed PCR experiments
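Primer melting temperature is one of the parameters commonly adjusted when optimizing a reaction. As a minimal sketch, the Wallace rule (2 °C per A or T, 4 °C per G or C) gives a rough estimate for short oligonucleotides. It is a generic rule of thumb, not necessarily the calculation used in this protocol, and the function name is hypothetical.

```python
def wallace_tm(primer):
    """Rough melting temperature in degrees C for a short primer (Wallace rule)."""
    seq = primer.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("primer must contain only A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Made-up 18-mer: 9 A/T and 9 G/C bases.
print(wallace_tm("ATGCATGCATGCATGCAT"))
```

More accurate estimates account for salt concentration and nearest-neighbor effects, which this sketch deliberately omits.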
Basic Protocols, Issue 63, PCR, optimization, primer design, melting temperature, Tm, troubleshooting, additives, enhancers, template DNA quantification, thermal cycler, molecular biology, genetics
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g. drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism1,2. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence3.
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recently published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin,"4 in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets for those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSC self-renewal and differentiation is important for the application of HSCs in research and clinical uses. However, it is not possible to obtain large quantities of HSCs due to their inability to proliferate in vitro. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro and in vivo.
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
Non-radioactive in situ Hybridization Protocol Applicable for Norway Spruce and a Range of Plant Species
Institutions: Uppsala University, Swedish University of Agricultural Sciences.
The high-throughput expression analysis technologies available today give scientists an abundance of expression profiles, but their resolution in terms of tissue-specific expression is limited because of problems in dissecting individual tissues. Expression data need to be confirmed and complemented with expression patterns using, e.g., in situ hybridization, a technique used to localize cell-specific mRNA expression. The in situ hybridization method is laborious, time-consuming and often requires extensive optimization depending on species and tissue. In situ experiments are relatively more difficult to perform in woody species such as the conifer Norway spruce (Picea abies). Here we present a modified DIG in situ hybridization protocol, which is fast and applicable to a wide range of plant species including P. abies. With just a few adjustments, including altered RNase treatment and proteinase K concentration, we could use the protocol to study tissue-specific expression of homologous genes in male reproductive organs of one gymnosperm and two angiosperm species: P. abies, Arabidopsis thaliana and Brassica napus. The protocol worked equally well for the species and genes studied. AtAP3 were observed in second and third whorl floral organs in A. thaliana and B. napus and DAL13 in microsporophylls of male cones from P. abies. For P. abies the proteinase K concentration, used to permeabilize the tissues, had to be increased to 3 µg/ml instead of 1 µg/ml, possibly due to more compact tissues and higher levels of phenolics and polysaccharides. For all species the RNase treatment was removed due to reduced signal strength without a corresponding increase in specificity. By comparing tissue-specific expression patterns of homologous genes from both flowering plants and a coniferous tree we demonstrate that the DIG in situ protocol presented here, with only minute adjustments, can be applied to a wide range of plant species. Hence, the protocol avoids both extensive species-specific optimization and the laborious use of radioactively labeled probes in favor of DIG-labeled probes. We have chosen to illustrate the technically demanding steps of the protocol in our film.
Anna Karlgren and Jenny Carlsson contributed equally to this study.
Corresponding authors: Anna Karlgren at Anna.Karlgren@ebc.uu.se and Jens F. Sundström at Jens.Sundstrom@vbsg.slu.se
Plant Biology, Issue 26, RNA, expression analysis, Norway spruce, Arabidopsis, rapeseed, conifers
DNA-affinity-purified Chip (DAP-chip) Method to Determine Gene Targets for Bacterial Two component Regulatory Systems
Institutions: Lawrence Berkeley National Laboratory.
In vivo methods such as ChIP-chip are well-established techniques used to determine global gene targets for transcription factors. However, they are of limited use in exploring bacterial two component regulatory systems with uncharacterized activation conditions. Such systems regulate transcription only when activated in the presence of unique signals. Since these signals are often unknown, the in vitro microarray-based method described in this video article can be used to determine gene targets and binding sites for response regulators. This DNA-affinity-purified-chip method may be used for any purified regulator in any organism with a sequenced genome. The protocol involves allowing the purified tagged protein to bind to sheared genomic DNA and then affinity purifying the protein-bound DNA, followed by fluorescent labeling of the DNA and hybridization to a custom tiling array. Preceding steps that may be used to optimize the assay for specific regulators are also described. The peaks generated by the array data analysis are used to predict binding site motifs, which are then experimentally validated. The motif predictions can be further used to determine gene targets of orthologous response regulators in closely related species. We demonstrate the applicability of this method by determining the gene targets and binding site motifs and thus predicting the function for a sigma54-dependent response regulator DVU3023 in the environmental bacterium Desulfovibrio vulgaris.
Genetics, Issue 89, DNA-Affinity-Purified-chip, response regulator, transcription factor binding site, two component system, signal transduction, Desulfovibrio, lactate utilization regulator, ChIP-chip
Vaccinia Virus Infection & Temporal Analysis of Virus Gene Expression: Part 3
Institutions: MIT - Massachusetts Institute of Technology.
The family Poxviridae consists of large double-stranded DNA-containing viruses that replicate exclusively in the cytoplasm of infected cells. Members of the orthopox genus include variola, the causative agent of human smallpox, monkeypox, and vaccinia (VAC), the prototypic member of the virus family. Within the relatively large (~200 kb) vaccinia genome, three classes of genes are encoded: early, intermediate, and late. While all three classes are transcribed by virally-encoded RNA polymerases, each class serves a different function in the life cycle of the virus. Poxviruses utilize multiple strategies for modulation of the host cellular environment during infection. In order to understand regulation of both host and virus gene expression, we have utilized genome-wide approaches to analyze transcript abundance from both virus and host cells. Here, we demonstrate time course infections of HeLa cells with Vaccinia virus and sampling of RNA at several time points post-infection. Both host and viral total RNA is isolated and amplified for hybridization to microarrays for analysis of gene expression.
Microbiology, Issue 26, Vaccinia, virus, infection, HeLa, Microarray, amplified RNA, amino allyl, RNA, Ambion Amino Allyl MessageAmpII, gene expression
Building a Better Mosquito: Identifying the Genes Enabling Malaria and Dengue Fever Resistance in A. gambiae and A. aegypti Mosquitoes
Institutions: Johns Hopkins University.
In this interview, George Dimopoulos focuses on the physiological mechanisms used by mosquitoes to combat Plasmodium falciparum and dengue virus infections. Explanation is given for how key refractory genes, those genes conferring resistance to vector pathogens, are identified in the mosquito and how this knowledge can be used to generate transgenic mosquitoes that are unable to carry the malaria parasite or dengue virus.
Cellular Biology, Issue 5, Translational Research, mosquito, malaria, virus, dengue, genetics, injection, RNAi, transgenesis, transgenic
Virus-induced Gene Silencing (VIGS) in Nicotiana benthamiana and Tomato
Institutions: Cornell University, Boyce Thompson Institute for Plant Research.
RNA interference (RNAi) is a highly specific gene-silencing phenomenon triggered by dsRNA1. This silencing mechanism uses two major classes of RNA regulators: microRNAs, which are produced from non-protein coding genes, and short interfering RNAs (siRNAs). Plants use RNAi to control transposons and to exert tight control over developmental processes such as flower organ formation and leaf development2,3,4. Plants also use RNAi to defend themselves against infection by viruses. Consequently, many viruses have evolved suppressors of gene silencing to allow their successful colonization of their host5.
Virus-induced gene silencing (VIGS) is a method that takes advantage of the plant RNAi-mediated antiviral defense mechanism. In plants infected with unmodified viruses the mechanism is specifically targeted against the viral genome. However, with virus vectors carrying sequences derived from host genes, the process can be additionally targeted against the corresponding host mRNAs. VIGS has been adapted for high-throughput functional genomics in plants by using the plant pathogen Agrobacterium tumefaciens to deliver, via its Ti plasmid, a recombinant virus carrying all or part of the gene sequence targeted for silencing. Systemic virus spread and the endogenous plant RNAi machinery take care of the rest. dsRNAs corresponding to the target gene are produced and then cleaved by the ribonuclease Dicer into siRNAs of 21 to 24 nucleotides in length. These siRNAs ultimately guide the RNA-induced silencing complex (RISC) to degrade the target transcript2.
Different vectors have been employed in VIGS and one of the most frequently used is based on tobacco rattle virus (TRV). TRV is a bipartite virus and, as such, two different A. tumefaciens strains are used for VIGS. One carries pTRV1, which encodes the replication and movement viral functions while the other, pTRV2, harbors the coat protein and the sequence used for VIGS6,7. Inoculation of Nicotiana benthamiana and tomato seedlings with a mixture of both strains results in gene silencing. Silencing of the endogenous phytoene desaturase (PDS) gene, which causes photobleaching, is used as a control for VIGS efficiency. It should be noted, however, that silencing in tomato is usually less efficient than in N. benthamiana. RNA transcript abundance of the gene of interest should always be measured to ensure that the target gene has efficiently been down-regulated. Nevertheless, heterologous gene sequences from N. benthamiana can be used to silence their respective orthologs in tomato and vice versa8.
Plant Biology, Issue 28, Virus-induced gene silencing (VIGS), RNA interference (RNAi), Tobacco Rattle Virus (TRV) vectors, Nicotiana benthamiana, tomato
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1 and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, there are the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age-related increase in paternal de novo mutations. This is the case for autism and schizophrenia3. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo mutations would more frequently come from males, particularly older males4. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complex diseases, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification of de novo mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
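The core logic of studying parents together with an affected subject can be sketched as a set difference: a candidate de novo variant is one seen in the child but in neither parent. The variant representation (chromosome, position, reference allele, alternate allele), the function name, and the example calls below are illustrative assumptions. A real analysis would also need to rule out sequencing errors and missed parental calls.

```python
def de_novo_candidates(child, mother, father):
    """Return variants present in the child but absent from both parents."""
    return sorted(set(child) - set(mother) - set(father))


# Made-up variant calls as (chromosome, position, ref, alt) tuples.
child = [("chr1", 1000, "A", "G"), ("chr2", 5000, "C", "T"), ("chr7", 42, "G", "A")]
mother = [("chr1", 1000, "A", "G")]
father = [("chr7", 42, "G", "A")]

print(de_novo_candidates(child, mother, father))
```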
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Using an Automated Cell Counter to Simplify Gene Expression Studies: siRNA Knockdown of IL-4 Dependent Gene Expression in Namalwa Cells
Institutions: Bio-Rad Laboratories.
siRNA-mediated gene knockdown continues to be an important tool in studies of gene expression. siRNA studies are being conducted not only to study the effects of downregulating single genes, but also to interrogate signaling pathways and other complex interaction networks. These pathway analyses require both the use of relevant cellular models and methods that cause less perturbation to the cellular physiology. Electroporation is increasingly being used as an effective way to introduce siRNA and other nucleic acids into difficult-to-transfect cell lines and primary cells without altering the signaling pathway under investigation. There are multiple critical steps to a successful siRNA experiment, and there are ways to simplify the work while improving the data quality at several experimental stages. To help you get started with your siRNA-mediated gene knockdown project, we will demonstrate how to perform a complete pathway study, from collecting and counting the cells prior to electroporation through post-transfection real-time PCR gene expression analysis. The following study investigates the role of the transcriptional activator STAT6 in IL-4 dependent gene expression of CCL17 in a Burkitt lymphoma cell line (Namalwa). The techniques demonstrated are useful for a wide range of siRNA-based experiments on both adherent and suspension cells. We will also show how to streamline cell counting with the TC10 automated cell counter, how to electroporate multiple samples simultaneously using the MXcell electroporation system, and how to simultaneously assess RNA quality and quantity with the Experion automated electrophoresis system.
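Knockdown measured by real-time PCR is often summarized as a fold change relative to a control sample. The sketch below uses the widely used 2^-ΔΔCt calculation, which assumes roughly 100% amplification efficiency for both the target and the reference gene. Whether this article uses that exact calculation is not stated here, and the function name and Ct values are hypothetical.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change of the target gene (treated vs. control) by 2^-ddCt."""
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    return 2 ** -(delta_treated - delta_control)


# Made-up Ct values: the target appears 2 cycles later after siRNA treatment
# while the reference gene is unchanged, i.e. about 25% of control expression.
print(relative_expression(26.0, 18.0, 24.0, 18.0))
```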
Cellular Biology, Issue 38, Cell Counting, Gene Silencing, siRNA, Namalwa Cells, IL4, Gene Expression, Electroporation, Real Time PCR
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1. In this article, we utilize a web version of SCOPE2 to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4 and has been used in other studies5-8.
The three algorithms that comprise SCOPE are BEAM9, which finds non-degenerate motifs (ACCGGT), PRISM10, which finds degenerate motifs (ASCGWT), and SPACER11, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
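To illustrate what the three motif classes mean in practice, the sketch below translates a motif written in IUPAC nucleotide codes into a regular expression and reports where it occurs in a sequence. This is not SCOPE's algorithm, which discovers motifs by over-representation and position preference. It only shows how an already-known motif of each type matches DNA. The function names and test sequences are made up for illustration.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_regex(motif):
    """Translate an IUPAC motif such as 'ASCGWT' into a regex string."""
    return "".join(IUPAC[base] for base in motif.upper())


def find_motif(motif, sequence):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    lookahead = re.compile(f"(?={motif_to_regex(motif)})")
    return [m.start() for m in lookahead.finditer(sequence.upper())]


print(find_motif("ACCGGT", "TTACCGGTAA"))        # non-degenerate motif
print(find_motif("ASCGWT", "TTACCGATGG"))        # degenerate: S = C/G, W = A/T
print(find_motif("ACCnnnnnnnnGGT", "ACCAAAAAAAAGGT"))  # bipartite with spacer
```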
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
SCOPE has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11.
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif