Whole transcriptome sequencing by mRNA-Seq is now used extensively to perform global gene expression, mutation, allele-specific expression and other genome-wide analyses. mRNA-Seq even opens the gate for gene expression analysis of non-sequenced genomes. mRNA-Seq offers high sensitivity, a large dynamic range and allows measurement of transcript copy numbers in a sample. Illumina’s genome analyzer performs sequencing of a large number (> 107) of relatively short sequence reads (< 150 bp).The "paired end" approach, wherein a single long read is sequenced at both its ends, allows for tracking alternate splice junctions, insertions and deletions, and is useful for de novo transcriptome assembly.
One of the major challenges faced by researchers is a limited amount of starting material. For example, in experiments where cells are harvested by laser micro-dissection, available starting total RNA may measure in nanograms. Preparation of mRNA-Seq libraries from such samples have been described1, 2 but involves significant PCR amplification that may introduce bias. Other RNA-Seq library construction procedures with minimal PCR amplification have been published3, 4 but require microgram amounts of starting total RNA.
Here we describe a protocol for the Illumina Genome Analyzer II platform for mRNA-Seq sequencing for library preparation that avoids significant PCR amplification and requires only 10 nanograms of total RNA. While this protocol has been described previously and validated for single-end sequencing5, where it was shown to produce directional libraries without introducing significant amplification bias, here we validate it further for use as a paired end protocol. We selectively amplify polyadenylated messenger RNAs from starting total RNA using the T7 based Eberwine linear amplification method, coined "T7LA" (T7 linear amplification). The amplified poly-A mRNAs are fragmented, reverse transcribed and adapter ligated to produce the final sequencing library. For both single read and paired end runs, sequences are mapped to the human transcriptome6 and normalized so that data from multiple runs can be compared. We report the gene expression measurement in units of transcripts per million (TPM), which is a superior measure to RPKM when comparing samples7.
22 Related JoVE Articles!
Massively Parallel Reporter Assays in Cultured Mammalian Cells
Institutions: Broad Institute.
The genetic reporter assay is a well-established and powerful tool for dissecting the relationship between DNA sequences and their gene regulatory activities. The potential throughput of this assay has, however, been limited by the need to individually clone and assay the activity of each sequence on interest using protein fluorescence or enzymatic activity as a proxy for regulatory activity. Advances in high-throughput DNA synthesis and sequencing technologies have recently made it possible to overcome these limitations by multiplexing the construction and interrogation of large libraries of reporter constructs. This protocol describes implementation of a Massively Parallel Reporter Assay (MPRA) that allows direct comparison of hundreds of thousands of putative regulatory sequences in a single cell culture dish.
Genetics, Issue 90, gene regulation, transcriptional regulation, sequence-activity mapping, reporter assay, library cloning, transfection, tag sequencing, mammalian cells
Analyzing Gene Expression from Marine Microbial Communities using Environmental Transcriptomics
Institutions: University of Georgia (UGA).
Analogous to metagenomics, environmental transcriptomics (metatranscriptomics) retrieves and sequences environmental mRNAs from a microbial assemblage without prior knowledge of what genes the community might be expressing. Thus it provides the most unbiased perspective on community gene expression in situ
. Environmental transcriptomics protocols are technically difficult since prokaryotic mRNAs generally lack the poly(A) tails that make isolation of eukaryotic messages relatively straightforward 1
and because of the relatively short half lives of mRNAs 2
. In addition, mRNAs are much less abundant than rRNAs in total RNA extracts, thus an rRNA background often overwhelms mRNA signals. However, techniques for overcoming some of these difficulties have recently been developed. A procedure for analyzing environmental transcriptomes by creating clone libraries using random primers to reverse-transcribe and amplify environmental mRNAs was recently described was successful in two different natural environments, but results were biased by selection of the random primers used to initiate cDNA synthesis 3
. Advances in linear amplification of mRNA obviate the need for random primers in the amplification step and make it possible to use less starting material decreasing the collection and processing time of samples and thereby minimizing RNA degradation 4
. In vitro
transcription methods for amplifying mRNA involve polyadenylating the mRNA and incorporating a T7 promoter onto the 3 end of the transcript. Amplified RNA (aRNA) can then be converted to double stranded cDNA using random hexamers and directly sequenced by pyrosequencing 5
. A first use of this method at Station ALOHA demonstrated its utility for characterizing microbial community gene expression 6
Microbiology, Issue 24, transcriptomics, bacterioplankton, mRNA, microbial communities, gene expression
Primer Extension Capture: Targeted Sequence Retrieval from Heavily Degraded DNA Sources
Institutions: Max-Planck Institute for Evolutionary Anthropology, Leipzig.
We present a method of targeted DNA sequence retrieval from DNA sources which are heavily degraded and contaminated with microbial DNA, as is typical of ancient bones. The method greatly reduces sample destruction and sequencing demands relative to direct PCR or shotgun sequencing approaches. We used this method to reconstruct the complete mitochondrial DNA (mtDNA) genomes of five Neandertals from across their geographic range. The mtDNA genetic diversity of the late Neandertals was approximately three times lower than that of contemporary modern humans. Together with analyses of mtDNA protein evolution, these data suggest that the long-term effective population size of Neandertals was smaller than that of modern humans and extant great apes.
Cellular Biology, Issue 31, Neandertal, anthropology, evolution, ancient DNA, DNA sequencing, targeted sequencing, capture
Direct Restart of a Replication Fork Stalled by a Head-On RNA Polymerase
Institutions: Rockefeller University.
studies suggest that replication forks are arrested due to encounters with head-on transcription complexes. Yet, the fate of the replisome and RNA polymerase (RNAP) following a head-on collision is unknown. Here, we find that the E. coli
replisome stalls upon collision with a head-on transcription complex, but instead of collapsing, the replication fork remains highly stable and eventually resumes elongation after displacing the RNAP from DNA. We also find that the transcription-repair coupling factor, Mfd, promotes direct restart of the fork following the collision by facilitating displacement of the RNAP. These findings demonstrate the intrinsic stability of the replication apparatus and a novel role for the transcription-coupled repair pathway in promoting replication past a RNAP block.
Cellular Biology, Issue 38, replication, transcription, transcription-coupled repair, replisome, RNA polymerase, collision
A Fluorescence-based Exonuclease Assay to Characterize DmWRNexo, Orthologue of Human Progeroid WRN Exonuclease, and Its Application to Other Nucleases
Institutions: University of Oxford.
WRN exonuclease is involved in resolving DNA damage that occurs either during DNA replication or following exposure to endogenous or exogenous genotoxins. It is likely to play a role in preventing accumulation of recombinogenic intermediates that would otherwise accumulate at transiently stalled replication forks, consistent with a hyper-recombinant phenotype of cells lacking WRN. In humans, the exonuclease domain comprises an N-terminal portion of a much larger protein that also possesses helicase activity, together with additional sites important for DNA and protein interaction. By contrast, in Drosophila
, the exonuclease activity of WRN (DmWRNexo) is encoded by a distinct genetic locus from the presumptive helicase, allowing biochemical (and genetic) dissection of the role of the exonuclease activity in genome stability mechanisms. Here, we demonstrate a fluorescent method to determine WRN exonuclease activity using purified recombinant DmWRNexo and end-labeled fluorescent oligonucleotides. This system allows greater reproducibility than radioactive assays as the substrate oligonucleotides remain stable for months, and provides a safer and relatively rapid method for detailed analysis of nuclease activity, permitting determination of nuclease polarity, processivity, and substrate preferences.
Biochemistry, Issue 82, Aging, Premature, Exonucleases, Enzyme Assays, biochemistry, WRN, exonuclease, nuclease, RecQ, progeroid disease, aging, DmWRNexo
Multiplex Detection of Bacteria in Complex Clinical and Environmental Samples using Oligonucleotide-coupled Fluorescent Microspheres
Institutions: Agriculture and Agri-Food Canada, University of Saskatchewan , National Research Council of Canada.
Bacterial vaginosis (BV) is a recurring polymicrobial syndrome that is characterized by a change in the "normal" microbiota from Lactobacillus
-dominated to a microbiota dominated by a number of bacterial species, including Gardnerella vaginalis
, Atopobium vaginae
, and others1-3
. This condition is associated with a range of negative health outcomes, including HIV acquisition4
, and it can be difficult to manage clinically5
. Furthermore, diagnosis of BV has relied on the use of Gram stains of vaginal swab smears that are scored on various numerical criteria6,7
. While this diagnostic is simple, inexpensive, and well suited to resource-limited settings, it can suffer from problems related to subjective interpretations and it does not give a detailed profile of the composition of the vaginal microbiota8
. Recent deep sequencing efforts have revealed a rich, diverse vaginal microbiota with clear differences between samples taken from individuals that are diagnosed with BV compared to those individuals that are considered normal9,10
, which has resulted in the identification of a number of potential targets for molecular diagnosis of BV11,12
. These studies have provided a wealth of useful information, but deep sequencing is not yet practical as a diagnostic method in a clinical setting. We have recently described a method for rapidly profiling the vaginal microbiota in a multiplex format using oligonucleotide-coupled fluorescent beads with detection on a Luminex platform13
. This method, like current Gram stain-based methods, is rapid and simple but adds the additional advantage of exploiting molecular knowledge arising from sequencing studies in probe design. This method therefore provides a way to profile the major microorganisms that are present in a vaginal swab that can be used to diagnose BV with high specificity and sensitivity compared to Gram stain while providing additional information on species presence and abundance in a semi-quantitative and rapid manner. This multiplex method is expandable well beyond the range of current quantitative PCR assays for particular organisms, which is currently limited to 5 or 6 different assays in a single sample14
. Importantly, the method is not limited to the detection of bacteria in vaginal swabs and can be easily adapted to rapidly profile nearly any microbial community of interest. For example, we have recently begun to apply this methodology to the development of diagnostic tools for use in wastewater treatment plants.
Immunology, Issue 56, Medicine, chaperonin-60, hsp60, luminex, multiplex, diagnostics, bacterial vaginosis, PCR
A Method for Culturing Embryonic C. elegans Cells
Institutions: University of Miami .
is a powerful model system, in which genetic and molecular techniques are easily applicable. Until recently though, techniques that require direct access to cells and isolation of specific cell types, could not be applied in C. elegans
. This limitation was due to the fact that tissues are confined within a pressurized cuticle which is not easily digested by treatment with enzymes and/or detergents. Based on early pioneer work by Laird Bloom, Christensen and colleagues 1
developed a robust method for culturing C. elegans
embryonic cells in large scale. Eggs are isolated from gravid adults by treatment with bleach/NaOH and subsequently treated with chitinase to remove the eggshells. Embryonic cells are then dissociated by manual pipetting and plated onto substrate-covered glass in serum-enriched media. Within 24 hr of isolation cells begin to differentiate by changing morphology and by expressing cell specific markers. C. elegans
cells cultured using this method survive for up 2 weeks in vitro
and have been used for electrophysiological, immunochemical, and imaging analyses as well as they have been sorted and used for microarray profiling.
Developmental Biology, Issue 79, Eukaryota, Biological Phenomena, Cell Physiological Phenomena, C. elegans, cell culture, embryonic cells
A Quantitative Assay to Study Protein:DNA Interactions, Discover Transcriptional Regulators of Gene Expression, and Identify Novel Anti-tumor Agents
Institutions: University of Maryland School of Medicine, University of Maryland School of Medicine, University of Maryland School of Medicine, University of Maryland School of Medicine, University of Maryland School of Medicine.
Many DNA-binding assays such as electrophoretic mobility shift assays (EMSA), chemiluminescent assays, chromatin immunoprecipitation (ChIP)-based assays, and multiwell-based assays are used to measure transcription factor activity. However, these assays are nonquantitative, lack specificity, may involve the use of radiolabeled oligonucleotides, and may not be adaptable for the screening of inhibitors of DNA binding. On the other hand, using a quantitative DNA-binding enzyme-linked immunosorbent assay (D-ELISA) assay, we demonstrate nuclear protein interactions with DNA using the RUNX2 transcription factor that depend on specific association with consensus DNA-binding sequences present on biotin-labeled oligonucleotides. Preparation of cells, extraction of nuclear protein, and design of double stranded oligonucleotides are described. Avidin-coated 96-well plates are fixed with alkaline buffer and incubated with nuclear proteins in nucleotide blocking buffer. Following extensive washing of the plates, specific primary antibody and secondary antibody incubations are followed by the addition of horseradish peroxidase substrate and development of the colorimetric reaction. Stop reaction mode or continuous kinetic monitoring were used to quantitatively measure protein interaction with DNA. We discuss appropriate specificity controls, including treatment with non-specific IgG or without protein or primary antibody. Applications of the assay are described including its utility in drug screening and representative positive and negative results are discussed.
Cellular Biology, Issue 78, Transcription Factors, Vitamin D, Drug Discovery, Enzyme-Linked Immunosorbent Assay (ELISA), DNA-binding, transcription factor, drug screening, antibody
Nucleoside Triphosphates - From Synthesis to Biochemical Characterization
Institutions: University of Bern.
The traditional strategy for the introduction of chemical functionalities is the use of solid-phase synthesis by appending suitably modified phosphoramidite precursors to the nascent chain. However, the conditions used during the synthesis and the restriction to rather short sequences hamper the applicability of this methodology. On the other hand, modified nucleoside triphosphates are activated building blocks that have been employed for the mild introduction of numerous functional groups into nucleic acids, a strategy that paves the way for the use of modified nucleic acids in a wide-ranging palette of practical applications such as functional tagging and generation of ribozymes and DNAzymes. One of the major challenges resides in the intricacy of the methodology leading to the isolation and characterization of these nucleoside analogues.
In this video article, we present a detailed protocol for the synthesis of these modified analogues using phosphorous(III)-based reagents. In addition, the procedure for their biochemical characterization is divulged, with a special emphasis on primer extension reactions and TdT tailing polymerization. This detailed protocol will be of use for the crafting of modified dNTPs and their further use in chemical biology.
Chemistry, Issue 86, Nucleic acid analogues, Bioorganic Chemistry, PCR, primer extension reactions, organic synthesis, PAGE, HPLC, nucleoside triphosphates
Vaccinia Virus Infection & Temporal Analysis of Virus Gene Expression: Part 2
Institutions: MIT - Massachusetts Institute of Technology.
The family Poxviridae
consists of large double-stranded DNA containing viruses that replicate exclusively in the cytoplasm of infected cells. Members of the orthopox
genus include variola, the causative agent of human small pox, monkeypox, and vaccinia (VAC), the prototypic member of the virus family. Within the relatively large (~ 200 kb) vaccinia genome, three classes of genes are encoded: early, intermediate, and late. While all three classes are transcribed by virally-encoded RNA polymerases, each class serves a different function in the life cycle of the virus. Poxviruses utilize multiple strategies for modulation of the host cellular environment during infection. In order to understand regulation of both host and virus gene expression, we have utilized genome-wide approaches to analyze transcript abundance from both virus and host cells. Here, we demonstrate time course infections of HeLa cells with Vaccinia virus and sampling RNA at several time points post-infection. Both host and viral total RNA is isolated and amplified for hybridization to microarrays for analysis of gene expression.
Cellular Biology, Immunology, Microbiology, Issue 26, Vaccinia, virus, infection, HeLa, TRIzol reagent, total RNA, Microarray, amplification, amino allyl, RNA, Ambion Amino Allyl MessageAmpII, gene expression
Large-scale Gene Knockdown in C. elegans Using dsRNA Feeding Libraries to Generate Robust Loss-of-function Phenotypes
Institutions: University of Massachusetts, Amherst, University of Massachusetts, Amherst, University of Massachusetts, Amherst.
RNA interference by feeding worms bacteria expressing dsRNAs has been a useful tool to assess gene function in C. elegans
. While this strategy works well when a small number of genes are targeted for knockdown, large scale feeding screens show variable knockdown efficiencies, which limits their utility. We have deconstructed previously published RNAi knockdown protocols and found that the primary source of the reduced knockdown can be attributed to the loss of dsRNA-encoding plasmids from the bacteria fed to the animals. Based on these observations, we have developed a dsRNA feeding protocol that greatly reduces or eliminates plasmid loss to achieve efficient, high throughput knockdown. We demonstrate that this protocol will produce robust, reproducible knock down of C. elegans
genes in multiple tissue types, including neurons, and will permit efficient knockdown in large scale screens. This protocol uses a commercially available dsRNA feeding library and describes all steps needed to duplicate the library and perform dsRNA screens. The protocol does not require the use of any sophisticated equipment, and can therefore be performed by any C. elegans
Developmental Biology, Issue 79, Caenorhabditis elegans (C. elegans), Gene Knockdown Techniques, C. elegans, dsRNA interference, gene knockdown, large scale feeding screen
Metabolic Labeling of Newly Transcribed RNA for High Resolution Gene Expression Profiling of RNA Synthesis, Processing and Decay in Cell Culture
Institutions: Max von Pettenkofer Institute, University of Cambridge, Ludwig-Maximilians-University Munich.
The development of whole-transcriptome microarrays and next-generation sequencing has revolutionized our understanding of the complexity of cellular gene expression. Along with a better understanding of the involved molecular mechanisms, precise measurements of the underlying kinetics have become increasingly important. Here, these powerful methodologies face major limitations due to intrinsic properties of the template samples they study, i.e.
total cellular RNA. In many cases changes in total cellular RNA occur either too slowly or too quickly to represent the underlying molecular events and their kinetics with sufficient resolution. In addition, the contribution of alterations in RNA synthesis, processing, and decay are not readily differentiated.
We recently developed high-resolution gene expression profiling to overcome these limitations. Our approach is based on metabolic labeling of newly transcribed RNA with 4-thiouridine (thus also referred to as 4sU-tagging) followed by rigorous purification of newly transcribed RNA using thiol-specific biotinylation and streptavidin-coated magnetic beads. It is applicable to a broad range of organisms including vertebrates, Drosophila
, and yeast. We successfully applied 4sU-tagging to study real-time kinetics of transcription factor activities, provide precise measurements of RNA half-lives, and obtain novel insights into the kinetics of RNA processing. Finally, computational modeling can be employed to generate an integrated, comprehensive analysis of the underlying molecular mechanisms.
Genetics, Issue 78, Cellular Biology, Molecular Biology, Microbiology, Biochemistry, Eukaryota, Investigative Techniques, Biological Phenomena, Gene expression profiling, RNA synthesis, RNA processing, RNA decay, 4-thiouridine, 4sU-tagging, microarray analysis, RNA-seq, RNA, DNA, PCR, sequencing
Primer-Free Aptamer Selection Using A Random DNA Library
Institutions: Pennsylvania State University, Pennsylvania State University, Pennsylvania State University, Pennsylvania State University.
Aptamers are highly structured oligonucleotides (DNA or RNA) that can bind to targets with affinities comparable to antibodies 1
. They are identified through an in vitro selection process called Systematic Evolution of Ligands by EXponential enrichment (SELEX) to recognize a wide variety of targets, from small molecules to proteins and other macromolecules 2-4
. Aptamers have properties that are well suited for in vivo diagnostic and/or therapeutic applications: Besides good specificity and affinity, they are easily synthesized, survive more rigorous processing conditions, they are poorly immunogenic, and their relatively small size can result in facile penetration of tissues.
Aptamers that are identified through the standard SELEX process usually comprise ~80 nucleotides (nt), since they are typically selected from nucleic acid libraries with ~40 nt long randomized regions plus fixed primer sites of ~20 nt on each side. The fixed primer sequences thus can comprise nearly ~50% of the library sequences, and therefore may positively or negatively compromise identification of aptamers in the selection process 3
, although bioinformatics approaches suggest that the fixed sequences do not contribute significantly to aptamer structure after selection 5
. To address these potential problems, primer sequences have been blocked by complementary oligonucleotides or switched to different sequences midway during the rounds of SELEX 6
, or they have been trimmed to 6-9 nt 7, 8
. Wen and Gray 9
designed a primer-free genomic SELEX method, in which the primer sequences were completely removed from the library before selection and were then regenerated to allow amplification of the selected genomic fragments. However, to employ the technique, a unique genomic library has to be constructed, which possesses limited diversity, and regeneration after rounds of selection relies on a linear reamplification step. Alternatively, efforts to circumvent problems caused by fixed primer sequences using high efficiency partitioning are met with problems regarding PCR amplification 10
We have developed a primer-free (PF) selection method that significantly simplifies SELEX procedures and effectively eliminates primer-interference problems 11, 12
. The protocols work in a straightforward manner. The central random region of the library is purified without extraneous flanking sequences and is bound to a suitable target (for example to a purified protein or complex mixtures such as cell lines). Then the bound sequences are obtained, reunited with flanking sequences, and re-amplified to generate selected sub-libraries. As an example, here we selected aptamers to S100B, a protein marker for melanoma. Binding assays showed Kd s in the 10-7
M range after a few rounds of selection, and we demonstrate that the aptamers function effectively in a sandwich binding format.
Cellular Biology, Issue 41, aptamer, selection, S100B, sandwich
Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
Institutions: Memorial Sloan-Kettering Cancer Center, Memorial Sloan-Kettering Cancer Center.
Efforts to detect and investigate key oncogenic mutations have proven valuable to facilitate the appropriate treatment for cancer patients. The establishment of high-throughput, massively parallel "next-generation" sequencing has aided the discovery of many such mutations. To enhance the clinical and translational utility of this technology, platforms must be high-throughput, cost-effective, and compatible with formalin-fixed paraffin embedded (FFPE) tissue samples that may yield small amounts of degraded or damaged DNA. Here, we describe the preparation of barcoded and multiplexed DNA libraries followed by hybridization-based capture of targeted exons for the detection of cancer-associated mutations in fresh frozen and FFPE tumors by massively parallel sequencing. This method enables the identification of sequence mutations, copy number alterations, and select structural rearrangements involving all targeted genes. Targeted exon sequencing offers the benefits of high throughput, low cost, and deep sequence coverage, thus conferring high sensitivity for detecting low frequency mutations.
Molecular Biology, Issue 80, Molecular Diagnostic Techniques, High-Throughput Nucleotide Sequencing, Genetics, Neoplasms, Diagnosis, Massively parallel sequencing, targeted exon sequencing, hybridization capture, cancer, FFPE, DNA mutations
Highly Efficient Ligation of Small RNA Molecules for MicroRNA Quantitation by High-Throughput Sequencing
Institutions: University of Colorado, Boulder, University of Colorado, Denver.
MiRNA cloning and high-throughput sequencing, termed miR-Seq, stands alone as a transcriptome-wide approach to quantify miRNAs with single nucleotide resolution. This technique captures miRNAs by attaching 3’ and 5’ oligonucleotide adapters to miRNA molecules and allows de novo
miRNA discovery. Coupling with powerful next-generation sequencing platforms, miR-Seq has been instrumental in the study of miRNA biology. However, significant biases introduced by oligonucleotide ligation steps have prevented miR-Seq from being employed as an accurate quantitation tool. Previous studies demonstrate that biases in current miR-Seq methods often lead to inaccurate miRNA quantification with errors up to 1,000-fold for some miRNAs1,2
. To resolve these biases imparted by RNA ligation, we have developed a small RNA ligation method that results in ligation efficiencies of over 95% for both 3’ and 5′ ligation steps. Benchmarking this improved library construction method using equimolar or differentially mixed synthetic miRNAs, consistently yields reads numbers with less than two-fold deviation from the expected value. Furthermore, this high-efficiency miR-Seq method permits accurate genome-wide miRNA profiling from in vivo
total RNA samples2
Molecular Biology, Issue 93, RNA, ligation, miRNA, miR-Seq, linker, oligonucleotide, high-throughput sequencing
Chromatin Interaction Analysis with Paired-End Tag Sequencing (ChIA-PET) for Mapping Chromatin Interactions and Understanding Transcription Regulation
Institutions: Agency for Science, Technology and Research, Singapore, A*STAR-Duke-NUS Neuroscience Research Partnership, Singapore, National University of Singapore, Singapore.
Genomes are organized into three-dimensional structures, adopting higher-order conformations inside the micron-sized nuclear spaces 7, 2, 12
. Such architectures are not random and involve interactions between gene promoters and regulatory elements 13
. The binding of transcription factors to specific regulatory sequences brings about a network of transcription regulation and coordination 1, 14
Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) was developed to identify these higher-order chromatin structures 5,6
. Cells are fixed and interacting loci are captured by covalent DNA-protein cross-links. To minimize non-specific noise and reduce complexity, as well as to increase the specificity of the chromatin interaction analysis, chromatin immunoprecipitation (ChIP) is used against specific protein factors to enrich chromatin fragments of interest before proximity ligation. Ligation involving half-linkers subsequently forms covalent links between pairs of DNA fragments tethered together within individual chromatin complexes. The flanking MmeI restriction enzyme sites in the half-linkers allow extraction of paired end tag-linker-tag constructs (PETs) upon MmeI digestion. As the half-linkers are biotinylated, these PET constructs are purified using streptavidin-magnetic beads. The purified PETs are ligated with next-generation sequencing adaptors and a catalog of interacting fragments is generated via next-generation sequencers such as the Illumina Genome Analyzer. Mapping and bioinformatics analysis is then performed to identify ChIP-enriched binding sites and ChIP-enriched chromatin interactions 8
We have produced a video to demonstrate critical aspects of the ChIA-PET protocol, especially the preparation of ChIP as the quality of ChIP plays a major role in the outcome of a ChIA-PET library. As the protocols are very long, only the critical steps are shown in the video.
Genetics, Issue 62, ChIP, ChIA-PET, Chromatin Interactions, Genomics, Next-Generation Sequencing
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g.
drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism1,2
. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence3
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recent published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin,"4
in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets to those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
Linear Amplification Mediated PCR – Localization of Genetic Elements and Characterization of Unknown Flanking DNA
Institutions: National Center for Tumor Diseases (NCT) and German Cancer Research Center (DKFZ).
Linear-amplification mediated PCR (LAM-PCR) has been developed to study hematopoiesis in gene corrected cells of patients treated by gene therapy with integrating vector systems. Due to the stable integration of retroviral vectors, integration sites can be used to study the clonal fate of individual cells and their progeny. LAM- PCR for the first time provided evidence that leukemia in gene therapy treated patients originated from provirus induced overexpression of a neighboring proto-oncogene. The high sensitivity and specificity of LAM-PCR compared to existing methods like inverse PCR and ligation mediated (LM)-PCR is achieved by an initial preamplification step (linear PCR of 100 cycles) using biotinylated vector specific primers which allow subsequent reaction steps to be carried out on solid phase (magnetic beads). LAM-PCR is currently the most sensitive method available to identify unknown DNA which is located in the proximity of known DNA. Recently, a variant of LAM-PCR has been developed that circumvents restriction digest thus abrogating retrieval bias of integration sites and enables a comprehensive analysis of provirus locations in host genomes. The following protocol explains step-by-step the amplification of both 3’- and 5’- sequences adjacent to the integrated lentiviral vector.
Genetics, Issue 88, gene therapy, integrome, integration site analysis, LAM-PCR, retroviral vectors, lentiviral vectors, AAV, deep sequencing, clonal inventory, mutagenesis screen
Polymerase Chain Reaction: Basic Protocol Plus Troubleshooting and Optimization Strategies
Institutions: University of California, Los Angeles .
In the biological sciences there have been technological advances that catapult the discipline into golden ages of discovery. For example, the field of microbiology was transformed with the advent of Anton van Leeuwenhoek's microscope, which allowed scientists to visualize prokaryotes for the first time. The development of the polymerase chain reaction (PCR) is one of those innovations that changed the course of molecular science with its impact spanning countless subdisciplines in biology. The theoretical process was outlined by Keppe and coworkers in 1971; however, it was another 14 years until the complete PCR procedure was described and experimentally applied by Kary Mullis while at Cetus Corporation in 1985. Automation and refinement of this technique progressed with the introduction of a thermal stable DNA polymerase from the bacterium Thermus aquaticus
, consequently the name Taq
PCR is a powerful amplification technique that can generate an ample supply of a specific segment of DNA (i.e., an amplicon) from only a small amount of starting material (i.e., DNA template or target sequence). While straightforward and generally trouble-free, there are pitfalls that complicate the reaction producing spurious results. When PCR fails it can lead to many non-specific DNA products of varying sizes that appear as a ladder or smear of bands on agarose gels. Sometimes no products form at all. Another potential problem occurs when mutations are unintentionally introduced in the amplicons, resulting in a heterogeneous population of PCR products. PCR failures can become frustrating unless patience and careful troubleshooting are employed to sort out and solve the problem(s). This protocol outlines the basic principles of PCR, provides a methodology that will result in amplification of most target sequences, and presents strategies for optimizing a reaction. By following this PCR guide, students should be able to:
● Set up reactions and thermal cycling conditions for a conventional PCR experiment
● Understand the function of various reaction components and their overall effect on a PCR experiment
● Design and optimize a PCR experiment for any DNA template
● Troubleshoot failed PCR experiments
Basic Protocols, Issue 63, PCR, optimization, primer design, melting temperature, Tm, troubleshooting, additives, enhancers, template DNA quantification, thermal cycler, molecular biology, genetics
Profiling of Estrogen-regulated MicroRNAs in Breast Cancer Cells
Institutions: University of Houston.
Estrogen plays vital roles in mammary gland development and breast cancer progression. It mediates its function by binding to and activating the estrogen receptors (ERs), ERα, and ERβ. ERα is frequently upregulated in breast cancer and drives the proliferation of breast cancer cells. The ERs function as transcription factors and regulate gene expression. Whereas ERα's regulation of protein-coding genes is well established, its regulation of noncoding microRNA (miRNA) is less explored. miRNAs play a major role in the post-transcriptional regulation of genes, inhibiting their translation or degrading their mRNA. miRNAs can function as oncogenes or tumor suppressors and are also promising biomarkers. Among the miRNA assays available, microarray and quantitative real-time polymerase chain reaction (qPCR) have been extensively used to detect and quantify miRNA levels. To identify miRNAs regulated by estrogen signaling in breast cancer, their expression in ERα-positive breast cancer cell lines were compared before and after estrogen-activation using both the µParaflo-microfluidic microarrays and Dual Labeled Probes-low density arrays. Results were validated using specific qPCR assays, applying both Cyanine dye-based and Dual Labeled Probes-based chemistry. Furthermore, a time-point assay was used to identify regulations over time. Advantages of the miRNA assay approach used in this study is that it enables a fast screening of mature miRNA regulations in numerous samples, even with limited sample amounts. The layout, including the specific conditions for cell culture and estrogen treatment, biological and technical replicates, and large-scale screening followed by in-depth confirmations using separate techniques, ensures a robust detection of miRNA regulations, and eliminates false positives and other artifacts. However, mutated or unknown miRNAs, or regulations at the primary and precursor transcript level, will not be detected. The method presented here represents a thorough investigation of estrogen-mediated miRNA regulation.
Medicine, Issue 84, breast cancer, microRNA, estrogen, estrogen receptor, microarray, qPCR
A Rapid High-throughput Method for Mapping Ribonucleoproteins (RNPs) on Human pre-mRNA
Institutions: Brown University, Brown University.
Sequencing RNAs that co-immunoprecipitate (co-IP) with RNA binding proteins has increased our understanding of splicing by demonstrating that binding location often influences function of a splicing factor. However, as with any sampling strategy the chance of identifying an RNA bound to a splicing factor is proportional to its cellular abundance. We have developed a novel in vitro approach for surveying binding specificity on otherwise transient pre-mRNA. This approach utilizes a specifically designed oligonucleotide pool that tiles across introns, exons, splice junctions, or other pre-mRNA. The pool is subjected to some kind of molecular selection. Here, we demonstrate the method by separating the oligonucleotide into a bound and unbound fraction and utilize a two color array strategy to record the enrichment of each oligonucleotide in the bound fraction. The array data generates high-resolution maps with the ability to identify sequence-specific and structural determinates of ribonucleoprotein (RNP) binding on pre-mRNA. A unique advantage to this method is its ability to avoid the sampling bias towards mRNA associated with current IP and SELEX techniques, as the pool is specifically designed and synthesized from pre-mRNA sequence. The flexibility of the oligonucleotide pool is another advantage since the experimenter chooses which regions to study and tile across, tailoring the pool to their individual needs. Using this technique, one can assay the effects of polymorphisms or mutations on binding on a large scale or clone the library into a functional splicing reporter and identify oligonucleotides that are enriched in the included fraction. This novel in vitro high-resolution mapping scheme provides a unique way to study RNP interactions with transient pre-mRNA species, whose low abundance makes them difficult to study with current in vivo techniques.
Cellular Biology, Issue 34, pre-mRNA, splicing factors, tiling array, ribonucleoprotein (RNP), binding maps
Molecular Evolution of the Tre Recombinase
Institutions: Max Plank Institute for Molecular Cell Biology and Genetics, Dresden.
Here we report the generation of Tre recombinase through directed, molecular evolution. Tre recombinase recognizes a pre-defined target sequence within the LTR sequences of the HIV-1 provirus, resulting in the excision and eradication of the provirus from infected human cells.
We started with Cre, a 38-kDa recombinase, that recognizes a 34-bp double-stranded DNA sequence known as loxP. Because Cre can effectively eliminate genomic sequences, we set out to tailor a recombinase that could remove the sequence between the 5'-LTR and 3'-LTR of an integrated HIV-1 provirus. As a first step we identified sequences within the LTR sites that were similar to loxP and tested for recombination activity. Initially Cre and mutagenized Cre libraries failed to recombine the chosen loxLTR sites of the HIV-1 provirus. As the start of any directed molecular evolution process requires at least residual activity, the original asymmetric loxLTR sequences were split into subsets and tested again for recombination activity. Acting as intermediates, recombination activity was shown with the subsets. Next, recombinase libraries were enriched through reiterative evolution cycles. Subsequently, enriched libraries were shuffled and recombined. The combination of different mutations proved synergistic and recombinases were created that were able to recombine loxLTR1 and loxLTR2. This was evidence that an evolutionary strategy through intermediates can be successful. After a total of 126 evolution cycles individual recombinases were functionally and structurally analyzed. The most active recombinase -- Tre -- had 19 amino acid changes as compared to Cre. Tre recombinase was able to excise the HIV-1 provirus from the genome HIV-1 infected HeLa cells (see "HIV-1 Proviral DNA Excision Using an Evolved Recombinase", Hauber J., Heinrich-Pette-Institute for Experimental Virology and Immunology, Hamburg, Germany). While still in its infancy, directed molecular evolution will allow the creation of custom enzymes that will serve as tools of "molecular surgery" and molecular medicine.
Cell Biology, Issue 15, HIV-1, Tre recombinase, Site-specific recombination, molecular evolution