Whole transcriptome sequencing by mRNA-Seq is now used extensively to perform global gene expression, mutation, allele-specific expression and other genome-wide analyses. mRNA-Seq even opens the gate for gene expression analysis of non-sequenced genomes. mRNA-Seq offers high sensitivity, a large dynamic range and allows measurement of transcript copy numbers in a sample. Illumina’s genome analyzer performs sequencing of a large number (> 107) of relatively short sequence reads (< 150 bp).The "paired end" approach, wherein a single long read is sequenced at both its ends, allows for tracking alternate splice junctions, insertions and deletions, and is useful for de novo transcriptome assembly.
One of the major challenges faced by researchers is a limited amount of starting material. For example, in experiments where cells are harvested by laser micro-dissection, available starting total RNA may measure in nanograms. Preparation of mRNA-Seq libraries from such samples have been described1, 2 but involves significant PCR amplification that may introduce bias. Other RNA-Seq library construction procedures with minimal PCR amplification have been published3, 4 but require microgram amounts of starting total RNA.
Here we describe a protocol for the Illumina Genome Analyzer II platform for mRNA-Seq sequencing for library preparation that avoids significant PCR amplification and requires only 10 nanograms of total RNA. While this protocol has been described previously and validated for single-end sequencing5, where it was shown to produce directional libraries without introducing significant amplification bias, here we validate it further for use as a paired end protocol. We selectively amplify polyadenylated messenger RNAs from starting total RNA using the T7 based Eberwine linear amplification method, coined "T7LA" (T7 linear amplification). The amplified poly-A mRNAs are fragmented, reverse transcribed and adapter ligated to produce the final sequencing library. For both single read and paired end runs, sequences are mapped to the human transcriptome6 and normalized so that data from multiple runs can be compared. We report the gene expression measurement in units of transcripts per million (TPM), which is a superior measure to RPKM when comparing samples7.
22 Related JoVE Articles!
Genomic MRI - a Public Resource for Studying Sequence Patterns within Genomic DNA
Institutions: University of Toledo Health Science Campus.
Non-coding genomic regions in complex eukaryotes, including intergenic areas, introns, and untranslated segments of exons, are profoundly non-random in their nucleotide composition and consist of a complex mosaic of sequence patterns. These patterns include so-called Mid-Range Inhomogeneity (MRI) regions -- sequences 30-10000 nucleotides in length that are enriched by a particular base or combination of bases (e.g. (G+T)-rich, purine-rich, etc.). MRI regions are associated with unusual (non-B-form) DNA structures that are often involved in regulation of gene expression, recombination, and other genetic processes (Fedorova & Fedorov 2010). The existence of a strong fixation bias within MRI regions against mutations that tend to reduce their sequence inhomogeneity additionally supports the functionality and importance of these genomic sequences (Prakash et al.
Here we demonstrate a freely available Internet resource -- the Genomic MRI
program package -- designed for computational analysis of genomic sequences in order to find and characterize various MRI patterns within them (Bechtel et al.
2008). This package also allows generation of randomized sequences with various properties and level of correspondence to the natural input DNA sequences. The main goal of this resource is to facilitate examination of vast regions of non-coding DNA that are still scarcely investigated and await thorough exploration and recognition.
Genetics, Issue 51, bioinformatics, computational biology, genomics, non-randomness, signals, gene regulation, DNA conformation
MicroRNA Detection in Prostate Tumors by Quantitative Real-time PCR (qPCR)
Institutions: University of Toronto, Sunnybrook Health Sciences Centre, Toronto, Canada, Sunnybrook Health Sciences Centre, Toronto, Canada, Sunnybrook Research Institute.
MicroRNAs (miRNAs) are single-stranded, 18–24 nucleotide long, non-coding RNA molecules. They are involved in virtually every cellular process including development1
, and cell cycle regulation3
. MiRNAs are estimated to regulate the expression of 30% to 90% of human genes4
by binding to their target messenger RNAs (mRNAs)5
. Widespread dysregulation of miRNAs has been reported in various diseases and cancer subtypes6
. Due to their prevalence and unique structure, these small molecules are likely to be the next generation of biomarkers, therapeutic agents and/or targets.
Methods used to investigate miRNA expression include SYBR green I dye- based as well as Taqman-probe based qPCR. If miRNAs are to be effectively used in the clinical setting, it is imperative that their detection in fresh and/or archived clinical samples be accurate, reproducible, and specific. qPCR has been widely used for validating expression of miRNAs in whole genome analyses such as microarray studies7
. The samples used in this protocol were from patients who underwent radical prostatectomy for clinically localized prostate cancer; however other tissues and cell lines can be substituted in. Prostate specimens were snap-frozen in liquid nitrogen after resection. Clinical variables and follow-up information for each patient were collected for subsequent analysis8
Quantification of miRNA levels in prostate tumor samples
. The main steps in qPCR analysis of tumors are: Total RNA extraction, cDNA synthesis, and detection of qPCR products using miRNA-specific primers. Total RNA, which includes mRNA, miRNA, and other small RNAs were extracted from specimens using TRIzol reagent. Qiagen's miScript System was used to synthesize cDNA and perform qPCR (Figure 1
). Endogenous miRNAs are not polyadenylated, therefore during the reverse transcription process, a poly(A) polymerase polyadenylates the miRNA. The miRNA is used as a template to synthesize cDNA using oligo-dT and Reverse Transcriptase. A universal tag sequence on the 5' end of oligo-dT primers facilitates the amplification of cDNA in the PCR step. PCR product amplification is detected by the level of fluorescence emitted by SYBR Green, a dye which intercalates into double stranded DNA. Specific miRNA primers, along with a Universal Primer that binds to the universal tag sequence will amplify specific miRNA sequences.
The miScript Primer Assays are available for over a thousand human-specific miRNAs, and hundreds of murine-specific miRNAs. Relative quantification method was used here to quantify the expression of miRNAs. To correct for variability amongst different samples, expression levels of a target miRNA is normalized to the expression levels of a reference gene. The choice of a gene on which to normalize the expression of targets is critical in relative quantification method of analysis. Examples of reference genes typically used in this capacity are the small RNAs RNU6B, RNU44, and RNU48 as they are considered to be stably expressed across most samples. In this protocol, RNU6B is used as the reference gene.
Cancer Biology, Issue 63, Medicine, cancer, primer assay, Prostate, microRNA, tumor, qPCR
Massively Parallel Reporter Assays in Cultured Mammalian Cells
Institutions: Broad Institute.
The genetic reporter assay is a well-established and powerful tool for dissecting the relationship between DNA sequences and their gene regulatory activities. The potential throughput of this assay has, however, been limited by the need to individually clone and assay the activity of each sequence on interest using protein fluorescence or enzymatic activity as a proxy for regulatory activity. Advances in high-throughput DNA synthesis and sequencing technologies have recently made it possible to overcome these limitations by multiplexing the construction and interrogation of large libraries of reporter constructs. This protocol describes implementation of a Massively Parallel Reporter Assay (MPRA) that allows direct comparison of hundreds of thousands of putative regulatory sequences in a single cell culture dish.
Genetics, Issue 90, gene regulation, transcriptional regulation, sequence-activity mapping, reporter assay, library cloning, transfection, tag sequencing, mammalian cells
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
Institutions: Washington University School of Medicine, Washington University School of Medicine, Washington University School of Medicine.
As DNA sequencing technology has markedly advanced in recent years2
, it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought3
. In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease4,5
. Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants6-8
. However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators.
To address this need, we have developed a pooled sequencing approach1,9
and a novel software package1
for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (https://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA.
Genetics, Issue 64, Genomics, Cancer Biology, Bioinformatics, Pooled DNA sequencing, SPLINTER, rare genetic variants, genetic screening, phenotype, high throughput, computational analysis, DNA, PCR, primers
Expression Analysis of Mammalian Linker-histone Subtypes
Institutions: Georgia Institute of Technology .
Linker histone H1 binds to the nucleosome core particle and linker DNA, facilitating folding of chromatin into higher order structure. H1 is essential for mammalian development1
and regulates specific gene expression in vivo2-4
. Among the highly conserved histone proteins, the family of H1 linker histones is the most heterogeneous group. There are 11 H1 subtypes in mammals that are differentially regulated during development and in different cell types. These H1 subtypes include 5 somatic H1s (H1a-e), the replacement H10
, 4 germ cell specific H1 subtypes, and H1x5
. The presence of multiple H1 subtypes that differ in DNA binding affinity and chromatin compaction ability6-9
provides an additional level of modulation of chromatin function. Thus, quantitative expression analysis of individual H1 subtypes, both of mRNA and proteins, is necessary for better understanding of the regulation of higher order chromatin structure and function.
Here we describe a set of assays designed for analyzing the expression levels of individual H1 subtypes (Figure 1
). mRNA expression of various H1 variant genes is measured by a set of highly sensitive and quantitative reverse transcription-PCR (qRT-PCR) assays, which are faster, more accurate and require much less samples compared with the alternative approach of Northern blot analysis. Unlike most other cellular mRNA messages, mRNAs for most histone genes, including the majority of H1 genes, lack a long polyA tail, but contain a stem-loop structure at the 3' untranslated region (UTR)10
. Therefore, cDNAs are prepared from total RNA by reverse transcription using random primers instead of oligo-dT primers. Realtime PCR assays with primers specific to each H1 subtypes (Table 1
) are performed to obtain highly quantitative measurement of mRNA levels of individual H1 subtypes. Expression of housekeeping genes are analyzed as controls for normalization.
The relative abundance of proteins of each H1 subtype and core histones is obtained through reverse phase high-performance liquid chromatography (RP-HPLC) analysis of total histones extracted from mammalian cells11-13
. The HPLC method and elution conditions described here give optimum separations of mouse H1 subtypes. By quantifying the HPLC profile, we calculate the relative proportion of individual H1 subtypes within H1 family, as well as determine the H1 to nucleosome ratio in the cells.
Genetics, Issue 61, H1 linker histones, histone H1 subtypes, chromatin, RT-PCR, HPLC, gene expression
Comprehensive Analysis of Transcription Dynamics from Brain Samples Following Behavioral Experience
Institutions: The Hebrew University of Jerusalem.
The encoding of experiences in the brain and the consolidation of long-term memories depend on gene transcription. Identifying the function of specific genes in encoding experience is one of the main objectives of molecular neuroscience. Furthermore, the functional association of defined genes with specific behaviors has implications for understanding the basis of neuropsychiatric disorders. Induction of robust transcription programs has been observed in the brains of mice following various behavioral manipulations. While some genetic elements are utilized recurrently following different behavioral manipulations and in different brain nuclei, transcriptional programs are overall unique to the inducing stimuli and the structure in which they are studied1,2
In this publication, a protocol is described for robust and comprehensive transcriptional profiling from brain nuclei of mice in response to behavioral manipulation. The protocol is demonstrated in the context of analysis of gene expression dynamics in the nucleus accumbens following acute cocaine experience. Subsequent to a defined in vivo
experience, the target neural tissue is dissected; followed by RNA purification, reverse transcription and utilization of microfluidic arrays for comprehensive qPCR analysis of multiple target genes. This protocol is geared towards comprehensive analysis (addressing 50-500 genes) of limiting quantities of starting material, such as small brain samples or even single cells.
The protocol is most advantageous for parallel analysis of multiple samples (e.g.
single cells, dynamic analysis following pharmaceutical, viral or behavioral perturbations). However, the protocol could also serve for the characterization and quality assurance of samples prior to whole-genome studies by microarrays or RNAseq, as well as validation of data obtained from whole-genome studies.
Behavior, Issue 90,
Brain, behavior, RNA, transcription, nucleus accumbens, cocaine, high-throughput qPCR, experience-dependent plasticity, gene regulatory networks, microdissection
Microarray-based Identification of Individual HERV Loci Expression: Application to Biomarker Discovery in Prostate Cancer
Institutions: Joint Unit Hospices de Lyon-bioMérieux, BioMérieux, Hospices Civils de Lyon, Lyon 1 University, BioMérieux, Hospices Civils de Lyon, Hospices Civils de Lyon.
The prostate-specific antigen (PSA) is the main diagnostic biomarker for prostate cancer in clinical use, but it lacks specificity and sensitivity, particularly in low dosage values1
. ‘How to use PSA' remains a current issue, either for diagnosis as a gray zone corresponding to a concentration in serum of 2.5-10 ng/ml which does not allow a clear differentiation to be made between cancer and noncancer2
or for patient follow-up as analysis of post-operative PSA kinetic parameters can pose considerable challenges for their practical application3,4
. Alternatively, noncoding RNAs (ncRNAs) are emerging as key molecules in human cancer, with the potential to serve as novel markers of disease, e.g.
PCA3 in prostate cancer5,6
and to reveal uncharacterized aspects of tumor biology. Moreover, data from the ENCODE project published in 2012 showed that different RNA types cover about 62% of the genome. It also appears that the amount of transcriptional regulatory motifs is at least 4.5x higher than the one corresponding to protein-coding exons. Thus, long terminal repeats (LTRs) of human endogenous retroviruses (HERVs) constitute a wide range of putative/candidate transcriptional regulatory sequences, as it is their primary function in infectious retroviruses. HERVs, which are spread throughout the human genome, originate from ancestral and independent infections within the germ line, followed by copy-paste propagation processes and leading to multicopy families occupying 8% of the human genome (note that exons span 2% of our genome). Some HERV loci still express proteins that have been associated with several pathologies including cancer7-10
. We have designed a high-density microarray, in Affymetrix format, aiming to optimally characterize individual HERV loci expression, in order to better understand whether they can be active, if they drive ncRNA transcription or modulate coding gene expression. This tool has been applied in the prostate cancer field (Figure 1
Medicine, Issue 81, Cancer Biology, Genetics, Molecular Biology, Prostate, Retroviridae, Biomarkers, Pharmacological, Tumor Markers, Biological, Prostatectomy, Microarray Analysis, Gene Expression, Diagnosis, Human Endogenous Retroviruses, HERV, microarray, Transcriptome, prostate cancer, Affymetrix
Development of Cell-type specific anti-HIV gp120 aptamers for siRNA delivery
Institutions: Beckman Research Institute of City of Hope, Beckman Research Institute of City of Hope, Beckman Research Institute of City of Hope.
The global epidemic of infection by HIV has created an urgent need for new classes of antiretroviral agents. The potent ability of small interfering (si)RNAs to inhibit the expression of complementary RNA transcripts is being exploited as a new class of therapeutics for a variety of diseases including HIV. Many previous reports have shown that novel RNAi-based anti-HIV/AIDS therapeutic strategies have considerable promise; however, a key obstacle to the successful therapeutic application and clinical translation of siRNAs is efficient delivery. Particularly, considering the safety and efficacy of RNAi-based therapeutics, it is highly desirable to develop a targeted intracellular siRNA delivery approach to specific cell populations or tissues. The HIV-1 gp120 protein, a glycoprotein envelope on the surface of HIV-1, plays an important role in viral entry into CD4 cells. The interaction of gp120 and CD4 that triggers HIV-1 entry and initiates cell fusion has been validated as a clinically relevant anti-viral strategy for drug discovery.
Herein, we firstly discuss the selection and identification of 2'-F modified anti-HIV gp120 RNA aptamers. Using a conventional nitrocellulose filter SELEX method, several new aptamers with nanomolar affinity were isolated from a 50 random nt RNA library. In order to successfully obtain bound species with higher affinity, the selection stringency is carefully controlled by adjusting the conditions. The selected aptamers can specifically bind and be rapidly internalized into cells expressing the HIV-1 envelope protein. Additionally, the aptamers alone can neutralize HIV-1 infectivity. Based upon the best aptamer A-1, we also create a novel dual inhibitory function anti-gp120 aptamer-siRNA chimera in which both the aptamer and the siRNA portions have potent anti-HIV activities. Further, we utilize the gp120 aptamer-siRNA chimeras for cell-type specific delivery of the siRNA into HIV-1 infected cells. This dual function chimera shows considerable potential for combining various nucleic acid therapeutic agents (aptamer and siRNA) in suppressing HIV-1 infection, making the aptamer-siRNA chimeras attractive therapeutic candidates for patients failing highly active antiretroviral therapy (HAART).
Immunology, Issue 52, SELEX (Systematic Evolution of Ligands by EXponential enrichment), RNA aptamer, HIV-1 gp120, RNAi (RNA interference), siRNA (small interfering RNA), cell-type specific delivery
Strategies for Study of Neuroprotection from Cold-preconditioning
Institutions: The University of Chicago Medical Center.
Neurological injury is a frequent cause of morbidity and mortality from general anesthesia and related surgical procedures that could be alleviated by development of effective, easy to administer and safe preconditioning treatments. We seek to define the neural immune signaling responsible for cold-preconditioning as means to identify novel targets for therapeutics development to protect brain before injury onset. Low-level pro-inflammatory mediator signaling changes over time are essential for cold-preconditioning neuroprotection. This signaling is consistent with the basic tenets of physiological conditioning hormesis, which require that irritative stimuli reach a threshold magnitude with sufficient time for adaptation to the stimuli for protection to become evident.
Accordingly, delineation of the immune signaling involved in cold-preconditioning neuroprotection requires that biological systems and experimental manipulations plus technical capacities are highly reproducible and sensitive. Our approach is to use hippocampal slice cultures as an in vitro
model that closely reflects their in vivo
counterparts with multi-synaptic neural networks influenced by mature and quiescent macroglia / microglia. This glial state is particularly important for microglia since they are the principal source of cytokines, which are operative in the femtomolar range. Also, slice cultures can be maintained in vitro
for several weeks, which is sufficient time to evoke activating stimuli and assess adaptive responses. Finally, environmental conditions can be accurately controlled using slice cultures so that cytokine signaling of cold-preconditioning can be measured, mimicked, and modulated to dissect the critical node aspects. Cytokine signaling system analyses require the use of sensitive and reproducible multiplexed techniques. We use quantitative PCR for TNF-α to screen for microglial activation followed by quantitative real-time qPCR array screening to assess tissue-wide cytokine changes. The latter is a most sensitive and reproducible means to measure multiple cytokine system signaling changes simultaneously. Significant changes are confirmed with targeted qPCR and then protein detection. We probe for tissue-based cytokine protein changes using multiplexed microsphere flow cytometric assays using Luminex technology. Cell-specific cytokine production is determined with double-label immunohistochemistry. Taken together, this brain tissue preparation and style of use, coupled to the suggested investigative strategies, may be an optimal approach for identifying potential targets for the development of novel therapeutics that could mimic the advantages of cold-preconditioning.
Neuroscience, Issue 43, innate immunity, hormesis, microglia, hippocampus, slice culture, immunohistochemistry, neural-immune, gene expression, real-time PCR
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g.
drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism1,2
. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence3
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recent published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin,"4
in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets to those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
RNA Secondary Structure Prediction Using High-throughput SHAPE
Institutions: Frederick National Laboratory for Cancer Research.
Understanding the function of RNA involved in biological processes requires a thorough knowledge of RNA structure. Toward this end, the methodology dubbed "high-throughput selective 2' hydroxyl acylation analyzed by primer extension", or SHAPE, allows prediction of RNA secondary structure with single nucleotide resolution. This approach utilizes chemical probing agents that preferentially acylate single stranded or flexible regions of RNA in aqueous solution. Sites of chemical modification are detected by reverse transcription of the modified RNA, and the products of this reaction are fractionated by automated capillary electrophoresis (CE). Since reverse transcriptase pauses at those RNA nucleotides modified by the SHAPE reagents, the resulting cDNA library indirectly maps those ribonucleotides that are single stranded in the context of the folded RNA. Using ShapeFinder software, the electropherograms produced by automated CE are processed and converted into nucleotide reactivity tables that are themselves converted into pseudo-energy constraints used in the RNAStructure (v5.3) prediction algorithm. The two-dimensional RNA structures obtained by combining SHAPE probing with in silico
RNA secondary structure prediction have been found to be far more accurate than structures obtained using either method alone.
Genetics, Issue 75, Molecular Biology, Biochemistry, Virology, Cancer Biology, Medicine, Genomics, Nucleic Acid Probes, RNA Probes, RNA, High-throughput SHAPE, Capillary electrophoresis, RNA structure, RNA probing, RNA folding, secondary structure, DNA, nucleic acids, electropherogram, synthesis, transcription, high throughput, sequencing
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSCs self-renewal and differentiation is important for application of HSCs for research and clinical uses. However, it is not possible to obtain large quantity of HSCs due to their inability to proliferate in vitro
. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro
and in vivo
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
Profiling of Estrogen-regulated MicroRNAs in Breast Cancer Cells
Institutions: University of Houston.
Estrogen plays vital roles in mammary gland development and breast cancer progression. It mediates its function by binding to and activating the estrogen receptors (ERs), ERα, and ERβ. ERα is frequently upregulated in breast cancer and drives the proliferation of breast cancer cells. The ERs function as transcription factors and regulate gene expression. Whereas ERα's regulation of protein-coding genes is well established, its regulation of noncoding microRNA (miRNA) is less explored. miRNAs play a major role in the post-transcriptional regulation of genes, inhibiting their translation or degrading their mRNA. miRNAs can function as oncogenes or tumor suppressors and are also promising biomarkers. Among the miRNA assays available, microarray and quantitative real-time polymerase chain reaction (qPCR) have been extensively used to detect and quantify miRNA levels. To identify miRNAs regulated by estrogen signaling in breast cancer, their expression in ERα-positive breast cancer cell lines were compared before and after estrogen-activation using both the µParaflo-microfluidic microarrays and Dual Labeled Probes-low density arrays. Results were validated using specific qPCR assays, applying both Cyanine dye-based and Dual Labeled Probes-based chemistry. Furthermore, a time-point assay was used to identify regulations over time. Advantages of the miRNA assay approach used in this study is that it enables a fast screening of mature miRNA regulations in numerous samples, even with limited sample amounts. The layout, including the specific conditions for cell culture and estrogen treatment, biological and technical replicates, and large-scale screening followed by in-depth confirmations using separate techniques, ensures a robust detection of miRNA regulations, and eliminates false positives and other artifacts. However, mutated or unknown miRNAs, or regulations at the primary and precursor transcript level, will not be detected. The method presented here represents a thorough investigation of estrogen-mediated miRNA regulation.
Medicine, Issue 84, breast cancer, microRNA, estrogen, estrogen receptor, microarray, qPCR
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Isolation of Fidelity Variants of RNA Viruses and Characterization of Virus Mutation Frequency
Institutions: Institut Pasteur .
RNA viruses use RNA dependent RNA polymerases to replicate their genomes. The intrinsically high error rate of these enzymes is a large contributor to the generation of extreme population diversity that facilitates virus adaptation and evolution. Increasing evidence shows that the intrinsic error rates, and the resulting mutation frequencies, of RNA viruses can be modulated by subtle amino acid changes to the viral polymerase. Although biochemical assays exist for some viral RNA polymerases that permit quantitative measure of incorporation fidelity, here we describe a simple method of measuring mutation frequencies of RNA viruses that has proven to be as accurate as biochemical approaches in identifying fidelity altering mutations. The approach uses conventional virological and sequencing techniques that can be performed in most biology laboratories. Based on our experience with a number of different viruses, we have identified the key steps that must be optimized to increase the likelihood of isolating fidelity variants and generating data of statistical significance. The isolation and characterization of fidelity altering mutations can provide new insights into polymerase structure and function1-3
. Furthermore, these fidelity variants can be useful tools in characterizing mechanisms of virus adaptation and evolution4-7
Immunology, Issue 52, Polymerase fidelity, RNA virus, mutation frequency, mutagen, RNA polymerase, viral evolution
Transcriptome Analysis of Single Cells
Institutions: University of Pennsylvania, University of Pennsylvania.
Many gene expression analysis techniques rely on material isolated from heterogeneous populations of cells from tissue homogenates or cells in culture.1,2,3
In the case of the brain, regions such as the hippocampus contain a complex arrangement of different cell types, each with distinct mRNA profiles. The ability to harvest single cells allows for a more in depth investigation into the molecular differences between and within cell populations. We describe a simple and rapid method for harvesting cells for further processing. Pipettes often used in electrophysiology are utilized to isolate (using aspiration) a cell of interest and conveniently deposit it into an Eppendorf tube for further processing with any number of molecular biology techniques. Our protocol can be modified for the harvest of dendrites from cell culture or even individual cells from acute slices.
We also describe the aRNA amplification method as a major downstream application of single cell isolations. This method was developed previously by our lab as an alternative to other gene expression analysis techniques such as reverse-transcription or real-time polymerase chain reaction (PCR).4,5,6,7,8
This technique provides for linear amplification of the polyadenylated RNA beginning with only femtograms of material and resulting in microgram amounts of antisense RNA. The linearly amplified material provides a more accurate estimation than PCR exponential amplification of the relative abundance of components of the transcriptome of the isolated cell. The basic procedure consists of two rounds of amplification. Briefly, a T7 RNA polymerase promoter site is incorporated into double stranded cDNA created from the mRNA transcripts. An overnight in vitro transcription (IVT) reaction is then performed in which T7 RNA polymerase produces many antisense transcripts from the double stranded cDNA. The second round repeats this process but with some technical differences since the starting material is antisense RNA. It is standard to repeat the second round, resulting in three rounds of amplification. Often, the third round in vitro transcription reaction is performed using biotinylated nucleoside triphosphates so that the antisense RNA produced can be hybridized and detected on a microarray.7,8
Neuroscience, Issue 50, single-cell, transcriptome, aRNA amplification, RT-PCR, molecular biology, gene expression
Modeling Neural Immune Signaling of Episodic and Chronic Migraine Using Spreading Depression In Vitro
Institutions: The University of Chicago Medical Center, The University of Chicago Medical Center.
Migraine and its transformation to chronic migraine are healthcare burdens in need of improved treatment options. We seek to define how neural immune signaling modulates the susceptibility to migraine, modeled in vitro
using spreading depression (SD), as a means to develop novel therapeutic targets for episodic and chronic migraine. SD is the likely cause of migraine aura and migraine pain. It is a paroxysmal loss of neuronal function triggered by initially increased neuronal activity, which slowly propagates within susceptible brain regions. Normal brain function is exquisitely sensitive to, and relies on, coincident low-level immune signaling. Thus, neural immune signaling likely affects electrical activity of SD, and therefore migraine. Pain perception studies of SD in whole animals are fraught with difficulties, but whole animals are well suited to examine systems biology aspects of migraine since SD activates trigeminal nociceptive pathways. However, whole animal studies alone cannot be used to decipher the cellular and neural circuit mechanisms of SD. Instead, in vitro
preparations where environmental conditions can be controlled are necessary. Here, it is important to recognize limitations of acute slices and distinct advantages of hippocampal slice cultures. Acute brain slices cannot reveal subtle changes in immune signaling since preparing the slices alone triggers: pro-inflammatory changes that last days, epileptiform behavior due to high levels of oxygen tension needed to vitalize the slices, and irreversible cell injury at anoxic slice centers.
In contrast, we examine immune signaling in mature hippocampal slice cultures since the cultures closely parallel their in vivo
counterpart with mature trisynaptic function; show quiescent astrocytes, microglia, and cytokine levels; and SD is easily induced in an unanesthetized preparation. Furthermore, the slices are long-lived and SD can be induced on consecutive days without injury, making this preparation the sole means to-date capable of modeling the neuroimmune consequences of chronic SD, and thus perhaps chronic migraine. We use electrophysiological techniques and non-invasive imaging to measure
neuronal cell and circuit functions coincident with SD. Neural immune gene expression variables are measured with qPCR screening, qPCR arrays, and, importantly, use of cDNA preamplification for detection of ultra-low level targets such as interferon-gamma using whole, regional, or specific cell enhanced (via laser dissection microscopy) sampling. Cytokine cascade signaling is further assessed with multiplexed phosphoprotein related targets with gene expression and phosphoprotein changes confirmed via cell-specific immunostaining. Pharmacological and siRNA strategies are used to mimic
SD immune signaling.
Neuroscience, Issue 52, innate immunity, hormesis, microglia, T-cells, hippocampus, slice culture, gene expression, laser dissection microscopy, real-time qPCR, interferon-gamma
Polymerase Chain Reaction: Basic Protocol Plus Troubleshooting and Optimization Strategies
Institutions: University of California, Los Angeles .
In the biological sciences there have been technological advances that catapult the discipline into golden ages of discovery. For example, the field of microbiology was transformed with the advent of Anton van Leeuwenhoek's microscope, which allowed scientists to visualize prokaryotes for the first time. The development of the polymerase chain reaction (PCR) is one of those innovations that changed the course of molecular science with its impact spanning countless subdisciplines in biology. The theoretical process was outlined by Keppe and coworkers in 1971; however, it was another 14 years until the complete PCR procedure was described and experimentally applied by Kary Mullis while at Cetus Corporation in 1985. Automation and refinement of this technique progressed with the introduction of a thermal stable DNA polymerase from the bacterium Thermus aquaticus
, consequently the name Taq
PCR is a powerful amplification technique that can generate an ample supply of a specific segment of DNA (i.e., an amplicon) from only a small amount of starting material (i.e., DNA template or target sequence). While straightforward and generally trouble-free, there are pitfalls that complicate the reaction producing spurious results. When PCR fails it can lead to many non-specific DNA products of varying sizes that appear as a ladder or smear of bands on agarose gels. Sometimes no products form at all. Another potential problem occurs when mutations are unintentionally introduced in the amplicons, resulting in a heterogeneous population of PCR products. PCR failures can become frustrating unless patience and careful troubleshooting are employed to sort out and solve the problem(s). This protocol outlines the basic principles of PCR, provides a methodology that will result in amplification of most target sequences, and presents strategies for optimizing a reaction. By following this PCR guide, students should be able to:
● Set up reactions and thermal cycling conditions for a conventional PCR experiment
● Understand the function of various reaction components and their overall effect on a PCR experiment
● Design and optimize a PCR experiment for any DNA template
● Troubleshoot failed PCR experiments
Basic Protocols, Issue 63, PCR, optimization, primer design, melting temperature, Tm, troubleshooting, additives, enhancers, template DNA quantification, thermal cycler, molecular biology, genetics
Increasing cDNA Yields from Single-cell Quantities of mRNA in Standard Laboratory Reverse Transcriptase Reactions using Acoustic Microstreaming
Institutions: University of Melbourne, CSIRO Materials Science and Engineering, Faculty of Engineering and Industrial Sciences.
Correlating gene expression with cell behavior is ideally done at the single-cell level. However, this is not easily achieved because the small amount of labile mRNA present in a single cell (1-5% of 1-50pg total RNA, or 0.01-2.5pg mRNA, per cell 1
) mostly degrades before it can be reverse transcribed into a stable cDNA copy. For example, using standard laboratory reagents and hardware, only a small number of genes can be qualitatively assessed per cell 2
. One way to increase the efficiency of standard laboratory reverse transcriptase (RT) reactions (i.e. standard reagents in microliter volumes) comprising single-cell amounts of mRNA would be to more rapidly mix the reagents so the mRNA can be converted to cDNA before it degrades. However this is not trivial because at microliter scales liquid flow is laminar, i.e. currently available methods of mixing (i.e. shaking, vortexing and trituration) fail to produce sufficient chaotic motion to effectively mix reagents. To solve this problem, micro-scale mixing techniques have to be used 3,4
. A number of microfluidic-based mixing technologies have been developed which successfully increase RT reaction yields 5-8
. However, microfluidics technologies require specialized hardware that is relatively expensive and not yet widely available. A cheaper, more convenient solution is desirable. The main objective of this study is to demonstrate how application of a novel "micromixing" technique to standard laboratory RT reactions comprising single-cell quantities of mRNA significantly increases their cDNA yields. We find cDNA yields increase by approximately 10-100-fold, which enables: (1) greater numbers of genes to be analyzed per cell; (2) more quantitative analysis of gene expression; and (3) better detection of low-abundance genes in single cells. The micromixing is based on acoustic microstreaming 9-12
, a phenomenon where sound waves propagating around a small obstacle create a mean flow near the obstacle. We have developed an acoustic microstreaming-based device ("micromixer") with a key simplification; acoustic microstreaming can be achieved at audio frequencies by ensuring the system has a liquid-air interface with a small radius of curvature 13
. The meniscus of a microliter volume of solution in a tube provides an appropriately small radius of curvature. The use of audio frequencies means that the hardware can be inexpensive and versatile 13
, and nucleic acids and other biochemical reagents are not damaged like they can be with standard laboratory sonicators.
Bioengineering, Issue 53, neuroscience, brain, cells, reverse transcription, qPCR, gene expression, acoustic microstreaming, micromixer, microfluidics
Analyzing Gene Expression from Marine Microbial Communities using Environmental Transcriptomics
Institutions: University of Georgia (UGA).
Analogous to metagenomics, environmental transcriptomics (metatranscriptomics) retrieves and sequences environmental mRNAs from a microbial assemblage without prior knowledge of what genes the community might be expressing. Thus it provides the most unbiased perspective on community gene expression in situ
. Environmental transcriptomics protocols are technically difficult since prokaryotic mRNAs generally lack the poly(A) tails that make isolation of eukaryotic messages relatively straightforward 1
and because of the relatively short half lives of mRNAs 2
. In addition, mRNAs are much less abundant than rRNAs in total RNA extracts, thus an rRNA background often overwhelms mRNA signals. However, techniques for overcoming some of these difficulties have recently been developed. A procedure for analyzing environmental transcriptomes by creating clone libraries using random primers to reverse-transcribe and amplify environmental mRNAs was recently described was successful in two different natural environments, but results were biased by selection of the random primers used to initiate cDNA synthesis 3
. Advances in linear amplification of mRNA obviate the need for random primers in the amplification step and make it possible to use less starting material decreasing the collection and processing time of samples and thereby minimizing RNA degradation 4
. In vitro
transcription methods for amplifying mRNA involve polyadenylating the mRNA and incorporating a T7 promoter onto the 3 end of the transcript. Amplified RNA (aRNA) can then be converted to double stranded cDNA using random hexamers and directly sequenced by pyrosequencing 5
. A first use of this method at Station ALOHA demonstrated its utility for characterizing microbial community gene expression 6
Microbiology, Issue 24, transcriptomics, bacterioplankton, mRNA, microbial communities, gene expression
In vitro Transcription and Capping of Gaussia Luciferase mRNA Followed by HeLa Cell Transfection
Institutions: New England Biolabs.
transcription is the synthesis of RNA transcripts by RNA polymerase from a linear DNA template containing the corresponding promoter sequence (T7, T3, SP6) and the gene to be transcribed (Figure 1A
). A typical transcription reaction consists of the template DNA, RNA polymerase, ribonucleotide triphosphates, RNase inhibitor and buffer containing Mg2+
Large amounts of high quality RNA are often required for a variety of applications. Use of in vitro
transcription has been reported for RNA structure and function studies such as splicing1
, RNAi experiments in mammalian cells2
, antisense RNA amplification by the "Eberwine method"3
, microarray analysis4
and for RNA vaccine studies5
. The technique can also be used for producing radiolabeled and dye labeled probes6
. Warren, et al.
recently reported reprogramming of human cells by transfection with in vitro
transcribed capped RNA7
. The T7 High Yield RNA Synthesis Kit from New England Biolabs has been designed to synthesize up to 180 μg RNA per 20 μl reaction. RNA of length up to 10kb has been successfully transcribed using this kit. Linearized plasmid DNA, PCR products and synthetic DNA oligonucleotides can be used as templates for transcription as long as they have the T7 promoter sequence upstream of the gene to be transcribed.
Addition of a 5' end cap structure to the RNA is an important process in eukaryotes. It is essential for RNA stability8
, efficient translation9
, nuclear transport10
. The process involves addition of a 7-methylguanosine cap at the 5' triphosphate end of the RNA. RNA capping can be carried out post-transcriptionally using capping enzymes or co-transcriptionally using cap analogs. In the enzymatic method, the mRNA is capped using the Vaccinia
virus capping enzyme12,13
. The enzyme adds on a 7-methylguanosine cap at the 5' end of the RNA using GTP and S-adenosyl methionine as donors (cap 0 structure). Both methods yield functionally active capped RNA suitable for transfection or other applications14
such as generating viral genomic RNA for reverse-genetic systems15
and crystallographic studies of cap binding proteins such as eIF4E16
In the method described below, the T7 High Yield RNA Synthesis Kit from NEB is used to synthesize capped and uncapped RNA transcripts of Gaussia
luciferase (GLuc) and Cypridina
luciferase (CLuc). A portion of the uncapped GLuc RNA is capped using the Vaccinia Capping System (NEB). A linearized plasmid containing the GLuc or CLuc gene and T7 promoter is used as the template DNA. The transcribed RNA is transfected into HeLa cells and cell culture supernatants are assayed for luciferase activity. Capped CLuc RNA is used as the internal control to normalize GLuc expression.
Genetics, Issue 61, In vitro transcription, Vaccinia capping enzyme, transfection, T7 RNA Polymerase, RNA synthesis
Bacterial Gene Expression Analysis Using Microarrays
Institutions: University of California Santa Cruz - UCSC.
Microbiology, issue 4, microbial community, gene expression, microarray, genome