Hematopoietic stem cells (HSCs) are used clinically in transplantation to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSC self-renewal and differentiation is important for the application of HSCs in research and clinical settings. However, it is not possible to obtain large quantities of HSCs due to their inability to proliferate in vitro. To overcome this hurdle, we used a mouse bone marrow-derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarrays for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified as potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro and in vivo.
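The final analysis step, picking out transcription factors that differ between Lin-CD34+ and Lin-CD34- cells, can be sketched roughly as follows. This is a minimal illustration and not the authors' pipeline: it assumes normalized expression values (for example FPKM) are already available per gene for each population, and the function names, the pseudocount, and the fold-change cutoff are all hypothetical choices. A real analysis would also use replicates and a statistical test.

```python
import math


def log2_fold_change(expr_a, expr_b, pseudocount=1.0):
    """Log2 ratio of two expression values; the pseudocount (an assumed
    choice) avoids division by zero for unexpressed genes."""
    return math.log2((expr_a + pseudocount) / (expr_b + pseudocount))


def rank_differential_tfs(expr_pos, expr_neg, tf_genes, min_abs_log2fc=1.0):
    """Return (gene, log2FC) pairs for transcription factors whose absolute
    log2 fold change meets the cutoff, largest change first.

    expr_pos / expr_neg: dicts mapping gene name -> normalized expression
    in the Lin-CD34+ and Lin-CD34- populations, respectively.
    tf_genes: iterable of gene names annotated as transcription factors.
    """
    ranked = []
    for gene in tf_genes:
        # Skip transcription factors missing from either expression table.
        if gene not in expr_pos or gene not in expr_neg:
            continue
        lfc = log2_fold_change(expr_pos[gene], expr_neg[gene])
        if abs(lfc) >= min_abs_log2fc:
            ranked.append((gene, lfc))
    ranked.sort(key=lambda item: abs(item[1]), reverse=True)
    return ranked
```

Genes at the top of the returned list would be the candidates to carry forward into functional analysis.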
26 Related JoVE Articles!
Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
Institutions: Memorial Sloan-Kettering Cancer Center.
Efforts to detect and investigate key oncogenic mutations have proven valuable to facilitate the appropriate treatment for cancer patients. The establishment of high-throughput, massively parallel "next-generation" sequencing has aided the discovery of many such mutations. To enhance the clinical and translational utility of this technology, platforms must be high-throughput, cost-effective, and compatible with formalin-fixed paraffin embedded (FFPE) tissue samples that may yield small amounts of degraded or damaged DNA. Here, we describe the preparation of barcoded and multiplexed DNA libraries followed by hybridization-based capture of targeted exons for the detection of cancer-associated mutations in fresh frozen and FFPE tumors by massively parallel sequencing. This method enables the identification of sequence mutations, copy number alterations, and select structural rearrangements involving all targeted genes. Targeted exon sequencing offers the benefits of high throughput, low cost, and deep sequence coverage, thus conferring high sensitivity for detecting low frequency mutations.
Molecular Biology, Issue 80, Molecular Diagnostic Techniques, High-Throughput Nucleotide Sequencing, Genetics, Neoplasms, Diagnosis, Massively parallel sequencing, targeted exon sequencing, hybridization capture, cancer, FFPE, DNA mutations
A Protocol for Computer-Based Protein Structure and Function Prediction
Institutions: University of Michigan, University of Kansas.
Genome sequencing projects have deciphered millions of protein sequences, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules, which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score that estimates how accurate the predictions are without knowledge of the experimental data. To accommodate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any protein as a template, or to exclude any template proteins during the structure assembly simulations. The structural information can be collected by users based on experimental evidence or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best program for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
Biochemistry, Issue 57, On-line server, I-TASSER, protein structure prediction, function prediction
Generation of Induced Pluripotent Stem Cells from Frozen Buffy Coats using Non-integrating Episomal Plasmids
Institutions: European Academy Bozen/Bolzano (EURAC), Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Sanford-Burnham Medical Research Institute.
Somatic cells can be reprogrammed into induced pluripotent stem cells (iPSCs) by forcing the expression of four transcription factors (Oct-4, Sox-2, Klf-4, and c-Myc), typically expressed by human embryonic stem cells (hESCs). Due to their similarity with hESCs, iPSCs have become an important tool for potential patient-specific regenerative medicine, avoiding ethical issues associated with hESCs. In order to obtain cells suitable for clinical application, transgene-free iPSCs need to be generated to avoid transgene reactivation, altered gene expression and misguided differentiation. Moreover, a highly efficient and inexpensive reprogramming method is necessary to derive sufficient iPSCs for therapeutic purposes. Given this need, an efficient non-integrating episomal plasmid approach is the preferable choice for iPSC derivation. Currently, the most common cell type used for reprogramming is the fibroblast, the isolation of which requires tissue biopsy, an invasive surgical procedure for the patient. Human peripheral blood, by contrast, represents the most accessible and least invasive tissue for iPSC generation.
In this study, a cost-effective and viral-free protocol using non-integrating episomal plasmids is reported for the generation of iPSCs from human peripheral blood mononuclear cells (PBMNCs) obtained from frozen buffy coats after whole blood centrifugation and without density gradient separation.
Developmental Biology, Issue 100, Stem cell biology, cellular biology, molecular biology, induced pluripotent stem cells, peripheral blood mononuclear cells, reprogramming, episomal plasmids.
Isolation and Characterization of Neutrophils with Anti-Tumor Properties
Institutions: Hebrew University Medical School, Hadassah-Hebrew University Medical Center.
Neutrophils, the most abundant of all white blood cells in the human circulation, play an important role in the host defense against invading microorganisms. In addition, neutrophils play a central role in the immune surveillance of tumor cells. They have the ability to recognize tumor cells and induce tumor cell death either through a cell contact-dependent mechanism involving hydrogen peroxide or through antibody-dependent cell-mediated cytotoxicity (ADCC). Neutrophils with anti-tumor activity can be isolated from peripheral blood of cancer patients and of tumor-bearing mice. These neutrophils are termed tumor-entrained neutrophils (TEN) to distinguish them from neutrophils of healthy subjects or naïve mice that show no significant tumor cytotoxic activity. Compared with other white blood cells, neutrophils show different buoyancy, making it feasible to obtain a >98% pure neutrophil population when subjected to a density gradient. However, in addition to the normal high-density neutrophil population (HDN), in cancer patients, in tumor-bearing mice, as well as under chronic inflammatory conditions, distinct low-density neutrophil populations (LDN) appear in the circulation. LDN co-purify with the mononuclear fraction and can be separated from mononuclear cells using either positive or negative selection strategies. Once the purity of the isolated neutrophils is determined by flow cytometry, they can be used for in vitro and in vivo functional assays. We describe techniques for monitoring the anti-tumor activity of neutrophils, their ability to migrate and to produce reactive oxygen species, as well as monitoring their phagocytic capacity ex vivo. We further describe techniques to label the neutrophils for in vivo tracking, and to determine their anti-metastatic capacity in vivo. All these techniques are essential for understanding how to obtain and characterize neutrophils with anti-tumor function.
Immunology, Issue 100, Neutrophil isolation, tumor-entrained neutrophils, high-density neutrophils, low-density neutrophils, anti-tumor cytotoxicity, BrdU labeling, CFSE labeling, luciferase assay, neutrophil depletion, anti-metastatic activity, lung metastatic seeding assay, neutrophil adoptive transfer.
Development of an in vitro model system for studying the interaction of Equus caballus IgE with its high-affinity receptor FcεRI
Institutions: King Abdulaziz University, The University of Sheffield.
The interaction of IgE with its high-affinity Fc receptor (FcεRI) followed by an antigenic challenge is the principal pathway in IgE mediated allergic reactions. As a consequence of the high affinity binding between IgE and FcεRI, along with the continuous production of IgE by B cells, allergies usually persist throughout life, with currently no permanent cure available. Horses, especially race horses, which are commonly inbred, are a species of mammals that are very prone to the development of hypersensitivity responses, which can seriously affect their performance. Physiological responses to allergic sensitization in horses mirror those observed in humans and dogs. In this paper we describe the development of an in situ assay system for the quantitative assessment of the release of mediators of the allergic response pertaining to the equine system. To this end, the gene encoding equine FcεRIα was transfected into and expressed onto the surface of parental Rat Basophil Leukemia (RBL-2H3.1) cells. The gene product of the transfected equine α-chain formed a functional receptor complex with the endogenous rat β- and γ-chains [1]. The resultant assay system facilitated an assessment of the quantity of mediator secreted from equine FcεRIα transfected RBL-2H3.1 cells following sensitization with equine IgE and antigenic challenge using β-hexosaminidase release as a readout [2, 3]. Mediator release peaked at 36.68% ± 4.88% at 100 ng ml⁻¹ of antigen. This assay was modified from previous assays used to study human and canine allergic responses [4, 5]. We have also shown that this type of assay system has multiple applications for the development of diagnostic tools and the safety assessment of potential therapeutic intervention strategies in allergic disease [2, 3, 6].
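A release readout such as the reported 36.68% ± 4.88% is typically a percentage of total cellular enzyme activity, summarized across replicates. The sketch below shows one common way to compute it. It is an assumption rather than the authors' stated formula: it takes enzyme activity measured in the supernatant and in the lysate of the remaining cells, subtracts an optional blank, and reports released activity as a share of the total. All function and parameter names are hypothetical.

```python
import statistics


def percent_release(supernatant, lysate, blank=0.0):
    """Percentage of total enzyme activity found in the supernatant.

    supernatant: activity reading (e.g. absorbance) of the released fraction.
    lysate: activity reading of the fraction retained in the cells.
    blank: background reading subtracted from both (assumed optional).
    """
    released = supernatant - blank
    retained = lysate - blank
    total = released + retained
    if total <= 0:
        raise ValueError("total activity must be positive after blank subtraction")
    return 100.0 * released / total


def summarize_replicates(values):
    """Mean and sample standard deviation of replicate percentages."""
    return statistics.mean(values), statistics.stdev(values)
```

Plotting the replicate mean against antigen concentration would then give a dose-response curve whose peak corresponds to the maximal release reported above.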
Immunology, Issue 93, Allergy, Immunology, IgE, FcεRI, horse (Equus caballus), Immunoassay
Isolation of Precursor B-cell Subsets from Umbilical Cord Blood
Institutions: University of Missouri-Columbia.
Umbilical cord blood is highly enriched for hematopoietic progenitor cells at different lineage commitment stages. We have developed a protocol for isolating precursor B-cells at four different stages of differentiation. Because genes are expressed and epigenetic modifications occur in a tissue specific manner, it is vital to discriminate between tissues and cell types in order to be able to identify alterations in the genome and the epigenome that may lead to the development of disease. This method can be adapted to any type of cell present in umbilical cord blood at any stage of differentiation.
This method comprises four main steps. First, mononuclear cells are separated by density centrifugation. Second, B-cells are enriched using biotin-conjugated antibodies that recognize and remove non-B-cells from the mononuclear cells. Third, the B-cells are fluorescently labeled with cell surface protein antibodies specific to individual stages of B-cell development. Finally, the fluorescently labeled cells are sorted and individual populations are recovered. The recovered cells are of sufficient quantity and quality to be utilized in downstream nucleic acid assays.
Immunology, Issue 74, Cellular Biology, Molecular Biology, Genetics, Medicine, Biomedical Engineering, Anatomy, Physiology, Neoplasms, Precursor B-cells, B cells, Umbilical cord blood, Cell sorting, DNA methylation, Tissue specific expression, labeling, enrichment, isolation, blood, tissue, cells, flow cytometry
Generation of Human Induced Pluripotent Stem Cells from Peripheral Blood Using the STEMCCA Lentiviral Vector
Institutions: Boston University School of Medicine, Children's Hospital of Philadelphia.
Through the ectopic expression of four transcription factors, Oct4, Klf4, Sox2 and cMyc, human somatic cells can be converted to a pluripotent state, generating so-called induced pluripotent stem cells (iPSCs) [1-4]. Patient-specific iPSCs lack the ethical concerns that surround embryonic stem cells (ESCs) and would bypass possible immune rejection. Thus, iPSCs have attracted considerable attention for disease modeling studies, the screening of pharmacological compounds, and regenerative therapies [5].
We have shown the generation of transgene-free human iPSCs from patients with different lung diseases using a single excisable polycistronic lentiviral Stem Cell Cassette (STEMCCA) encoding the Yamanaka factors [6]. These iPSC lines were generated from skin fibroblasts, the most common cell type used for reprogramming. Normally, obtaining fibroblasts requires a skin punch biopsy followed by expansion of the cells in culture for a few passages. Importantly, a number of groups have reported the reprogramming of human peripheral blood cells into iPSCs [7-9]. In one study, a Tet inducible version of the STEMCCA vector was employed [9], which required the blood cells to be simultaneously infected with a constitutively active lentivirus encoding the reverse tetracycline transactivator. In contrast to fibroblasts, peripheral blood cells can be collected via minimally invasive procedures, greatly reducing the discomfort and distress of the patient. A simple and effective protocol for reprogramming blood cells using a constitutive single excisable vector may accelerate the application of iPSC technology by making it accessible to a broader research community. Furthermore, reprogramming of peripheral blood cells allows for the generation of iPSCs from individuals in which skin biopsies should be avoided (i.e., aberrant scarring) or due to pre-existing disease conditions preventing access to punch biopsies.
Here we demonstrate a protocol for the generation of human iPSCs from peripheral blood mononuclear cells (PBMCs) using a single floxed-excisable lentiviral vector constitutively expressing the 4 factors. Freshly collected or thawed PBMCs are expanded for 9 days as described [10, 11] in medium containing ascorbic acid, SCF, IGF-1, IL-3 and EPO before being transduced with the STEMCCA lentivirus. Cells are then plated onto MEFs and ESC-like colonies can be visualized two weeks after infection. Finally, selected clones are expanded and tested for the expression of the pluripotency markers SSEA-4, Tra-1-60 and Tra-1-81. This protocol is simple, robust and highly consistent, providing a reliable methodology for the generation of human iPSCs from a readily accessible 4 ml of blood.
Stem Cell Biology, Issue 68, Induced pluripotent stem cells (iPSCs), peripheral blood mononuclear cells (PBMCs), reprogramming, single excisable lentiviral vector, STEMCCA
Methods to Evaluate Cytotoxicity and Immunosuppression of Combustible Tobacco Product Preparations
Institutions: Wake Forest University Health Sciences, R.J. Reynolds Tobacco Company.
Among other pathophysiological changes, chronic exposure to cigarette smoke causes inflammation and immune suppression, which have been linked to increased susceptibility of smokers to microbial infections and tumor incidence. Ex vivo suppression of receptor-mediated immune responses in human peripheral blood mononuclear cells (PBMCs) treated with smoke constituents is an attractive approach to study mechanisms and evaluate the likely long-term effects of exposure to tobacco products. Here, we optimized methods to perform ex vivo assays using PBMCs stimulated by bacterial lipopolysaccharide, a Toll-like receptor-4 ligand. The effects of whole smoke-conditioned medium (WS-CM), a combustible tobacco product preparation (TPP), and nicotine were investigated on cytokine secretion and target cell killing by PBMCs in the ex vivo assays. We show that secreted cytokines IFN-γ, TNF, IL-10, IL-6, and IL-8 and intracellular cytokines IFN-γ, TNF-α, and MIP-1α were suppressed in WS-CM-exposed PBMCs. The cytolytic function of effector PBMCs, as determined by a K562 target cell killing assay, was also reduced by exposure to WS-CM; nicotine was minimally effective in these assays. In summary, we present a set of improved assays to evaluate the effects of TPPs in ex vivo assays, and these methods could be readily adapted for testing other products of interest.
Immunology, Issue 95, Tobacco product preparation, whole smoke-conditioned medium, human peripheral blood mononuclear cells, PBMC, lipopolysaccharide, cell death, secreted cytokines, intracellular cytokines, K562 killing assay.
Femoral Bone Marrow Aspiration in Live Mice
Institutions: Memorial Sloan-Kettering Cancer Center.
Serial sampling of the cellular composition of bone marrow (BM) is a routine procedure critical to clinical hematology. This protocol describes a detailed step-by-step technical procedure for an analogous procedure in live mice which allows for serial characterization of cells present in the BM. This procedure facilitates studies aimed to detect the presence of exogenously administered cells within the BM of mice as would be done in xenograft studies for instance. Moreover, this procedure allows for the retrieval and characterization of cells enriched in the BM such as hematopoietic stem and progenitor cells (HSPCs) without sacrifice of mice. Given that the cellular composition of peripheral blood is not necessarily reflective of proportions and types of stem and progenitor cells present in the marrow, procedures which provide access to this compartment without requiring termination of the mice are very helpful. The use of femoral bone marrow aspiration is illustrated here for cytological analysis of marrow cells, flow cytometric characterization of the hematopoietic stem/progenitor compartment, and culture of sorted HSPCs obtained by femoral BM aspiration compared with conventional marrow harvest.
Medicine, Issue 89, Bone marrow, Leukemia, Hematopoiesis, Aspiration, Mouse Model, Hematopoietic Stem Cell
Collection, Isolation, and Flow Cytometric Analysis of Human Endocervical Samples
Institutions: University of Manitoba.
Despite the public health importance of mucosal pathogens (including HIV), relatively little is known about mucosal immunity, particularly at the female genital tract (FGT). Because heterosexual transmission now represents the dominant mechanism of HIV transmission, and given the continual spread of sexually transmitted infections (STIs), it is critical to understand the interplay between host and pathogen at the genital mucosa. The substantial gaps in knowledge around FGT immunity are partially due to the difficulty in successfully collecting and processing mucosal samples. In order to facilitate studies with sufficient sample size, collection techniques must be minimally invasive and efficient. To this end, a protocol for the collection of cervical cytobrush samples and subsequent isolation of cervical mononuclear cells (CMC) has been optimized. Using ex vivo flow cytometry-based immunophenotyping, it is possible to accurately and reliably quantify CMC lymphocyte/monocyte population frequencies and phenotypes. This technique can be coupled with the collection of cervical-vaginal lavage (CVL), which contains soluble immune mediators including cytokines, chemokines and anti-proteases, all of which can be used to determine the anti- or pro-inflammatory environment in the vagina.
Medicine, Issue 89, mucosal, immunology, FGT, lavage, cervical, CMC
Bioenergetics and the Oxidative Burst: Protocols for the Isolation and Evaluation of Human Leukocytes and Platelets
Institutions: University of Alabama at Birmingham.
Mitochondrial dysfunction is known to play a significant role in a number of pathological conditions such as atherosclerosis, diabetes, septic shock, and neurodegenerative diseases but assessing changes in bioenergetic function in patients is challenging. Although diseases such as diabetes or atherosclerosis present clinically with specific organ impairment, the systemic components of the pathology, such as hyperglycemia or inflammation, can alter bioenergetic function in circulating leukocytes or platelets. This concept has been recognized for some time but its widespread application has been constrained by the large number of primary cells needed for bioenergetic analysis. This technical limitation has been overcome by combining the specificity of the magnetic bead isolation techniques, cell adhesion techniques, which allow cells to be attached without activation to microplates, and the sensitivity of new technologies designed for high throughput microplate respirometry. An example of this equipment is the extracellular flux analyzer. Such instrumentation typically uses oxygen and pH sensitive probes to measure rates of change in these parameters in adherent cells, which can then be related to metabolism. Here we detail the methods for the isolation and plating of monocytes, lymphocytes, neutrophils and platelets, without activation, from human blood and the analysis of mitochondrial bioenergetic function in these cells. In addition, we demonstrate how the oxidative burst in monocytes and neutrophils can also be measured in the same samples. Since these methods use only 8-20 ml human blood they have potential for monitoring reactive oxygen species generation and bioenergetics in a clinical setting.
Immunology, Issue 85, bioenergetics, translational, mitochondria, oxidative stress, reserve capacity, leukocytes
Profiling Individual Human Embryonic Stem Cells by Quantitative RT-PCR
Institutions: Johns Hopkins University School of Medicine.
Heterogeneity of stem cell populations hampers detailed understanding of stem cell biology, such as their differentiation propensity toward different lineages. A single cell transcriptome assay can be a new approach for dissecting individual variation. We have developed a single cell qRT-PCR method, and confirmed that this method works well for several gene expression profiles. At the single cell level, each human embryonic stem cell, sorted as an OCT4::EGFP positive cell, has high expression of OCT4, but a different level of NANOG expression. Our single cell gene expression assay should be useful to interrogate population heterogeneities.
Molecular Biology, Issue 87, Single cell, heterogeneity, Amplification, qRT-PCR, Reverse transcriptase, human Embryonic Stem cell, FACS
An Allele-specific Gene Expression Assay to Test the Functional Basis of Genetic Associations
Institutions: University of Oxford.
The number of significant genetic associations with common complex traits is constantly increasing. However, most of these associations have not been understood at the molecular level. One of the mechanisms mediating the effect of DNA variants on phenotypes is gene expression, which has been shown to be particularly relevant for complex traits [1].
This method tests in a cellular context the effect of specific DNA sequences on gene expression. The principle is to measure the relative abundance of transcripts arising from the two alleles of a gene, analysing cells which carry one copy of the DNA sequences associated with disease (the risk variants) [2, 3]. Therefore, the cells used for this method should meet two fundamental genotypic requirements: they have to be heterozygous both for DNA risk variants and for DNA markers, typically coding polymorphisms, which can distinguish transcripts based on their chromosomal origin (Figure 1). DNA risk variants and DNA markers do not need to have the same allele frequency but the phase (haplotypic) relationship of the genetic markers needs to be understood. It is also important to choose cell types which express the gene of interest. This protocol refers specifically to the procedure adopted to extract nucleic acids from fibroblasts but the method is equally applicable to other cell types including primary cells.
DNA and RNA are extracted from the selected cell lines and cDNA is generated. DNA and cDNA are analysed with a primer extension assay, designed to target the coding DNA markers [4]. The primer extension assay is carried out using the MassARRAY (Sequenom) [5] platform according to the manufacturer's specifications. Primer extension products are then analysed by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF/MS). Because the selected markers are heterozygous they will generate two peaks on the MS profiles. The area of each peak is proportional to the transcript abundance and can be measured with a function of the MassARRAY Typer software to generate an allelic ratio (allele 1: allele 2) calculation. The allelic ratio obtained for cDNA is normalized using that measured from genomic DNA, where the allelic ratio is expected to be 1:1, to correct for technical artifacts. Markers with a normalised allelic ratio significantly different to 1 indicate that the amount of transcript generated from the two chromosomes in the same cell is different, suggesting that the DNA variants associated with the phenotype have an effect on gene expression. Experimental controls should be used to confirm the results.
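The normalization described here reduces to a ratio of ratios, which a short sketch can make concrete. The code below is illustrative only: it assumes the two peak areas per marker have already been exported from the instrument software as plain numbers, and the function names are hypothetical rather than part of any MassARRAY tool. Deciding whether a normalized ratio is significantly different from 1 would additionally need replicate measurements and a statistical test, which are not shown.

```python
def allelic_ratio(area_allele1, area_allele2):
    """Ratio of peak areas, allele 1 : allele 2."""
    if area_allele2 <= 0:
        raise ValueError("allele 2 peak area must be positive")
    return area_allele1 / area_allele2


def normalized_allelic_ratio(cdna_areas, gdna_areas):
    """Divide the cDNA allelic ratio by the genomic DNA allelic ratio.

    cdna_areas, gdna_areas: (allele1_area, allele2_area) tuples.
    Genomic DNA is expected to give 1:1, so any deviation there is treated
    as technical bias and divided out of the cDNA measurement.
    """
    gdna_ratio = allelic_ratio(*gdna_areas)
    if gdna_ratio <= 0:
        raise ValueError("genomic DNA allelic ratio must be positive")
    return allelic_ratio(*cdna_areas) / gdna_ratio
```

For example, cDNA peak areas of (300, 100) with genomic DNA areas of (150, 100) give a normalized ratio of 2.0, meaning allele 1 transcripts are about twice as abundant once the technical bias is removed.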
Cellular Biology, Issue 45, Gene expression, regulatory variant, haplotype, association study, primer extension, MALDI-TOF mass spectrometry, single nucleotide polymorphism, allele-specific
Physiological Recordings and RNA Sequencing of the Gustatory Appendages of the Yellow-fever Mosquito Aedes aegypti
Institutions: United States Department of Agriculture.
Electrophysiological recording of action potentials from sensory neurons of mosquitoes provides investigators a glimpse into the chemical perception of these disease vectors. We have recently identified a bitter sensing neuron in the labellum of female Aedes aegypti that responds to DEET and other repellents, as well as bitter quinine, through direct electrophysiological investigation. These gustatory receptor neuron responses prompted our sequencing of total mRNA from both male and female labella and tarsi samples to elucidate the putative chemoreception genes expressed in these contact chemoreception tissues. Samples of tarsi were divided into pro-, meso- and metathoracic subtypes for both sexes. We then validated our dataset by conducting qRT-PCR on the same tissue samples and used statistical methods to compare results between the two methods. Studies addressing molecular function may now target specific genes to determine those involved in repellent perception by mosquitoes. These receptor pathways may be used to screen novel repellents towards disruption of host-seeking behavior to curb the spread of harmful viruses.
Molecular Biology, Issue 94, Gustation, insect, Aedes aegypti, electrophysiology, mosquito, RNA-seq, qRT-PCR, taste, chemosensory
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g., drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism [1, 2]. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence [3].
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recent published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin" [4], in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets to those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord
Institutions: Universidade de São Paulo.
In ovo electroporation of the chick neural tube is a fast and inexpensive method for identification of gene function during neural development. Genome wide analysis of differentially expressed transcripts after such an experimental manipulation has the potential to uncover an almost complete picture of the downstream effects caused by the transfected construct. This work describes a simple method for comparing transcriptomes from samples of transfected embryonic spinal cords comprising all steps between electroporation and identification of differentially expressed transcripts. The first stage consists of guidelines for electroporation and instructions for dissection of transfected spinal cord halves from HH23 embryos in a ribonuclease-free environment and extraction of high-quality RNA samples suitable for transcriptome sequencing. The next stage is that of bioinformatic analysis with general guidelines for filtering and comparison of RNA-Seq datasets in the Galaxy public server, which eliminates the need for a local computational structure for small to medium scale experiments. The representative results show that the dissection methods generate high quality RNA samples and that the transcriptomes obtained from two control samples are essentially the same, an important requirement for detection of differentially expressed genes in experimental samples. Furthermore, one example is provided where experimental overexpression of a DNA construct can be visually verified after comparison with control samples. The application of this method may be a powerful tool to facilitate new discoveries on the function of neural factors involved in early spinal cord development.
Developmental Biology, Issue 93, chicken embryo, in ovo electroporation, spinal cord, RNA-Seq, transcriptome profiling, Galaxy workflow
Metabolic Labeling of Newly Transcribed RNA for High Resolution Gene Expression Profiling of RNA Synthesis, Processing and Decay in Cell Culture
Institutions: Max von Pettenkofer Institute, University of Cambridge, Ludwig-Maximilians-University Munich.
The development of whole-transcriptome microarrays and next-generation sequencing has revolutionized our understanding of the complexity of cellular gene expression. Along with a better understanding of the involved molecular mechanisms, precise measurements of the underlying kinetics have become increasingly important. Here, these powerful methodologies face major limitations due to intrinsic properties of the template samples they study, i.e. total cellular RNA. In many cases changes in total cellular RNA occur either too slowly or too quickly to represent the underlying molecular events and their kinetics with sufficient resolution. In addition, the contributions of alterations in RNA synthesis, processing, and decay are not readily differentiated.
We recently developed high-resolution gene expression profiling to overcome these limitations. Our approach is based on metabolic labeling of newly transcribed RNA with 4-thiouridine (thus also referred to as 4sU-tagging) followed by rigorous purification of newly transcribed RNA using thiol-specific biotinylation and streptavidin-coated magnetic beads. It is applicable to a broad range of organisms including vertebrates, Drosophila, and yeast. We successfully applied 4sU-tagging to study real-time kinetics of transcription factor activities, provide precise measurements of RNA half-lives, and obtain novel insights into the kinetics of RNA processing. Finally, computational modeling can be employed to generate an integrated, comprehensive analysis of the underlying molecular mechanisms.
Genetics, Issue 78, Cellular Biology, Molecular Biology, Microbiology, Biochemistry, Eukaryota, Investigative Techniques, Biological Phenomena, Gene expression profiling, RNA synthesis, RNA processing, RNA decay, 4-thiouridine, 4sU-tagging, microarray analysis, RNA-seq, RNA, DNA, PCR, sequencing
Purifying the Impure: Sequencing Metagenomes and Metatranscriptomes from Complex Animal-associated Samples
Institutions: San Diego State University, DOE Joint Genome Institute, University of Colorado, University of Colorado.
The accessibility of high-throughput sequencing has revolutionized many fields of biology. In order to better understand host-associated viral and microbial communities, a comprehensive workflow for DNA and RNA extraction was developed. The workflow concurrently generates viral and microbial metagenomes, as well as metatranscriptomes, from a single sample for next-generation sequencing. The coupling of these approaches provides an overview of both the taxonomical characteristics and the community-encoded functions. The presented methods use Cystic Fibrosis (CF) sputum, a problematic sample type because it is exceptionally viscous and contains high amounts of mucins, free neutrophil DNA, and other unknown contaminants. The protocols described here target these problems and successfully recover viral and microbial DNA with minimal human DNA contamination. To complement the metagenomics studies, a metatranscriptomics protocol was optimized to recover both microbial and host mRNA that contains relatively few ribosomal RNA (rRNA) sequences. An overview of the data characteristics is presented to serve as a reference for assessing the success of the methods. Additional CF sputum samples were also collected to (i) evaluate the consistency of the microbiome profiles across seven consecutive days within a single patient, and (ii) compare the consistency of the metagenomic approach with that of 16S ribosomal RNA gene-based sequencing. The results showed that daily fluctuation of microbial profiles without antibiotic perturbation was minimal and the taxonomy profiles of the common CF-associated bacteria were highly similar between the 16S rDNA libraries and metagenomes generated from the hypotonic lysis (HL)-derived DNA. However, the differences between 16S rDNA taxonomical profiles generated from total DNA and HL-derived DNA suggest that hypotonic lysis and the washing steps are beneficial in removing not only the human-derived DNA, but also microbial-derived extracellular DNA that may misrepresent the actual microbial profiles.
Molecular Biology, Issue 94, virome, microbiome, metagenomics, metatranscriptomics, cystic fibrosis, mucosal-surface
Phage Phenomics: Physiological Approaches to Characterize Novel Viral Proteins
Institutions: San Diego State University, San Diego State University, San Diego State University, San Diego State University, San Diego State University, Argonne National Laboratory, Broad Institute.
Current investigations into phage-host interactions are dependent on extrapolating knowledge from (meta)genomes. Interestingly, 60-95% of all phage sequences share no homology to currently annotated proteins. As a result, a large proportion of phage genes are annotated as hypothetical. This reality heavily affects the annotation of both structural and auxiliary metabolic genes. Here we present phenomic methods designed to capture the physiological response(s) of a selected host during expression of one of these unknown phage genes. Multi-phenotype Assay Plates (MAPs) are used to monitor the diversity of host substrate utilization and subsequent biomass formation, while metabolomics provides by-product analysis by monitoring metabolite abundance and diversity. Both tools are used simultaneously to provide a phenotypic profile associated with expression of a single putative phage open reading frame (ORF). Representative results for both methods are compared, highlighting the phenotypic profile differences of a host carrying either putative structural or metabolic phage genes. In addition, the visualization techniques and high throughput computational pipelines that facilitated experimental analysis are presented.
Immunology, Issue 100, phenomics, phage, viral metagenome, Multi-phenotype Assay Plates (MAPs), continuous culture, metabolomics
Enhanced Reduced Representation Bisulfite Sequencing for Assessment of DNA Methylation at Base Pair Resolution
Institutions: Weill Cornell Medical College, Weill Cornell Medical College, Weill Cornell Medical College, University of Michigan.
DNA methylation pattern mapping is heavily studied in normal and diseased tissues. A variety of methods have been established to interrogate the cytosine methylation patterns in cells. Reduced representation of whole genome bisulfite sequencing was developed to detect quantitative base pair resolution cytosine methylation patterns at GC-rich genomic loci. This is accomplished by combining the use of a restriction enzyme followed by bisulfite conversion. Enhanced Reduced Representation Bisulfite Sequencing (ERRBS) increases the biologically relevant genomic loci covered and has been used to profile cytosine methylation in DNA from human, mouse and other organisms. ERRBS initiates with restriction enzyme digestion of DNA to generate low molecular weight fragments for use in library preparation. These fragments are subjected to standard library construction for next generation sequencing. Bisulfite conversion of unmethylated cytosines prior to the final amplification step allows for quantitative base resolution of cytosine methylation levels in covered genomic loci. The protocol can be completed within four days. Despite low complexity in the first three bases sequenced, ERRBS libraries yield high quality data when using a designated sequencing control lane. Mapping and bioinformatics analysis is then performed and yields data that can be easily integrated with a variety of genome-wide platforms. ERRBS can utilize small input material quantities making it feasible to process human clinical samples and applicable in a range of research applications. The video produced demonstrates critical steps of the ERRBS protocol.
Genetics, Issue 96, Epigenetics, bisulfite sequencing, DNA methylation, genomic DNA, 5-methylcytosine, high-throughput
Profiling of Estrogen-regulated MicroRNAs in Breast Cancer Cells
Institutions: University of Houston.
Estrogen plays vital roles in mammary gland development and breast cancer progression. It mediates its function by binding to and activating the estrogen receptors (ERs), ERα and ERβ. ERα is frequently upregulated in breast cancer and drives the proliferation of breast cancer cells. The ERs function as transcription factors and regulate gene expression. Whereas ERα's regulation of protein-coding genes is well established, its regulation of noncoding microRNA (miRNA) is less explored. miRNAs play a major role in the post-transcriptional regulation of genes, inhibiting their translation or degrading their mRNA. miRNAs can function as oncogenes or tumor suppressors and are also promising biomarkers. Among the miRNA assays available, microarray and quantitative real-time polymerase chain reaction (qPCR) have been extensively used to detect and quantify miRNA levels. To identify miRNAs regulated by estrogen signaling in breast cancer, their expression in ERα-positive breast cancer cell lines was compared before and after estrogen activation using both the µParaflo-microfluidic microarrays and Dual Labeled Probes-low density arrays. Results were validated using specific qPCR assays, applying both Cyanine dye-based and Dual Labeled Probes-based chemistry. Furthermore, a time-point assay was used to identify regulations over time. An advantage of the miRNA assay approach used in this study is that it enables fast screening of mature miRNA regulations in numerous samples, even with limited sample amounts. The layout, including the specific conditions for cell culture and estrogen treatment, biological and technical replicates, and large-scale screening followed by in-depth confirmations using separate techniques, ensures a robust detection of miRNA regulations, and eliminates false positives and other artifacts. However, mutated or unknown miRNAs, or regulations at the primary and precursor transcript level, will not be detected. The method presented here represents a thorough investigation of estrogen-mediated miRNA regulation.
Medicine, Issue 84, breast cancer, microRNA, estrogen, estrogen receptor, microarray, qPCR
Genome-wide Snapshot of Chromatin Regulators and States in Xenopus Embryos by ChIP-Seq
Institutions: MRC National Institute for Medical Research.
The recruitment of chromatin regulators and the assignment of chromatin states to specific genomic loci are pivotal to cell fate decisions and tissue and organ formation during development. Determining the locations and levels of such chromatin features in vivo will provide valuable information about the spatio-temporal regulation of genomic elements, and will support aspirations to mimic embryonic tissue development in vitro. The most commonly used method for genome-wide and high-resolution profiling is chromatin immunoprecipitation followed by next-generation sequencing (ChIP-Seq). This protocol outlines how yolk-rich embryos such as those of the frog Xenopus can be processed for ChIP-Seq experiments, and it offers simple command lines for post-sequencing analysis. Because of the high efficiency with which the protocol extracts nuclei from formaldehyde-fixed tissue, the method allows easy upscaling to obtain enough ChIP material for genome-wide profiling. Our protocol has been used successfully to map various DNA-binding proteins such as transcription factors, signaling mediators, components of the transcription machinery, chromatin modifiers and post-translational histone modifications, and for this to be done at various stages of embryogenesis. Lastly, this protocol should be widely applicable to other model and non-model organisms as more and more genome assemblies become available.
Developmental Biology, Issue 96, Chromatin immunoprecipitation, next-generation sequencing, ChIP-Seq, developmental biology, Xenopus embryos, cross-linking, transcription factor, post-sequencing analysis, DNA occupancy, metagene, binding motif, GO term
Microarray-based Identification of Individual HERV Loci Expression: Application to Biomarker Discovery in Prostate Cancer
Institutions: Joint Unit Hospices de Lyon-bioMérieux, BioMérieux, Hospices Civils de Lyon, Lyon 1 University, BioMérieux, Hospices Civils de Lyon, Hospices Civils de Lyon.
The prostate-specific antigen (PSA) is the main diagnostic biomarker for prostate cancer in clinical use, but it lacks specificity and sensitivity, particularly in low dosage values [1]. 'How to use PSA' remains a current issue, either for diagnosis, as a gray zone corresponding to a serum concentration of 2.5-10 ng/ml does not allow a clear differentiation to be made between cancer and noncancer [2], or for patient follow-up, as analysis of post-operative PSA kinetic parameters can pose considerable challenges for their practical application [3,4]. Alternatively, noncoding RNAs (ncRNAs) are emerging as key molecules in human cancer, with the potential to serve as novel markers of disease, e.g. PCA3 in prostate cancer [5,6], and to reveal uncharacterized aspects of tumor biology. Moreover, data from the ENCODE project published in 2012 showed that different RNA types cover about 62% of the genome. It also appears that the number of transcriptional regulatory motifs is at least 4.5x higher than that corresponding to protein-coding exons. Thus, long terminal repeats (LTRs) of human endogenous retroviruses (HERVs) constitute a wide range of putative/candidate transcriptional regulatory sequences, as it is their primary function in infectious retroviruses. HERVs, which are spread throughout the human genome, originate from ancestral and independent infections within the germ line, followed by copy-paste propagation processes and leading to multicopy families occupying 8% of the human genome (note that exons span 2% of our genome). Some HERV loci still express proteins that have been associated with several pathologies including cancer [7-10]. We have designed a high-density microarray, in Affymetrix format, aiming to optimally characterize individual HERV loci expression, in order to better understand whether they can be active, if they drive ncRNA transcription or modulate coding gene expression. This tool has been applied in the prostate cancer field (Figure 1
Medicine, Issue 81, Cancer Biology, Genetics, Molecular Biology, Prostate, Retroviridae, Biomarkers, Pharmacological, Tumor Markers, Biological, Prostatectomy, Microarray Analysis, Gene Expression, Diagnosis, Human Endogenous Retroviruses, HERV, microarray, Transcriptome, prostate cancer, Affymetrix
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness [1] and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant, relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them [2]. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, there are the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age-related increase in paternal de novo mutations. This is the case for autism and schizophrenia [3]. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo mutations would more frequently come from males, particularly older males [4]. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complex diseases, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification of de novo mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Single Read and Paired End mRNA-Seq Illumina Libraries from 10 Nanograms Total RNA
Institutions: Morgridge Institute for Research, University of Wisconsin, University of California.
Whole transcriptome sequencing by mRNA-Seq is now used extensively to perform global gene expression, mutation, allele-specific expression and other genome-wide analyses. mRNA-Seq even opens the gate for gene expression analysis of non-sequenced genomes. mRNA-Seq offers high sensitivity, a large dynamic range and allows measurement of transcript copy numbers in a sample. Illumina's genome analyzer performs sequencing of a large number (> 10^7) of relatively short sequence reads (< 150 bp). The "paired end" approach, wherein a single long read is sequenced at both its ends, allows for tracking alternate splice junctions, insertions and deletions, and is useful for de novo
One of the major challenges faced by researchers is a limited amount of starting material. For example, in experiments where cells are harvested by laser micro-dissection, available starting total RNA may measure in nanograms. Preparation of mRNA-Seq libraries from such samples has been described [1, 2] but involves significant PCR amplification that may introduce bias. Other RNA-Seq library construction procedures with minimal PCR amplification have been published [3, 4] but require microgram amounts of starting total RNA.
Here we describe a library preparation protocol for mRNA-Seq on the Illumina Genome Analyzer II platform that avoids significant PCR amplification and requires only 10 nanograms of total RNA. While this protocol has been described previously and validated for single-end sequencing [5], where it was shown to produce directional libraries without introducing significant amplification bias, here we validate it further for use as a paired end protocol. We selectively amplify polyadenylated messenger RNAs from starting total RNA using the T7 based Eberwine linear amplification method, coined "T7LA" (T7 linear amplification). The amplified poly-A mRNAs are fragmented, reverse transcribed and adapter ligated to produce the final sequencing library. For both single read and paired end runs, sequences are mapped to the human transcriptome [6] and normalized so that data from multiple runs can be compared. We report the gene expression measurement in units of transcripts per million (TPM), which is a superior measure to RPKM when comparing samples [7].
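The difference between TPM and RPKM mentioned above can be illustrated with a minimal Python sketch. It uses the commonly cited definitions of the two units rather than the authors' own pipeline; the function names and the toy read counts and transcript lengths are assumptions made purely for illustration.

```python
# Illustrative sketch only: textbook definitions of RPKM and TPM.
# Function names and toy numbers are hypothetical, not from the protocol.

def rpkm(counts, lengths_bp):
    """Reads per kilobase of transcript per million mapped reads."""
    total_reads = sum(counts)
    return [c * 1e9 / (total_reads * length)
            for c, length in zip(counts, lengths_bp)]

def tpm(counts, lengths_bp):
    """Transcripts per million: length-normalize first, then rescale to 1e6."""
    rates = [c / length for c, length in zip(counts, lengths_bp)]
    scale = sum(rates)
    return [r * 1e6 / scale for r in rates]

# Two toy samples measured over the same three transcripts.
lengths = [1000, 2000, 3000]
sample_a = [100, 200, 300]
sample_b = [10, 10, 1000]

# TPM values always sum to one million, so a value is directly a proportion
# of the sample; RPKM totals vary from sample to sample.
print(sum(tpm(sample_a, lengths)), sum(tpm(sample_b, lengths)))
print(sum(rpkm(sample_a, lengths)), sum(rpkm(sample_b, lengths)))
```

Because every sample's TPM values share the same total, a given TPM value represents the same fraction of the transcript pool in each sample, which is one common argument for preferring it in between-sample comparisons.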
Molecular Biology, Issue 56, Genetics, mRNA-Seq, Illumina-Seq, gene expression profiling, high throughput sequencing
Simultaneous Quantification of T-Cell Receptor Excision Circles (TRECs) and K-Deleting Recombination Excision Circles (KRECs) by Real-time PCR
Institutions: Spedali Civili di Brescia.
T-cell receptor excision circles (TRECs) and K-deleting recombination excision circles (KRECs) are circularized DNA elements formed during the recombination process that creates T- and B-cell receptors. Because TRECs and KRECs are unable to replicate, they are diluted after each cell division, and therefore persist in the cell. Their quantity in peripheral blood can be considered as an estimation of thymic and bone marrow output. By combining the well-established and commonly used TREC assay with a modified version of the KREC assay, we have developed a duplex quantitative real-time PCR that allows quantification of both newly produced T and B lymphocytes in a single assay. The number of TRECs and KRECs is obtained using a standard curve prepared by serially diluting TREC and KREC signal joints cloned in a bacterial plasmid, together with a fragment of the T-cell receptor alpha constant gene that serves as reference gene. Results are reported as number of TRECs and KRECs per 10^6 cells or per ml of blood. The quantification of these DNA fragments has proven useful for monitoring immune reconstitution following bone marrow transplantation in both children and adults, for improved characterization of immune deficiencies, or for better understanding of certain immunomodulating drug activity.
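The standard-curve quantification described above can be sketched in Python as follows. This is a generic qPCR standard-curve calculation, not the authors' exact procedure: the function names, the dilution series, and the assumption of two reference-gene copies per cell are all hypothetical and labeled as such in the code.

```python
import math

# Illustrative sketch only: generic qPCR standard-curve quantification.
# All names and numbers are hypothetical, not taken from the protocol.

def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate copy number from a Ct value."""
    return 10 ** ((ct - intercept) / slope)

def per_million_cells(target_copies, reference_copies, reference_copies_per_cell=2):
    """Normalize target copies to 10^6 cells.

    Assumption: the reference gene is present at two copies per cell.
    """
    cells = reference_copies / reference_copies_per_cell
    return target_copies / cells * 1e6

# Hypothetical tenfold plasmid dilution series and measured Ct values.
standards = [1e6, 1e5, 1e4, 1e3, 1e2]
cts = [20.08, 23.40, 26.72, 30.04, 33.36]
slope, intercept = fit_standard_curve(standards, cts)

trec_copies = copies_from_ct(30.04, slope, intercept)
reference_copies = copies_from_ct(23.40, slope, intercept)
print(per_million_cells(trec_copies, reference_copies))
```

In practice each target (TREC, KREC, and the reference gene) would have its own curve fitted from its own dilution series; a single curve is reused here only to keep the sketch short.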
Immunology, Issue 94, B lymphocytes, primary immunodeficiency, real-time PCR, immune recovery, T-cell homeostasis, T lymphocytes, thymic output, bone marrow output