In vivo methods such as ChIP-chip are well-established techniques used to determine global gene targets for transcription factors. However, they are of limited use in exploring bacterial two-component regulatory systems with uncharacterized activation conditions. Such systems regulate transcription only when activated in the presence of unique signals. Since these signals are often unknown, the in vitro microarray-based method described in this video article can be used to determine gene targets and binding sites for response regulators. This DNA-affinity-purified-chip method may be used for any purified regulator in any organism with a sequenced genome. The protocol involves allowing the purified tagged protein to bind to sheared genomic DNA and then affinity purifying the protein-bound DNA, followed by fluorescent labeling of the DNA and hybridization to a custom tiling array. Preceding steps that may be used to optimize the assay for specific regulators are also described. The peaks generated by the array data analysis are used to predict binding site motifs, which are then experimentally validated. The motif predictions can be further used to determine gene targets of orthologous response regulators in closely related species. We demonstrate the applicability of this method by determining the gene targets and binding site motifs, and thus predicting the function, for a sigma54-dependent response regulator, DVU3023, in the environmental bacterium Desulfovibrio vulgaris Hildenborough.
24 Related JoVE Articles!
High-throughput Functional Screening using a Homemade Dual-glow Luciferase Assay
Institutions: Massachusetts General Hospital.
We present a rapid and inexpensive high-throughput screening protocol to identify transcriptional regulators of alpha-synuclein, a gene associated with Parkinson's disease. 293T cells are transiently transfected with plasmids from an arrayed ORF expression library, together with luciferase reporter plasmids, in a one-gene-per-well microplate format. Firefly luciferase activity is assayed after 48 hr to determine the effects of each library gene upon alpha-synuclein transcription, normalized to expression from an internal control construct (a hCMV promoter directing Renilla luciferase). This protocol is facilitated by a bench-top robot enclosed in a biosafety cabinet, which performs aseptic liquid handling in 96-well format. Our automated transfection protocol is readily adaptable to high-throughput lentiviral library production or other functional screening protocols requiring triple-transfections of large numbers of unique library plasmids in conjunction with a common set of helper plasmids. We also present an inexpensive and validated alternative to commercially-available, dual luciferase reagents which employs PTC124, EDTA, and pyrophosphate to suppress firefly luciferase activity prior to measurement of Renilla luciferase. Using these methods, we screened 7,670 human genes and identified 68 regulators of alpha-synuclein. This protocol is easily modifiable to target other genes of interest.
Cellular Biology, Issue 88, Luciferases, Gene Transfer Techniques, Transfection, High-Throughput Screening Assays, Transfections, Robotics
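The normalization step described in the abstract above (firefly signal divided by the Renilla internal control) can be sketched as follows. This is a minimal illustration, not the authors' analysis pipeline: the well names and luminescence counts are hypothetical, and expressing each well relative to the plate median is an assumption made here for illustration.

```python
from statistics import median


def normalized_ratios(firefly, renilla):
    """Return the firefly/Renilla ratio per well.

    Wells with a missing or non-positive Renilla reading are skipped,
    since the internal control cannot normalize them.
    """
    return {
        well: firefly[well] / renilla[well]
        for well in firefly
        if renilla.get(well, 0) > 0
    }


def fold_change_vs_plate_median(ratios):
    """Express each well's ratio relative to the plate median ratio.

    Using the plate median as the baseline is an assumption of this sketch.
    """
    plate_median = median(ratios.values())
    return {well: ratio / plate_median for well, ratio in ratios.items()}


# Hypothetical raw luminescence counts for four wells (illustrative only).
firefly = {"A1": 1200.0, "A2": 4800.0, "A3": 1000.0, "A4": 1100.0}
renilla = {"A1": 600.0, "A2": 600.0, "A3": 500.0, "A4": 550.0}

ratios = normalized_ratios(firefly, renilla)
fold = fold_change_vs_plate_median(ratios)
```

In this toy plate, well A2 stands out with a normalized signal four times the plate median, which is the kind of well a screen would flag for follow-up.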
Identification of Novel Genes Associated with Alginate Production in Pseudomonas aeruginosa Using Mini-himar1 Mariner Transposon-mediated Mutagenesis
Institutions: Marshall University.
Pseudomonas aeruginosa is a Gram-negative, environmental bacterium with versatile metabolic capabilities. P. aeruginosa is an opportunistic bacterial pathogen which establishes chronic pulmonary infections in patients with cystic fibrosis (CF). The overproduction of a capsular polysaccharide called alginate, also known as mucoidy, promotes the formation of mucoid biofilms which are more resistant than planktonic cells to antibiotic chemotherapy and host defenses. Additionally, the conversion from the nonmucoid to mucoid phenotype is a clinical marker for the onset of chronic infection in CF. Alginate overproduction by P. aeruginosa is an endergonic process which heavily taxes cellular energy. Therefore, alginate production is highly regulated in P. aeruginosa. To better understand alginate regulation, we describe a protocol using mini-himar1 transposon mutagenesis for the identification of novel alginate regulators in the prototypic strain PAO1. The procedure consists of two basic steps. First, we transferred the mini-himar1 transposon (pFAC) from host E. coli SM10/λpir into recipient P. aeruginosa via biparental conjugation to create a high-density insertion mutant library, which was selected on Pseudomonas isolation agar plates supplemented with gentamycin. Second, we screened and isolated the mucoid colonies to map the insertion site through inverse PCR, using DNA primers pointing outward from the gentamycin cassette, and DNA sequencing. Using this protocol, we have identified two novel alginate regulators, mucE (PA4033) and kinB (PA5484), in strain PAO1 with a wild-type mucA encoding the anti-sigma factor MucA for the master alginate regulator AlgU (AlgT, σ22). This high-throughput mutagenesis protocol can be modified for the identification of other virulence-related genes causing change in colony morphology.
Immunology, Issue 85, Pseudomonas aeruginosa, alginate, mucoidy, mutagenesis, mini-himar1 mariner transposon, pFAC
Iterative Optimization of DNA Duplexes for Crystallization of SeqA-DNA Complexes
Institutions: McMaster University.
SeqA is a negative regulator of DNA replication that prevents premature reinitiation events by sequestering hemimethylated GATC clusters within the origin of replication [1]. Beyond the origin, SeqA is found at the replication forks, where it organizes newly replicated DNA into higher ordered structures [2]. SeqA associates only weakly with single GATC sequences, but it forms high affinity complexes with DNA duplexes containing multiple GATC sites. The minimal functional and structural unit of SeqA is a dimer, thereby explaining the requirement of at least two GATC sequences to form a high-affinity complex with hemimethylated DNA [3]. Additionally, the SeqA architecture, with the oligomerization and DNA-binding domains separated by a flexible linker, allows binding to GATC repeats separated by up to three helical turns. Therefore, understanding the function of SeqA at a molecular level requires the structural analysis of SeqA bound to multiple GATC sequences. In protein-DNA crystallization, the DNA can have anywhere from no effect to an exceptional effect on the packing interactions, depending on the relative sizes and architecture of the protein and the DNA. If the protein is larger than the DNA or footprints most of the DNA, the crystal packing is primarily mediated by protein-protein interactions. Conversely, when the protein is the same size as or smaller than the DNA, or covers only a fraction of the DNA, DNA-DNA and DNA-protein interactions dominate crystal packing. Therefore, crystallization of protein-DNA complexes requires the systematic screening of DNA length [4] and DNA ends (blunt or overhang) [5-7]. In this report, we describe how to design, optimize, purify and crystallize hemimethylated DNA duplexes containing tandem GATC repeats in complex with a dimeric variant of SeqA (SeqAΔ(41-59)-A25R) to obtain crystals suitable for structure determination.
Structural Biology, Issue 69, SeqA, DNA replication, DNA purification, protein-DNA complexes, protein-DNA cocrystallization, X-ray crystallography
Identifying DNA Mutations in Purified Hematopoietic Stem/Progenitor Cells
Institutions: UT Health Science Center at San Antonio.
In recent years, it has become apparent that genomic instability is tightly related to many developmental disorders, cancers, and aging. Given that stem cells are responsible for ensuring tissue homeostasis and repair throughout life, it is reasonable to hypothesize that the stem cell population is critical for preserving genomic integrity of tissues. Therefore, significant interest has arisen in assessing the impact of endogenous and environmental factors on genomic integrity in stem cells and their progeny, aiming to understand the etiology of stem-cell based diseases.
LacI transgenic mice carry a recoverable λ phage vector encoding the LacI reporter system, in which the LacI gene serves as the mutation reporter. The result of a mutated LacI gene is the production of β-galactosidase that cleaves a chromogenic substrate, turning it blue. The LacI reporter system is carried in all cells, including stem/progenitor cells, and can easily be recovered and used to subsequently infect E. coli. After incubating infected E. coli on agarose that contains the correct substrate, plaques can be scored; blue plaques indicate a mutant LacI gene, while clear plaques harbor wild-type. The frequency of blue (among clear) plaques indicates the mutant frequency in the original cell population from which the DNA was extracted. Sequencing the mutant LacI gene will show the location of the mutations in the gene and the type of mutation.
The LacI transgenic mouse model is well-established as an in vivo mutagenesis assay. Moreover, the mice and the reagents for the assay are commercially available. Here we describe in detail how this model can be adapted to measure the frequency of spontaneously occurring DNA mutants in stem cell-enriched Lin- (LSK) cells and other subpopulations of the hematopoietic system.
Infection, Issue 84, In vivo mutagenesis, hematopoietic stem/progenitor cells, LacI mouse model, DNA mutations, E. coli
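The plaque-scoring arithmetic described in the abstract above can be written as a short calculation. This is a minimal sketch: the function name and the plaque counts are hypothetical, and it assumes the mutant frequency is simply the number of blue plaques divided by all plaques scored (blue plus clear).

```python
def mutant_frequency(blue_plaques, clear_plaques):
    """Fraction of scored plaques that are blue (mutant LacI).

    Assumes every scored plaque is either blue (mutant) or clear (wild-type).
    """
    total = blue_plaques + clear_plaques
    if total <= 0:
        raise ValueError("at least one plaque must be scored")
    return blue_plaques / total


# Hypothetical counts, for illustration only: 6 blue plaques among 300,000 scored.
frequency = mutant_frequency(blue_plaques=6, clear_plaques=299_994)
print(f"mutant frequency: {frequency:.1e}")
```

Because spontaneous mutants are rare, the denominator in practice has to be large for the estimate to be stable, which is why the total number of plaques scored matters as much as the number of blue ones.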
Combining Single-molecule Manipulation and Imaging for the Study of Protein-DNA Interactions
Institutions: University of Florence, University of Oxford, National Institute of Optics-National Research Council, Italy, International Center of Computational Neurophotonics.
The paper describes the combination of optical tweezers and single-molecule fluorescence detection for the study of protein-DNA interactions. The method offers the opportunity to investigate interactions occurring in solution (thus avoiding problems due to nearby surfaces, as in other single-molecule methods), to control the DNA extension, and to track interaction dynamics as a function of both mechanical parameters and DNA sequence. The methods for establishing successful optical trapping and nanometer localization of single molecules are illustrated. We illustrate the experimental conditions allowing the study of the interaction of lactose repressor (LacI), labeled with Atto532, with a DNA molecule containing specific target sequences (operators) for LacI binding. The method allows the observation of specific interactions at the operators, as well as one-dimensional diffusion of the protein during the process of target search. The method is broadly applicable to the study of protein-DNA interactions but also to molecular motors, where control of the tension applied to the partner track polymer (for example actin or microtubules) is desirable.
Bioengineering, Issue 90, Single molecule biophysics, Optical tweezers, fluorescence microscopy, DNA binding proteins, lactose repressor, microfluidics
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g. drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism [1,2]. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence [3].
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recent published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin" [4], in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets for those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSC self-renewal and differentiation is important for the application of HSCs in research and clinical uses. However, it is not possible to obtain large quantities of HSCs due to their inability to proliferate in vitro. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro and in vivo.
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
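The comparison step in the abstract above (finding transcription factors that differ between Lin-CD34+ and Lin-CD34- cells) can be illustrated with a log2 fold-change ranking. This is only a sketch under stated assumptions: the gene labels and expression values are hypothetical, the pseudocount of 1 is a common convention chosen here rather than a value from the source, and a real analysis would add a statistical test of significance rather than ranking on fold change alone.

```python
import math


def log2_fold_change(expr_a, expr_b, pseudocount=1.0):
    """log2 ratio of two expression values; the pseudocount avoids division by zero."""
    return math.log2((expr_a + pseudocount) / (expr_b + pseudocount))


def rank_by_abs_log2fc(cd34_pos, cd34_neg):
    """Rank genes present in both samples by absolute log2 fold change, largest first."""
    scores = {
        gene: log2_fold_change(cd34_pos[gene], cd34_neg[gene])
        for gene in cd34_pos
        if gene in cd34_neg
    }
    return sorted(scores.items(), key=lambda item: abs(item[1]), reverse=True)


# Hypothetical normalized expression values for three placeholder genes.
cd34_pos = {"TF_A": 31.0, "TF_B": 7.0, "TF_C": 0.0}
cd34_neg = {"TF_A": 3.0, "TF_B": 7.0, "TF_C": 15.0}

ranking = rank_by_abs_log2fc(cd34_pos, cd34_neg)
for gene, lfc in ranking:
    print(f"{gene}\t{lfc:+.2f}")
```

Positive values indicate higher expression in the Lin-CD34+ sample and negative values higher expression in Lin-CD34-; genes at the top of the ranking would be the first candidates to check for significance.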
Preparation of Primary Myogenic Precursor Cell/Myoblast Cultures from Basal Vertebrate Lineages
Institutions: University of Alabama at Birmingham, INRA UR1067, INRA UR1037.
Due to the inherent difficulty and time involved with studying the myogenic program in vivo, primary culture systems derived from the resident adult stem cells of skeletal muscle, the myogenic precursor cells (MPCs), have proven indispensable to our understanding of mammalian skeletal muscle development and growth. Particularly among the basal taxa of Vertebrata, however, data are limited describing the molecular mechanisms controlling the self-renewal, proliferation, and differentiation of MPCs. Of particular interest are potential mechanisms that underlie the ability of basal vertebrates to undergo considerable postlarval skeletal myofiber hyperplasia (i.e. teleost fish) and full regeneration following appendage loss (i.e. urodele amphibians). Additionally, the use of cultured myoblasts could aid in the understanding of regeneration and the recapitulation of the myogenic program and the differences between them. To this end, we describe in detail a robust and efficient protocol (and variations therein) for isolating and maintaining MPCs and their progeny, myoblasts and immature myotubes, in cell culture as a platform for understanding the evolution of the myogenic program, beginning with the more basal vertebrates. Capitalizing on the model organism status of the zebrafish (Danio rerio), we report on the application of this protocol to small fishes of the cyprinid clade Danioninae. In tandem, this protocol can be utilized to realize a broader comparative approach by isolating MPCs from the Mexican axolotl (Ambystoma mexicanum) and even laboratory rodents. This protocol is now widely used in studying myogenesis in several fish species, including rainbow trout, salmon, and sea bream [1-4].
Basic Protocol, Issue 86, myogenesis, zebrafish, myoblast, cell culture, giant danio, moustached danio, myotubes, proliferation, differentiation, Danioninae, axolotl
Fluorescence-based Monitoring of PAD4 Activity via a Pro-fluorescence Substrate Analog
Institutions: Lehigh University.
Post-translational modifications may lead to altered protein functional states by increasing the covalent variations on the side chains of many protein substrates. The histone tails represent one of the most heavily modified stretches within all human proteins. Peptidyl-arginine deiminase 4 (PAD4) has been shown to convert arginine residues into the non-genetically encoded citrulline residue. Few assays described to date have been operationally facile with satisfactory sensitivity. Thus, the lack of adequate assays has likely contributed to the absence of potent non-covalent PAD4 inhibitors. Herein a novel fluorescence-based assay that allows for the monitoring of PAD4 activity is described. A pro-fluorescent substrate analog was designed to link PAD4 enzymatic activity to fluorescence liberation upon the addition of the protease trypsin. It was shown that the assay is compatible with high-throughput screening conditions and has a strong signal-to-noise ratio. Furthermore, the assay can also be performed with crude cell lysates containing over-expressed PAD4.
Chemistry, Issue 93, PAD4, PADI4, citrullination, arginine, post-translational modification, HTS, assay, fluorescence, citrulline
Engineering and Evolution of Synthetic Adeno-Associated Virus (AAV) Gene Therapy Vectors via DNA Family Shuffling
Institutions: Heidelberg University.
Adeno-associated viral (AAV) vectors represent some of the most potent and promising vehicles for therapeutic human gene transfer due to a unique combination of beneficial properties [1]. These include the apathogenicity of the underlying wildtype viruses and the highly advanced methodologies for production of high-titer, high-purity and clinical-grade recombinant vectors [2]. A further particular advantage of the AAV system over other viruses is the availability of a wealth of naturally occurring serotypes which differ in essential properties yet can all be easily engineered as vectors using a common protocol [1,2]. Moreover, a number of groups including our own have recently devised strategies to use these natural viruses as templates for the creation of synthetic vectors which either combine the assets of multiple input serotypes, or which enhance the properties of a single isolate. The respective technologies to achieve these goals are either DNA family shuffling [3], i.e. fragmentation of various AAV capsid genes followed by their re-assembly based on partial homologies (typically >80% for most AAV serotypes), or peptide display [4,5], i.e. insertion of usually seven amino acids into an exposed loop of the viral capsid where the peptide ideally mediates re-targeting to a desired cell type. For maximum success, both methods are applied in a high-throughput fashion whereby the protocols are up-scaled to yield libraries of around one million distinct capsid variants. Each clone then comprises a unique combination of numerous parental viruses (DNA shuffling approach) or contains a distinctive peptide within the same viral backbone (peptide display approach). The subsequent final step is iterative selection of such a library on target cells in order to enrich for individual capsids fulfilling most or ideally all requirements of the selection process. The latter preferably combines positive pressure, such as growth on a certain cell type of interest, with negative selection, for instance elimination of all capsids reacting with anti-AAV antibodies. This combination increases chances that synthetic capsids surviving the selection match the needs of the given application in a manner that would probably not have been found in any naturally occurring AAV isolate. Here, we focus on the DNA family shuffling method as the theoretically and experimentally more challenging of the two technologies. We describe and demonstrate all essential steps for the generation and selection of shuffled AAV libraries (Fig. 1), and then discuss the pitfalls and critical aspects of the protocols that one needs to be aware of in order to succeed with molecular AAV evolution.
Immunology, Issue 62, Adeno-associated virus, AAV, gene therapy, synthetic biology, viral vector, molecular evolution, DNA shuffling
Investigating Protein-protein Interactions in Live Cells Using Bioluminescence Resonance Energy Transfer
Institutions: Max Planck Institute for Psycholinguistics, Donders Institute for Brain, Cognition and Behaviour.
Assays based on Bioluminescence Resonance Energy Transfer (BRET) provide a sensitive and reliable means to monitor protein-protein interactions in live cells. BRET is the non-radiative transfer of energy from a 'donor' luciferase enzyme to an 'acceptor' fluorescent protein. In the most common configuration of this assay, the donor is Renilla reniformis luciferase and the acceptor is Yellow Fluorescent Protein (YFP). Because the efficiency of energy transfer is strongly distance-dependent, observation of the BRET phenomenon requires that the donor and acceptor be in close proximity. To test for an interaction between two proteins of interest in cultured mammalian cells, one protein is expressed as a fusion with luciferase and the second as a fusion with YFP. An interaction between the two proteins of interest may bring the donor and acceptor sufficiently close for energy transfer to occur. Compared to other techniques for investigating protein-protein interactions, the BRET assay is sensitive, requires little hands-on time and few reagents, and is able to detect interactions which are weak, transient, or dependent on the biochemical environment found within a live cell. It is therefore an ideal approach for confirming putative interactions suggested by yeast two-hybrid or mass spectrometry proteomics studies, and in addition it is well-suited for mapping interacting regions, assessing the effect of post-translational modifications on protein-protein interactions, and evaluating the impact of mutations identified in patient DNA.
Cellular Biology, Issue 87, Protein-protein interactions, Bioluminescence Resonance Energy Transfer, Live cell, Transfection, Luciferase, Yellow Fluorescent Protein, Mutations
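A BRET readout of the kind described in the abstract above is usually summarized as a ratio of acceptor to donor emission. The sketch below shows one common way to compute it, assuming a donor-only control is measured alongside the sample and subtracted as background; the function names and emission values are hypothetical, and the abstract itself does not specify the calculation used.

```python
def bret_ratio(acceptor_emission, donor_emission):
    """Ratio of light measured in the acceptor (YFP) channel to the donor (luciferase) channel."""
    if donor_emission <= 0:
        raise ValueError("donor emission must be positive")
    return acceptor_emission / donor_emission


def corrected_bret(sample, donor_only):
    """Subtract the donor-only ratio (bleed-through into the acceptor channel) from the sample ratio.

    Each argument is an (acceptor_emission, donor_emission) pair.
    """
    return bret_ratio(*sample) - bret_ratio(*donor_only)


# Hypothetical plate-reader values, for illustration only.
sample = (300.0, 1000.0)      # cells expressing both luciferase and YFP fusions
donor_only = (100.0, 1000.0)  # cells expressing the luciferase fusion alone

signal = corrected_bret(sample, donor_only)
print(f"corrected BRET ratio: {signal:.2f}")
```

A corrected ratio near zero means the acceptor channel shows no more light than donor bleed-through alone, whereas a clearly positive value is consistent with the donor and acceptor being in close proximity.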
A Protocol for Computer-Based Protein Structure and Function Prediction
Institutions: University of Michigan, University of Kansas.
Genome sequencing projects have deciphered millions of protein sequences, which require knowledge of their structure and function to improve the understanding of their biological role. Although experimental methods can provide detailed information for a small fraction of these proteins, computational modeling is needed for the majority of protein molecules which are experimentally uncharacterized. The I-TASSER server is an on-line workbench for high-resolution modeling of protein structure and function. Given a protein sequence, a typical output from the I-TASSER server includes secondary structure prediction, predicted solvent accessibility of each residue, homologous template proteins detected by threading and structure alignments, up to five full-length tertiary structural models, and structure-based functional annotations for enzyme classification, Gene Ontology terms and protein-ligand binding sites. All the predictions are tagged with a confidence score which estimates how accurate the predictions are without knowledge of the experimental data. To accommodate the special requests of end users, the server provides channels to accept user-specified inter-residue distance and contact maps to interactively change the I-TASSER modeling; it also allows users to specify any protein as a template, or to exclude any template proteins during the structure assembly simulations. The structural information could be collected by the users based on experimental evidence or biological insights with the purpose of improving the quality of I-TASSER predictions. The server was evaluated as the best program for protein structure and function predictions in the recent community-wide CASP experiments. There are currently >20,000 registered scientists from over 100 countries who are using the on-line I-TASSER server.
Biochemistry, Issue 57, On-line server, I-TASSER, protein structure prediction, function prediction
High Throughput Quantitative Expression Screening and Purification Applied to Recombinant Disulfide-rich Venom Proteins Produced in E. coli
Institutions: Aix-Marseille Université, Commissariat à l'énergie atomique et aux énergies alternatives (CEA) Saclay, France.
Escherichia coli (E. coli) is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
Bioengineering, Issue 89, E. coli, expression, recombinant, high throughput (HTP), purification, auto-induction, immobilized metal affinity chromatography (IMAC), tobacco etch virus protease (TEV) cleavage, disulfide bond isomerase C (DsbC) fusion, disulfide bonds, animal venom proteins/peptides
Monitoring Plasmid Replication in Live Mammalian Cells over Multiple Generations by Fluorescence Microscopy
Institutions: University of Wisconsin - Madison.
Few naturally-occurring plasmids are maintained in mammalian cells. Among these are genomes of gamma-herpesviruses, including Epstein-Barr virus (EBV) and Kaposi's Sarcoma-associated herpesvirus (KSHV), which cause multiple human malignancies [1-3]. These two genomes are replicated in a licensed manner, each using a single viral protein and cellular replication machinery, and are passed to daughter cells during cell division despite their lacking traditional centromeres [4-8].
Much work has been done to characterize the replication of these plasmid genomes using methods such as Southern blotting and fluorescence in situ hybridization (FISH). These methods are limited, though. Quantitative PCR and Southern blots provide information about the average number of plasmids per cell in a population of cells. FISH is a single-cell assay that reveals both the average number and the distribution of plasmids per cell in the population of cells but is static, allowing no information about the parent or progeny of the examined cell.
Here, we describe a method for visualizing plasmids in live cells. This method is based on the binding of a fluorescently tagged lactose repressor protein to multiple sites in the plasmid of interest [9]. The DNA of interest is engineered to include approximately 250 tandem repeats of the lactose operator (LacO) sequence. LacO is specifically bound by the lactose repressor protein (LacI), which can be fused to a fluorescent protein. The fusion protein can either be expressed from the engineered plasmid or introduced by a retroviral vector. In this way, the DNA molecules are fluorescently tagged and therefore become visible via fluorescence microscopy. The fusion protein is blocked from binding the plasmid DNA by culturing cells in the presence of IPTG until the plasmids are ready to be viewed.
This system allows the plasmids to be monitored in living cells through several generations, revealing properties of their synthesis and partitioning to daughter cells. Ideal cells are adherent, easily transfected, and have large nuclei. This technique has been used to determine that 84% of EBV-derived plasmids are synthesized each generation and 88% of the newly synthesized plasmids partition faithfully to daughter cells in HeLa cells. Pairs of these EBV plasmids were seen to be tethered to or associated with sister chromatids after their synthesis in S-phase until they were seen to separate as the sister chromatids separated in anaphase [10]. The method is currently being used to study replication of KSHV genomes in HeLa cells and SLK cells. HeLa cells are immortalized human epithelial cells, and SLK cells are immortalized human endothelial cells. Though SLK cells were originally derived from a KSHV lesion, neither the HeLa nor SLK cell line naturally harbors KSHV genomes [11]. In addition to studying viral replication, this visualization technique can be used to investigate the effects of the addition, removal, or mutation of various DNA sequence elements on synthesis, localization, and partitioning of other recombinant plasmid DNAs.
Genetics, Issue 70, Molecular Biology, Cellular Biology, Genomics, DNA synthesis, DNA partitioning, plasmids, lac operator, lac repressor, mammalian cells, fluorescence microscopy
Measuring the Kinetics of mRNA Transcription in Single Living Cells
Institutions: Bar-Ilan University.
The transcriptional activity of RNA polymerase II (Pol II) is a dynamic process, and measuring the kinetics of transcription in vivo is therefore important. Pol II kinetics have been measured using biochemical or molecular methods.1-3 In recent years, with the development of new visualization methods, it has become possible to follow transcription as it occurs in real time in single living cells.4 Herein we describe how to perform analysis of Pol II elongation kinetics on a specific gene in living cells.5, 6 Using a cell line in which a specific gene locus (DNA), its mRNA product, and the final protein product can be fluorescently labeled and visualized in vivo, it is possible to detect the actual transcription of mRNAs on the gene of interest.7, 8 The mRNA is fluorescently tagged using the MS2 system for tagging mRNAs in vivo, where the 3'UTR of the mRNA transcripts contains 24 MS2 stem-loop repeats, which provide highly specific binding sites for the YFP-MS2 coat protein that labels the mRNA as it is transcribed.9
To monitor the kinetics of transcription we use the Fluorescence Recovery After Photobleaching (FRAP) method. By photobleaching the YFP-MS2-tagged nascent transcripts at the site of transcription and then following the recovery of this signal over time, we obtain the synthesis rate of the newly made mRNAs.5
In other words, YFP-MS2 fluorescence recovery reflects the generation of new MS2 stem-loops in the nascent transcripts and their binding by fluorescent free YFP-MS2 molecules entering from the surrounding nucleoplasm. The FRAP recovery curves are then analyzed using mathematical mechanistic models formalized by a series of differential equations, in order to retrieve the kinetic time parameters of transcription.
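As a rough illustration of how a recovery curve can yield a kinetic parameter, the sketch below assumes the simplest possible one-step model, dF/dt = k (F_inf - F). This is a stand-in, not the mechanistic model used in the article. The function names, the rate constant and the intensity values are all hypothetical.

```python
import math


def recovery(t, k, f0, f_inf):
    """Closed-form solution of dF/dt = k * (f_inf - F) with F(0) = f0.

    Assumed toy model: a single first-order recovery step.
    """
    return f_inf - (f_inf - f0) * math.exp(-k * t)


def fit_rate(times, values, f_inf):
    """Estimate k from a recovery curve.

    ln(f_inf - F) is linear in t with slope -k, so an ordinary
    least-squares slope on the log-transformed gap gives the rate.
    Points at or above the plateau are skipped (log undefined).
    """
    xs, ys = [], []
    for t, f in zip(times, values):
        gap = f_inf - f
        if gap > 0:
            xs.append(t)
            ys.append(math.log(gap))
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx


if __name__ == "__main__":
    # Hypothetical noise-free curve sampled every 5 s after bleaching.
    times = [5.0 * i for i in range(20)]
    values = [recovery(t, k=0.05, f0=0.2, f_inf=1.0) for t in times]
    k_est = fit_rate(times, values, f_inf=1.0)
    print("estimated k:", k_est, "half-time:", math.log(2) / k_est)
```

With real data the plateau would also have to be estimated, and a multi-step model would need a numerical fit rather than this log-linear shortcut.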
Cell Biology, Issue 54, mRNA transcription, nucleus, live-cell imaging, cellular dynamics, FRAP
A Toolkit to Enable Hydrocarbon Conversion in Aqueous Environments
Institutions: Delft University of Technology, Delft University of Technology.
This work puts forward a toolkit that enables the conversion of alkanes by Escherichia coli and presents a proof of principle of its applicability. The toolkit consists of multiple standard interchangeable parts (BioBricks)9 addressing the conversion of alkanes, regulation of gene expression and survival in toxic hydrocarbon-rich environments.
A three-step pathway for alkane degradation was implemented in E. coli to enable the conversion of medium- and long-chain alkanes to their respective alkanols, alkanals and ultimately alkanoic acids. The latter were metabolized via the native β-oxidation pathway. To facilitate the oxidation of medium-chain alkanes (C5-C13) and cycloalkanes (C5-C8), four genes (alkB2) of the alkane hydroxylase system from Gordonia were transformed into E. coli. For the conversion of long-chain alkanes (C15-C36), the ladA gene from Geobacillus thermodenitrificans was implemented. For the required further steps of the degradation process, ADH and ALDH (originating from G. thermodenitrificans) were introduced10,11. The activity was measured by resting cell assays. For each oxidative step, enzyme activity was observed.
To optimize the process efficiency, the expression was only induced under low glucose conditions: a substrate-regulated promoter, pCaiF, was used. pCaiF is present in E. coli K12 and regulates the expression of the genes involved in the degradation of non-glucose carbon sources.
The last part of the toolkit, targeting survival, was implemented using solvent tolerance genes, PhPFDα and β, both from Pyrococcus horikoshii OT3. Organic solvents can induce cell stress and decrease survivability by negatively affecting protein folding. As chaperones, PhPFDα and β improve the protein folding process, e.g., in the presence of alkanes. The expression of these genes led to improved hydrocarbon tolerance, shown by an increased growth rate (up to 50%) in the presence of 10% n-hexane in the culture medium.
Summarizing, the results indicate that the toolkit enables E. coli to convert and tolerate hydrocarbons in aqueous environments. As such, it represents an initial step towards a sustainable solution for oil-remediation using a synthetic biology approach.
Bioengineering, Issue 68, Microbiology, Biochemistry, Chemistry, Chemical Engineering, Oil remediation, alkane metabolism, alkane hydroxylase system, resting cell assay, prefoldin, Escherichia coli, synthetic biology, homologous interaction mapping, mathematical model, BioBrick, iGEM
In Vitro Reconstitution of Light-harvesting Complexes of Plants and Green Algae
Institutions: VU University Amsterdam.
In plants and green algae, light is captured by the light-harvesting complexes (LHCs), a family of integral membrane proteins that coordinate chlorophylls and carotenoids. In vivo, these proteins are folded with pigments to form complexes which are inserted in the thylakoid membrane of the chloroplast. The high similarity in the chemical and physical properties of the members of the family, together with the fact that they can easily lose pigments during isolation, makes their purification in a native state challenging. An alternative approach to obtain homogeneous preparations of LHCs was developed by Plumley and Schmidt in 1987 1, who showed that it was possible to reconstitute these complexes in vitro starting from purified pigments and unfolded apoproteins, resulting in complexes with properties very similar to those of native complexes. This opened the way to the use of bacterially expressed recombinant proteins for in vitro reconstitution. The reconstitution method is powerful for various reasons: (1) pure preparations of individual complexes can be obtained, (2) pigment composition can be controlled to assess their contribution to structure and function, (3) recombinant proteins can be mutated to study the functional role of individual residues (e.g., pigment binding sites) or protein domains (e.g., protein-protein interaction, folding). This method has been optimized in several laboratories and applied to most of the light-harvesting complexes. The protocol described here details the method of reconstituting light-harvesting complexes in vitro currently used in our laboratory, and examples describing applications of the method are provided.
Biochemistry, Issue 92, Reconstitution, Photosynthesis, Chlorophyll, Carotenoids, Light Harvesting Protein, Chlamydomonas reinhardtii, Arabidopsis thaliana
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances in drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (https://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Generation of Enterobacter sp. YSU Auxotrophs Using Transposon Mutagenesis
Institutions: Youngstown State University.
Prototrophic bacteria grow on M-9 minimal salts medium supplemented with glucose (M-9 medium), which is used as a carbon and energy source. Auxotrophs can be generated using a transposome. The commercially available, Tn5-derived transposome used in this protocol consists of a linear segment of DNA containing an R6Kγ replication origin, a gene for kanamycin resistance and two mosaic sequence ends, which serve as transposase binding sites. The transposome, provided as a DNA/transposase protein complex, is introduced by electroporation into the prototrophic strain, Enterobacter sp. YSU, and randomly incorporates itself into this host’s genome. Transformants are replica plated onto Luria-Bertani agar plates containing kanamycin (LB-kan) and onto M-9 medium agar plates containing kanamycin (M-9-kan). The transformants that grow on LB-kan plates but not on M-9-kan plates are considered to be auxotrophs. Purified genomic DNA from an auxotroph is partially digested, ligated and transformed into a pir+ Escherichia coli strain. The R6Kγ replication origin allows the plasmid to replicate in pir+ E. coli strains, and the kanamycin resistance marker allows for plasmid selection. Each transformant possesses a new plasmid containing the transposon flanked by the interrupted chromosomal region. Sanger sequencing and the Basic Local Alignment Search Tool (BLAST) suggest a putative identity of the interrupted gene. There are three advantages to using this transposome mutagenesis strategy. First, it does not rely on the expression of a transposase gene by the host. Second, the transposome is introduced into the target host by electroporation, rather than by conjugation or by transduction, and therefore is more efficient. Third, the R6Kγ replication origin makes it easy to identify the mutated gene, which is partially recovered in a recombinant plasmid. This technique can be used to investigate the genes involved in other characteristics of Enterobacter sp. YSU or of a wider variety of bacterial strains.
Microbiology, Issue 92, Auxotroph, transposome, transposon, mutagenesis, replica plating, glucose minimal medium, complex medium, Enterobacter
Chromatin Interaction Analysis with Paired-End Tag Sequencing (ChIA-PET) for Mapping Chromatin Interactions and Understanding Transcription Regulation
Institutions: Agency for Science, Technology and Research, Singapore, A*STAR-Duke-NUS Neuroscience Research Partnership, Singapore, National University of Singapore, Singapore.
Genomes are organized into three-dimensional structures, adopting higher-order conformations inside the micron-sized nuclear spaces 7, 2, 12. Such architectures are not random and involve interactions between gene promoters and regulatory elements 13. The binding of transcription factors to specific regulatory sequences brings about a network of transcription regulation and coordination 1, 14.
Chromatin Interaction Analysis by Paired-End Tag Sequencing (ChIA-PET) was developed to identify these higher-order chromatin structures 5,6. Cells are fixed and interacting loci are captured by covalent DNA-protein cross-links. To minimize non-specific noise and reduce complexity, as well as to increase the specificity of the chromatin interaction analysis, chromatin immunoprecipitation (ChIP) is used against specific protein factors to enrich chromatin fragments of interest before proximity ligation. Ligation involving half-linkers subsequently forms covalent links between pairs of DNA fragments tethered together within individual chromatin complexes. The flanking MmeI restriction enzyme sites in the half-linkers allow extraction of paired end tag-linker-tag constructs (PETs) upon MmeI digestion. As the half-linkers are biotinylated, these PET constructs are purified using streptavidin-magnetic beads. The purified PETs are ligated with next-generation sequencing adaptors and a catalog of interacting fragments is generated via next-generation sequencers such as the Illumina Genome Analyzer. Mapping and bioinformatics analysis is then performed to identify ChIP-enriched binding sites and ChIP-enriched chromatin interactions 8.
We have produced a video to demonstrate critical aspects of the ChIA-PET protocol, especially the preparation of ChIP as the quality of ChIP plays a major role in the outcome of a ChIA-PET library. As the protocols are very long, only the critical steps are shown in the video.
Genetics, Issue 62, ChIP, ChIA-PET, Chromatin Interactions, Genomics, Next-Generation Sequencing
Molecular Evolution of the Tre Recombinase
Institutions: Max Planck Institute for Molecular Cell Biology and Genetics, Dresden.
Here we report the generation of Tre recombinase through directed, molecular evolution. Tre recombinase recognizes a pre-defined target sequence within the LTR sequences of the HIV-1 provirus, resulting in the excision and eradication of the provirus from infected human cells.
We started with Cre, a 38-kDa recombinase that recognizes a 34-bp double-stranded DNA sequence known as loxP. Because Cre can effectively eliminate genomic sequences, we set out to tailor a recombinase that could remove the sequence between the 5'-LTR and 3'-LTR of an integrated HIV-1 provirus. As a first step we identified sequences within the LTR sites that were similar to loxP and tested for recombination activity. Initially Cre and mutagenized Cre libraries failed to recombine the chosen loxLTR sites of the HIV-1 provirus. As the start of any directed molecular evolution process requires at least residual activity, the original asymmetric loxLTR sequences were split into subsets and tested again for recombination activity. The subsets served as intermediates, and recombination activity was shown with them. Next, recombinase libraries were enriched through reiterative evolution cycles. Subsequently, enriched libraries were shuffled and recombined. The combination of different mutations proved synergistic and recombinases were created that were able to recombine loxLTR1 and loxLTR2. This was evidence that an evolutionary strategy through intermediates can be successful. After a total of 126 evolution cycles individual recombinases were functionally and structurally analyzed. The most active recombinase, Tre, had 19 amino acid changes compared to Cre. Tre recombinase was able to excise the HIV-1 provirus from the genome of HIV-1 infected HeLa cells (see "HIV-1 Proviral DNA Excision Using an Evolved Recombinase", Hauber J., Heinrich-Pette-Institute for Experimental Virology and Immunology, Hamburg, Germany). While still in its infancy, directed molecular evolution will allow the creation of custom enzymes that will serve as tools of "molecular surgery" and molecular medicine.
Cell Biology, Issue 15, HIV-1, Tre recombinase, Site-specific recombination, molecular evolution
Monitoring Actin Disassembly with Time-lapse Microscopy
Institutions: Harvard Medical School.
Cellular Biology, Issue 1, cytoskeleton, actin, timelapse, filament, chamber
Principles of Site-Specific Recombinase (SSR) Technology
Institutions: Max Planck Institute for Molecular Cell Biology and Genetics, Dresden.
Site-specific recombinase (SSR) technology allows the manipulation of gene structure to explore gene function and has become an integral tool of molecular biology. Site-specific recombinases are proteins that bind to distinct DNA target sequences. The Cre/lox system was first described in bacteriophages during the 1980s. Cre recombinase is a Type I topoisomerase that catalyzes site-specific recombination of DNA between two loxP (locus of X-over P1) sites. The Cre/lox system does not require any cofactors. LoxP sequences contain distinct binding sites for Cre recombinases that surround a directional core sequence where recombination and rearrangement take place. When cells contain loxP sites and express the Cre recombinase, a recombination event occurs. Double-stranded DNA is cut at both loxP sites by the Cre recombinase, rearranged, and ligated ("scissors and glue"). Products of the recombination event depend on the relative orientation of the asymmetric sequences.
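The dependence on orientation can be sketched in a few lines. The toy model below relies on the well-established rule that two loxP sites in the same orientation lead to excision of the intervening DNA, while two sites in opposite orientation lead to its inversion. The loxP sequence shown is the commonly cited canonical 34-bp site, which is not given in the abstract. The function names and test sequences are hypothetical, and the excised circular product is not modeled.

```python
# Canonical 34-bp loxP site: 13-bp arm, 8-bp asymmetric spacer, 13-bp arm.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


LOXP_RC = reverse_complement(LOXP)


def recombine(dna):
    """Apply one Cre-like recombination event to a linear DNA string.

    Exactly two loxP sites are expected. Same orientation gives
    excision (one site stays behind); opposite orientation gives
    inversion of the segment between the sites.
    """
    n = len(LOXP)
    sites = []
    for i in range(len(dna) - n + 1):
        window = dna[i:i + n]
        if window == LOXP:
            sites.append((i, "+"))
        elif window == LOXP_RC:
            sites.append((i, "-"))
    if len(sites) != 2:
        raise ValueError("expected exactly two loxP sites")
    (a, ori_a), (b, ori_b) = sites
    if ori_a == ori_b:
        # Direct repeats: drop the floxed segment and the second site.
        return "excision", dna[:a + n] + dna[b + n:]
    # Inverted repeats: flip the segment between the two sites.
    middle = dna[a + n:b]
    return "inversion", dna[:a + n] + reverse_complement(middle) + dna[b:]


if __name__ == "__main__":
    floxed = "GGG" + LOXP + "AAAC" + LOXP + "TTT"
    print(recombine(floxed)[0])
```

The asymmetric 8-bp spacer is what gives each site a direction, which is why the forward site and its reverse complement can be told apart by simple string comparison here.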
SSR technology is frequently used as a tool to explore gene function. Here the gene of interest is flanked with Cre target sites loxP ("floxed"). Animals are then crossed with animals expressing the Cre recombinase under the control of a tissue-specific promoter. In tissues that express the Cre recombinase, it binds to target sequences and excises the floxed gene. Controlled gene deletion allows the investigation of gene function in specific tissues and at distinct time points. Analysis of gene function employing SSR technology (conditional mutagenesis) has significant advantages over traditional knock-outs, where gene deletion is frequently lethal.
Cellular Biology, Issue 15, Molecular Biology, Site-Specific Recombinase, Cre recombinase, Cre/lox system, transgenic animals, transgenic technology
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1. In this article, we utilize a web version of SCOPE2 to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4 and has been used in other studies5-8.
The three algorithms that comprise SCOPE are BEAM9, which finds non-degenerate motifs (ACCGGT), PRISM10, which finds degenerate motifs (ASCGWT), and SPACER11, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
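To make the three motif classes concrete, the sketch below expands IUPAC nucleotide codes into a regular expression and scans one strand of a sequence for matches. It only illustrates what non-degenerate, degenerate and bipartite motifs look like as patterns. It is not SCOPE's algorithm, which scores motifs by over-representation and position preference. The function names and test sequences are hypothetical.

```python
import re

# Standard IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def motif_to_regex(motif):
    """Translate an IUPAC motif such as 'ASCGWT' into a regex string."""
    return "".join(IUPAC[ch] for ch in motif.upper())


def find_motif(motif, sequence):
    """Return 0-based start positions of all matches on the given strand.

    A zero-width lookahead is used so that overlapping occurrences
    are reported as well.
    """
    pattern = re.compile("(?=" + motif_to_regex(motif) + ")")
    return [m.start() for m in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    print(find_motif("ACCGGT", "TTACCGGTAA"))            # non-degenerate
    print(find_motif("ASCGWT", "AGCGATACCGTT"))          # degenerate
    print(find_motif("ACCnnnnnnnnGGT", "ACCTTTTTTTTGGT"))  # bipartite
```

Scanning the reverse strand would require running the same search on the reverse complement of the sequence, which is omitted here for brevity.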
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
SCOPE has a user-friendly interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes or FASTA sequences. These can be entered in browser text fields or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11.
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif