Lysine methylation is an emerging post-translation modification and it has been identified on several histone and non-histone proteins, where it plays crucial roles in cell development and many diseases. Approximately 5,000 lysine methylation sites were identified on different proteins, which are set by few dozens of protein lysine methyltransferases. This suggests that each PKMT methylates multiple proteins, however till now only one or two substrates have been identified for several of these enzymes. To approach this problem, we have introduced peptide array based substrate specificity analyses of PKMTs. Peptide arrays are powerful tools to characterize the specificity of PKMTs because methylation of several substrates with different sequences can be tested on one array. We synthesized peptide arrays on cellulose membrane using an Intavis SPOT synthesizer and analyzed the specificity of various PKMTs. Based on the results, for several of these enzymes, novel substrates could be identified. For example, for NSD1 by employing peptide arrays, we showed that it methylates K44 of H4 instead of the reported H4K20 and in addition H1.5K168 is the highly preferred substrate over the previously known H3K36. Hence, peptide arrays are powerful tools to biochemically characterize the PKMTs.
26 Related JoVE Articles!
Optimization and Utilization of Agrobacterium-mediated Transient Protein Production in Nicotiana
Institutions: Fraunhofer USA Center for Molecular Biotechnology.
-mediated transient protein production in plants is a promising approach to produce vaccine antigens and therapeutic proteins within a short period of time. However, this technology is only just beginning to be applied to large-scale production as many technological obstacles to scale up are now being overcome. Here, we demonstrate a simple and reproducible method for industrial-scale transient protein production based on vacuum infiltration of Nicotiana
plants with Agrobacteria
carrying launch vectors. Optimization of Agrobacterium
cultivation in AB medium allows direct dilution of the bacterial culture in Milli-Q water, simplifying the infiltration process. Among three tested species of Nicotiana
, N. excelsiana
× N. excelsior
) was selected as the most promising host due to the ease of infiltration, high level of reporter protein production, and about two-fold higher biomass production under controlled environmental conditions. Induction of Agrobacterium
harboring pBID4-GFP (Tobacco mosaic virus
-based) using chemicals such as acetosyringone and monosaccharide had no effect on the protein production level. Infiltrating plant under 50 to 100 mbar for 30 or 60 sec resulted in about 95% infiltration of plant leaf tissues. Infiltration with Agrobacterium
laboratory strain GV3101 showed the highest protein production compared to Agrobacteria
laboratory strains LBA4404 and C58C1 and wild-type Agrobacteria
strains at6, at10, at77 and A4. Co-expression of a viral RNA silencing suppressor, p23 or p19, in N. benthamiana
resulted in earlier accumulation and increased production (15-25%) of target protein (influenza virus hemagglutinin).
Plant Biology, Issue 86, Agroinfiltration, Nicotiana benthamiana, transient protein production, plant-based expression, viral vector, Agrobacteria
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Expression, Isolation, and Purification of Soluble and Insoluble Biotinylated Proteins for Nerve Tissue Regeneration
Institutions: University of Akron.
Recombinant protein engineering has utilized Escherichia coli (E. coli)
expression systems for nearly 4 decades, and today E. coli
is still the most widely used host organism. The flexibility of the system allows for the addition of moieties such as a biotin tag (for streptavidin interactions) and larger functional proteins like green fluorescent protein or cherry red protein. Also, the integration of unnatural amino acids like metal ion chelators, uniquely reactive functional groups, spectroscopic probes, and molecules imparting post-translational modifications has enabled better manipulation of protein properties and functionalities. As a result this technique creates customizable fusion proteins that offer significant utility for various fields of research. More specifically, the biotinylatable protein sequence has been incorporated into many target proteins because of the high affinity interaction between biotin with avidin and streptavidin. This addition has aided in enhancing detection and purification of tagged proteins as well as opening the way for secondary applications such as cell sorting. Thus, biotin-labeled molecules show an increasing and widespread influence in bioindustrial and biomedical fields. For the purpose of our research we have engineered recombinant biotinylated fusion proteins containing nerve growth factor (NGF) and semaphorin3A (Sema3A) functional regions. We have reported previously how these biotinylated fusion proteins, along with other active protein sequences, can be tethered to biomaterials for tissue engineering and regenerative purposes. This protocol outlines the basics of engineering biotinylatable proteins at the milligram scale, utilizing a T7 lac
inducible vector and E. coli
expression hosts, starting from transformation to scale-up and purification.
Bioengineering, Issue 83, protein engineering, recombinant protein production, AviTag, BirA, biotinylation, pET vector system, E. coli, inclusion bodies, Ni-NTA, size exclusion chromatography
High Throughput Quantitative Expression Screening and Purification Applied to Recombinant Disulfide-rich Venom Proteins Produced in E. coli
Institutions: Aix-Marseille Université, Commissariat à l'énergie atomique et aux énergies alternatives (CEA) Saclay, France.
Escherichia coli (E. coli)
is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli
cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
Bioengineering, Issue 89, E. coli, expression, recombinant, high throughput (HTP), purification, auto-induction, immobilized metal affinity chromatography (IMAC), tobacco etch virus protease (TEV) cleavage, disulfide bond isomerase C (DsbC) fusion, disulfide bonds, animal venom proteins/peptides
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
Non-chromatographic Purification of Recombinant Elastin-like Polypeptides and their Fusions with Peptides and Proteins from Escherichia coli
Institutions: Duke University, Duke University.
Elastin-like polypeptides are repetitive biopolymers that exhibit a lower critical solution temperature phase transition behavior, existing as soluble unimers below a characteristic transition temperature and aggregating into micron-scale coacervates above their transition temperature. The design of elastin-like polypeptides at the genetic level permits precise control of their sequence and length, which dictates their thermal properties. Elastin-like polypeptides are used in a variety of applications including biosensing, tissue engineering, and drug delivery, where the transition temperature and biopolymer architecture of the ELP can be tuned for the specific application of interest. Furthermore, the lower critical solution temperature phase transition behavior of elastin-like polypeptides allows their purification by their thermal response, such that their selective coacervation and resolubilization allows the removal of both soluble and insoluble contaminants following expression in Escherichia coli
. This approach can be used for the purification of elastin-like polypeptides alone or as a purification tool for peptide or protein fusions where recombinant peptides or proteins genetically appended to elastin-like polypeptide tags can be purified without chromatography. This protocol describes the purification of elastin-like polypeptides and their peptide or protein fusions and discusses basic characterization techniques to assess the thermal behavior of pure elastin-like polypeptide products.
Molecular Biology, Issue 88, elastin-like polypeptides, lower critical solution temperature, phase separation, inverse transition cycling, protein purification, batch purification
Expression of Recombinant Cellulase Cel5A from Trichoderma reesei in Tobacco Plants
Institutions: RWTH Aachen University, Fraunhofer Institute for Molecular Biology and Applied Ecology.
Cellulose degrading enzymes, cellulases, are targets of both research and industrial interests. The preponderance of these enzymes in difficult-to-culture organisms, such as hyphae-building fungi and anaerobic bacteria, has hastened the use of recombinant technologies in this field. Plant expression methods are a desirable system for large-scale production of enzymes and other industrially useful proteins. Herein, methods for the transient expression of a fungal endoglucanase, Trichoderma reesei
Cel5A, in Nicotiana tabacum
are demonstrated. Successful protein expression is shown, monitored by fluorescence using an mCherry-enzyme fusion protein. Additionally, a set of basic tests are used to examine the activity of transiently expressed T. reesei
Cel5A, including SDS-PAGE, Western blotting, zymography, as well as fluorescence and dye-based substrate degradation assays. The system described here can be used to produce an active cellulase in a short time period, so as to assess the potential for further production in plants through constitutive or inducible expression systems.
Environmental Sciences, Issue 88, heterologous expression, endoplasmic reticulum, endoglucanase, cellulose, glycosyl-hydrolase, fluorescence, cellulase, Trichoderma reesei, tobacco plants
High Efficiency Differentiation of Human Pluripotent Stem Cells to Cardiomyocytes and Characterization by Flow Cytometry
Institutions: Medical College of Wisconsin, Stanford University School of Medicine, Medical College of Wisconsin, Hong Kong University, Johns Hopkins University School of Medicine, Medical College of Wisconsin.
There is an urgent need to develop approaches for repairing the damaged heart, discovering new therapeutic drugs that do not have toxic effects on the heart, and improving strategies to accurately model heart disease. The potential of exploiting human induced pluripotent stem cell (hiPSC) technology to generate cardiac muscle “in a dish” for these applications continues to generate high enthusiasm. In recent years, the ability to efficiently generate cardiomyogenic cells from human pluripotent stem cells (hPSCs) has greatly improved, offering us new opportunities to model very early stages of human cardiac development not otherwise accessible. In contrast to many previous methods, the cardiomyocyte differentiation protocol described here does not require cell aggregation or the addition of Activin A or BMP4 and robustly generates cultures of cells that are highly positive for cardiac troponin I and T (TNNI3, TNNT2), iroquois-class homeodomain protein IRX-4 (IRX4), myosin regulatory light chain 2, ventricular/cardiac muscle isoform (MLC2v) and myosin regulatory light chain 2, atrial isoform (MLC2a) by day 10 across all human embryonic stem cell (hESC) and hiPSC lines tested to date. Cells can be passaged and maintained for more than 90 days in culture. The strategy is technically simple to implement and cost-effective. Characterization of cardiomyocytes derived from pluripotent cells often includes the analysis of reference markers, both at the mRNA and protein level. For protein analysis, flow cytometry is a powerful analytical tool for assessing quality of cells in culture and determining subpopulation homogeneity. However, technical variation in sample preparation can significantly affect quality of flow cytometry data. Thus, standardization of staining protocols should facilitate comparisons among various differentiation strategies. Accordingly, optimized staining protocols for the analysis of IRX4, MLC2v, MLC2a, TNNI3, and TNNT2 by flow cytometry are described.
Cellular Biology, Issue 91, human induced pluripotent stem cell, flow cytometry, directed differentiation, cardiomyocyte, IRX4, TNNI3, TNNT2, MCL2v, MLC2a
Synthesis and Characterization of Functionalized Metal-organic Frameworks
Institutions: Northwestern University, Warsaw University of Technology, King Abdulaziz University.
Metal-organic frameworks have attracted extraordinary amounts of research attention, as they are attractive candidates for numerous industrial and technological applications. Their signature property is their ultrahigh porosity, which however imparts a series of challenges when it comes to both constructing them and working with them. Securing desired MOF chemical and physical functionality by linker/node assembly into a highly porous framework of choice can pose difficulties, as less porous and more thermodynamically stable congeners (e.g.
, other crystalline polymorphs, catenated analogues) are often preferentially obtained by conventional synthesis methods. Once the desired product is obtained, its characterization often requires specialized techniques that address complications potentially arising from, for example, guest-molecule loss or preferential orientation of microcrystallites. Finally, accessing the large voids inside the MOFs for use in applications that involve gases can be problematic, as frameworks may be subject to collapse during removal of solvent molecules (remnants of solvothermal synthesis). In this paper, we describe synthesis and characterization methods routinely utilized in our lab either to solve or circumvent these issues. The methods include solvent-assisted linker exchange, powder X-ray diffraction in capillaries, and materials activation (cavity evacuation) by supercritical CO2
drying. Finally, we provide a protocol for determining a suitable pressure region for applying the Brunauer-Emmett-Teller analysis to nitrogen isotherms, so as to estimate surface area of MOFs with good accuracy.
Chemistry, Issue 91, Metal-organic frameworks, porous coordination polymers, supercritical CO2 activation, crystallography, solvothermal, sorption, solvent-assisted linker exchange
Identification of Key Factors Regulating Self-renewal and Differentiation in EML Hematopoietic Precursor Cells by RNA-sequencing Analysis
Institutions: The University of Texas Graduate School of Biomedical Sciences at Houston.
Hematopoietic stem cells (HSCs) are used clinically for transplantation treatment to rebuild a patient's hematopoietic system in many diseases such as leukemia and lymphoma. Elucidating the mechanisms controlling HSCs self-renewal and differentiation is important for application of HSCs for research and clinical uses. However, it is not possible to obtain large quantity of HSCs due to their inability to proliferate in vitro
. To overcome this hurdle, we used a mouse bone marrow derived cell line, the EML (Erythroid, Myeloid, and Lymphocytic) cell line, as a model system for this study.
RNA-sequencing (RNA-Seq) has been increasingly used to replace microarray for gene expression studies. We report here a detailed method of using RNA-Seq technology to investigate the potential key factors in regulation of EML cell self-renewal and differentiation. The protocol provided in this paper is divided into three parts. The first part explains how to culture EML cells and separate Lin-CD34+ and Lin-CD34- cells. The second part of the protocol offers detailed procedures for total RNA preparation and the subsequent library construction for high-throughput sequencing. The last part describes the method for RNA-Seq data analysis and explains how to use the data to identify differentially expressed transcription factors between Lin-CD34+ and Lin-CD34- cells. The most significantly differentially expressed transcription factors were identified to be the potential key regulators controlling EML cell self-renewal and differentiation. In the discussion section of this paper, we highlight the key steps for successful performance of this experiment.
In summary, this paper offers a method of using RNA-Seq technology to identify potential regulators of self-renewal and differentiation in EML cells. The key factors identified are subjected to downstream functional analysis in vitro
and in vivo
Genetics, Issue 93, EML Cells, Self-renewal, Differentiation, Hematopoietic precursor cell, RNA-Sequencing, Data analysis
Modeling Mucosal Candidiasis in Larval Zebrafish by Swimbladder Injection
Institutions: University of Maine, University of Maine.
Early defense against mucosal pathogens consists of both an epithelial barrier and innate immune cells. The immunocompetency of both, and their intercommunication, are paramount for the protection against infections. The interactions of epithelial and innate immune cells with a pathogen are best investigated in vivo
, where complex behavior unfolds over time and space. However, existing models do not allow for easy spatio-temporal imaging of the battle with pathogens at the mucosal level.
The model developed here creates a mucosal infection by direct injection of the fungal pathogen, Candida albicans
, into the swimbladder of juvenile zebrafish. The resulting infection enables high-resolution imaging of epithelial and innate immune cell behavior throughout the development of mucosal disease. The versatility of this method allows for interrogation of the host to probe the detailed sequence of immune events leading to phagocyte recruitment and to examine the roles of particular cell types and molecular pathways in protection. In addition, the behavior of the pathogen as a function of immune attack can be imaged simultaneously by using fluorescent protein-expressing C. albicans
. Increased spatial resolution of the host-pathogen interaction is also possible using the described rapid swimbladder dissection technique.
The mucosal infection model described here is straightforward and highly reproducible, making it a valuable tool for the study of mucosal candidiasis. This system may also be broadly translatable to other mucosal pathogens such as mycobacterial, bacterial or viral microbes that normally infect through epithelial surfaces.
Immunology, Issue 93, Zebrafish, mucosal candidiasis, mucosal infection, epithelial barrier, epithelial cells, innate immunity, swimbladder, Candida albicans, in vivo.
Isolation of Cellular Lipid Droplets: Two Purification Techniques Starting from Yeast Cells and Human Placentas
Institutions: University of Tennessee, University of Tennessee.
Lipid droplets are dynamic organelles that can be found in most eukaryotic and certain prokaryotic cells. Structurally, the droplets consist of a core of neutral lipids surrounded by a phospholipid monolayer. One of the most useful techniques in determining the cellular roles of droplets has been proteomic identification of bound proteins, which can be isolated along with the droplets. Here, two methods are described to isolate lipid droplets and their bound proteins from two wide-ranging eukaryotes: fission yeast and human placental villous cells. Although both techniques have differences, the main method - density gradient centrifugation - is shared by both preparations. This shows the wide applicability of the presented droplet isolation techniques.
In the first protocol, yeast cells are converted into spheroplasts by enzymatic digestion of their cell walls. The resulting spheroplasts are then gently lysed in a loose-fitting homogenizer. Ficoll is added to the lysate to provide a density gradient, and the mixture is centrifuged three times. After the first spin, the lipid droplets are localized to the white-colored floating layer of the centrifuge tubes along with the endoplasmic reticulum (ER), the plasma membrane, and vacuoles. Two subsequent spins are used to remove these other three organelles. The result is a layer that has only droplets and bound proteins.
In the second protocol, placental villous cells are isolated from human term placentas by enzymatic digestion with trypsin and DNase I. The cells are homogenized in a loose-fitting homogenizer. Low-speed and medium-speed centrifugation steps are used to remove unbroken cells, cellular debris, nuclei, and mitochondria. Sucrose is added to the homogenate to provide a density gradient and the mixture is centrifuged to separate the lipid droplets from the other cellular fractions.
The purity of the lipid droplets in both protocols is confirmed by Western Blot analysis. The droplet fractions from both preps are suitable for subsequent proteomic and lipidomic analysis.
Bioengineering, Issue 86, Lipid droplet, lipid body, fat body, oil body, Yeast, placenta, placental villous cells, isolation, purification, density gradient centrifugation
Genetic Manipulation in Δku80 Strains for Functional Genomic Analysis of Toxoplasma gondii
Institutions: The Geisel School of Medicine at Dartmouth.
Targeted genetic manipulation using homologous recombination is the method of choice for functional genomic analysis to obtain a detailed view of gene function and phenotype(s). The development of mutant strains with targeted gene deletions, targeted mutations, complemented gene function, and/or tagged genes provides powerful strategies to address gene function, particularly if these genetic manipulations can be efficiently targeted to the gene locus of interest using integration mediated by double cross over homologous recombination.
Due to very high rates of nonhomologous recombination, functional genomic analysis of Toxoplasma gondii
has been previously limited by the absence of efficient methods for targeting gene deletions and gene replacements to specific genetic loci. Recently, we abolished the major pathway of nonhomologous recombination in type I and type II strains of T. gondii
by deleting the gene encoding the KU80 protein1,2
. The Δku80
strains behave normally during tachyzoite (acute) and bradyzoite (chronic) stages in vitro
and in vivo
and exhibit essentially a 100% frequency of homologous recombination. The Δku80
strains make functional genomic studies feasible on the single gene as well as on the genome scale1-4
Here, we report methods for using type I and type II Δku80Δhxgprt
strains to advance gene targeting approaches in T. gondii
. We outline efficient methods for generating gene deletions, gene replacements, and tagged genes by targeted insertion or deletion of the hypoxanthine-xanthine-guanine phosphoribosyltransferase (HXGPRT
) selectable marker. The described gene targeting protocol can be used in a variety of ways in Δku80
strains to advance functional analysis of the parasite genome and to develop single strains that carry multiple targeted genetic manipulations. The application of this genetic method and subsequent phenotypic assays will reveal fundamental and unique aspects of the biology of T. gondii
and related significant human pathogens that cause malaria (Plasmodium
sp.) and cryptosporidiosis (Cryptosporidium
Infectious Diseases, Issue 77, Genetics, Microbiology, Infection, Medicine, Immunology, Molecular Biology, Cellular Biology, Biomedical Engineering, Bioengineering, Genomics, Parasitology, Pathology, Apicomplexa, Coccidia, Toxoplasma, Genetic Techniques, Gene Targeting, Eukaryota, Toxoplasma gondii, genetic manipulation, gene targeting, gene deletion, gene replacement, gene tagging, homologous recombination, DNA, sequencing
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
Development of a Negative Selectable Marker for Entamoeba histolytica
Institutions: University of Virginia Health System.
is the causative agent of amebiasis and infects up to 10% of the world's population. The molecular techniques that have enabled the up- and down-regulation of gene expression rely on the transfection of stably maintained plasmids. While these have increased our understanding of Entamoeba virulence factors, the capacity to integrate exogenous DNA into genome, which would allow reverse genetics experiments, would be a significant advantage in the study of this parasite. The challenges presented by this organism include inability to select for homologous recombination events and difficulty to cure episomal plasmid DNA from transfected trophozoites. The later results in a high background of exogenous DNA, a major problem in the identification of trophozoites in which a bona fide genomic integration event has occurred. We report the development of a negative selection system based upon transgenic expression of a yeast cytosine deaminase and uracil phosphoribosyl transferase chimera (FCU1) and selection with prodrug 5-fluorocytosine (5-FC). The FCU1 enzyme converts non-toxic 5-FC into toxic 5-fluorouracil and 5-fluorouridine-5'-monophosphate. E. histolytica
lines expressing FCU1 were found to be 30 fold more sensitive to the prodrug compared to the control strain.
Infectious Disease, Issue 46, Entamoeba, negative selectable marker, 5-fluorocytosine, gene knockout, Cytosine deaminase, UPRT CMFDA.
GENPLAT: an Automated Platform for Biomass Enzyme Discovery and Cocktail Optimization
Institutions: Michigan State University, Michigan State University.
The high cost of enzymes for biomass deconstruction is a major impediment to the economic conversion of lignocellulosic feedstocks to liquid transportation fuels such as ethanol. We have developed an integrated high throughput platform, called GENPLAT, for the discovery and development of novel enzymes and enzyme cocktails for the release of sugars from diverse pretreatment/biomass combinations. GENPLAT comprises four elements: individual pure enzymes, statistical design of experiments, robotic pipeting of biomass slurries and enzymes, and automated colorimeteric determination of released Glc and Xyl. Individual enzymes are produced by expression in Pichia pastoris
or Trichoderma reesei
, or by chromatographic purification from commercial cocktails or from extracts of novel microorganisms. Simplex lattice (fractional factorial) mixture models are designed using commercial Design of Experiment statistical software. Enzyme mixtures of high complexity are constructed using robotic pipeting into a 96-well format. The measurement of released Glc and Xyl is automated using enzyme-linked colorimetric assays. Optimized enzyme mixtures containing as many as 16 components have been tested on a variety of feedstock and pretreatment combinations.
GENPLAT is adaptable to mixtures of pure enzymes, mixtures of commercial products (e.g., Accellerase 1000 and Novozyme 188), extracts of novel microbes, or combinations thereof. To make and test mixtures of ˜10 pure enzymes requires less than 100 μg of each protein and fewer than 100 total reactions, when operated at a final total loading of 15 mg protein/g glucan. We use enzymes from several sources. Enzymes can be purified from natural sources such as fungal cultures (e.g., Aspergillus niger
, Cochliobolus carbonum
, and Galerina marginata
), or they can be made by expression of the encoding genes (obtained from the increasing number of microbial genome sequences) in hosts such as E. coli, Pichia pastoris
, or a filamentous fungus such as T. reesei
. Proteins can also be purified from commercial enzyme cocktails (e.g., Multifect Xylanase, Novozyme 188). An increasing number of pure enzymes, including glycosyl hydrolases, cell wall-active esterases, proteases, and lyases, are available from commercial sources, e.g., Megazyme, Inc. (www.megazyme.com), NZYTech (www.nzytech.com), and PROZOMIX (www.prozomix.com).
Design-Expert software (Stat-Ease, Inc.) is used to create simplex-lattice designs and to analyze responses (in this case, Glc and Xyl release). Mixtures contain 4-20 components, which can vary in proportion between 0 and 100%. Assay points typically include the extreme vertices with a sufficient number of intervening points to generate a valid model. In the terminology of experimental design, most of our studies are "mixture" experiments, meaning that the sum of all components adds to a total fixed protein loading (expressed as mg/g glucan). The number of mixtures in the simplex-lattice depends on both the number of components in the mixture and the degree of polynomial (quadratic or cubic). For example, a 6-component experiment will entail 63 separate reactions with an augmented special cubic model, which can detect three-way interactions, whereas only 23 individual reactions are necessary with an augmented quadratic model. For mixtures containing more than eight components, a quadratic experimental design is more practical, and in our experience such models are usually statistically valid.
All enzyme loadings are expressed as a percentage of the final total loading (which for our experiments is typically 15 mg protein/g glucan). For "core" enzymes, the lower percentage limit is set to 5%. This limit was derived from our experience in which yields of Glc and/or Xyl were very low if any core enzyme was present at 0%. Poor models result from too many samples showing very low Glc or Xyl yields. Setting a lower limit in turn determines an upper limit. That is, for a six-component experiment, if the lower limit for each single component is set to 5%, then the upper limit of each single component will be 75%. The lower limits of all other enzymes considered as "accessory" are set to 0%. "Core" and "accessory" are somewhat arbitrary designations and will differ depending on the substrate, but in our studies the core enzymes for release of Glc from corn stover comprise the following enzymes from T. reesei
: CBH1 (also known as Cel7A), CBH2 (Cel6A), EG1(Cel7B), BG (β-glucosidase), EX3 (endo-β1,4-xylanase, GH10), and BX (β-xylosidase).
Bioengineering, Issue 56, cellulase, cellobiohydrolase, glucanase, xylanase, hemicellulase, experimental design, biomass, bioenergy, corn stover, glycosyl hydrolase
Efficient Recombinant Parvovirus Production with the Help of Adenovirus-derived Systems
Institutions: German Cancer Research Center (DKFZ), German Cancer Research Center (DKFZ).
Rodent parvoviruses (PV) such as rat H-1PV and MVM, are small icosahedral, single stranded, DNA viruses. Their genome includes two promoters P4 and P38 which regulate the expression of non-structural (NS1 and NS2) and capsid proteins (VP1 and VP2) respectively1
. They attract high interest as anticancer agents for their oncolytic and oncosuppressive abilities while being non-pathogenic for humans2
. NS1 is the major effector of viral cytotoxicity3
. In order to further enhance their natural antineoplastic activities, derivatives from these vectors have been generated by replacing the gene encoding for the capsid proteins with a therapeutic transgene (e.g.
a cytotoxic polypeptide, cytokine, chemokine, tumour suppressor gene etc.)4
. The recombinant parvoviruses (recPVs) vector retains the NS1/2 coding sequences and the PV genome telomeres which are necessary for viral DNA amplification and packaging. Production of recPVs occurs only in the producer cells (generally HEK293T), by co-transfecting the cells with a second vector (pCMV-VP) expressing the gene encoding for the VP proteins (Fig. 1
. The recPV vectors generated in this way are replication defective. Although recPVs proved to possess enhanced oncotoxic activities with respect to the parental viruses from which they have been generated, their production remains a major challenge and strongly hampers the use of these agents in anti-cancer clinical applications.
We found that introduction of an Ad-5 derived vector containing the E2a, E4(orf6
) and the VA RNA
pXX6 plasmid) into HEK293T improved the production of recPVs by more than 10 fold in comparison to other protocols in use. Based on this finding, we have constructed a novel Ad-VP-helper that contains the genomic adenoviral elements necessary to enhance recPVs production as well as the parvovirus VP gene unit5
. The use of Ad-VP-helper, allows production of rec-PVs using a protocol that relies entirely on viral infection steps (as opposed to plasmid transfection), making possible the use of cell lines that are difficult to transfect (e.g.
NB324K) (Fig. 2
). We present a method that greatly improves the amount of recombinant virus produced, reducing both the production time and costs, without affecting the quality of the final product5
. In addition, large scale production of recPV (in suspension cells and bioreactors) is now conceivable.
Immunology, Issue 62, Recombinant parvovirus, adenovirus, virus production, pXX6, virus helper, virology, oncology
Isolation of Ribosome Bound Nascent Polypeptides in vitro to Identify Translational Pause Sites Along mRNA
Institutions: Cleveland State University.
The rate of translational elongation is non-uniform. mRNA secondary structure, codon usage and mRNA associated proteins may alter ribosome movement on the messagefor review see 1
. However, it's now widely accepted that synonymous codon usage is the primary cause of non-uniform translational elongation rates1
. Synonymous codons are not used with identical frequency. A bias exists in the use of synonymous codons with some codons used more frequently than others2
. Codon bias is organism as well as tissue specific2,3
. Moreover, frequency of codon usage is directly proportional to the concentrations of cognate tRNAs4
. Thus, a frequently used codon will have higher multitude of corresponding tRNAs, which further implies that a frequent codon will be translated faster than an infrequent one. Thus, regions on mRNA enriched in rare codons (potential pause sites) will as a rule slow down ribosome movement on the message and cause accumulation of nascent peptides of the respective sizes5-8
. These pause sites can have functional impact on the protein expression, mRNA stability and protein foldingfor review see 9
. Indeed, it was shown that alleviation of such pause sites can alter ribosome movement on mRNA and subsequently may affect the efficiency of co-translational (in vivo
) protein folding1,7,10,11
. To understand the process of protein folding in vivo
, in the cell, that is ultimately coupled to the process of protein synthesis it is essential to gain comprehensive insights into the impact of codon usage/tRNA content on the movement of ribosomes along mRNA during translational elongation.
Here we describe a simple technique that can be used to locate major translation pause sites for a given mRNA translated in various cell-free systems6-8
. This procedure is based on isolation of nascent polypeptides accumulating on ribosomes during in vitro
translation of a target mRNA. The rationale is that at low-frequency codons, the increase in the residence time of the ribosomes results in increased amounts of nascent peptides of the corresponding sizes. In vitro
transcribed mRNA is used for in vitro
translational reactions in the presence of radioactively labeled amino acids to allow the detection of the nascent chains. In order to isolate ribosome bound nascent polypeptide complexes the translation reaction is layered on top of 30% glycerol solution followed by centrifugation. Nascent polypeptides in polysomal pellet are further treated with ribonuclease A and resolved by SDS PAGE. This technique can be potentially used for any protein and allows analysis of ribosome movement along mRNA and the detection of the major pause sites. Additionally, this protocol can be adapted to study factors and conditions that can alter ribosome movement and thus potentially can also alter the function/conformation of the protein.
Genetics, Issue 65, Molecular Biology, Ribosome, Nascent polypeptide, Co-translational protein folding, Synonymous codon usage, gene regulation
A Toolkit to Enable Hydrocarbon Conversion in Aqueous Environments
Institutions: Delft University of Technology, Delft University of Technology.
This work puts forward a toolkit that enables the conversion of alkanes by Escherichia coli
and presents a proof of principle of its applicability. The toolkit consists of multiple standard interchangeable parts (BioBricks)9
addressing the conversion of alkanes, regulation of gene expression and survival in toxic hydrocarbon-rich environments.
A three-step pathway for alkane degradation was implemented in E. coli
to enable the conversion of medium- and long-chain alkanes to their respective alkanols, alkanals and ultimately alkanoic-acids. The latter were metabolized via the native β-oxidation pathway. To facilitate the oxidation of medium-chain alkanes (C5-C13) and cycloalkanes (C5-C8), four genes (alkB2
) of the alkane hydroxylase system from Gordonia
were transformed into E. coli
. For the conversion of long-chain alkanes (C15-C36), theladA
gene from Geobacillus thermodenitrificans
was implemented. For the required further steps of the degradation process, ADH
and ALDH (
originating from G. thermodenitrificans
) were introduced10,11
. The activity was measured by resting cell assays. For each oxidative step, enzyme activity was observed.
To optimize the process efficiency, the expression was only induced under low glucose conditions: a substrate-regulated promoter, pCaiF, was used. pCaiF is present in E. coli
K12 and regulates the expression of the genes involved in the degradation of non-glucose carbon sources.
The last part of the toolkit - targeting survival - was implemented using solvent tolerance genes, PhPFDα and β, both from Pyrococcus horikoshii
OT3. Organic solvents can induce cell stress and decreased survivability by negatively affecting protein folding. As chaperones, PhPFDα and β improve the protein folding process e.g.
under the presence of alkanes. The expression of these genes led to an improved hydrocarbon tolerance shown by an increased growth rate (up to 50%) in the presences of 10% n
-hexane in the culture medium were observed.
Summarizing, the results indicate that the toolkit enables E. coli
to convert and tolerate hydrocarbons in aqueous environments. As such, it represents an initial step towards a sustainable solution for oil-remediation using a synthetic biology approach.
Bioengineering, Issue 68, Microbiology, Biochemistry, Chemistry, Chemical Engineering, Oil remediation, alkane metabolism, alkane hydroxylase system, resting cell assay, prefoldin, Escherichia coli, synthetic biology, homologous interaction mapping, mathematical model, BioBrick, iGEM
Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit
Institutions: Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio.
Pandemic outbreaks of highly virulent influenza strains can cause widespread morbidity and mortality in human populations worldwide. In the United States alone, an average of 41,400 deaths and 1.86 million hospitalizations are caused by influenza virus infection each year 1
. Point mutations in the polymerase basic protein 2 subunit (PB2) have been linked to the adaptation of the viral infection in humans 2
. Findings from such studies have revealed the biological significance of PB2 as a virulence factor, thus highlighting its potential as an antiviral drug target.
The structural genomics program put forth by the National Institute of Allergy and Infectious Disease (NIAID) provides funding to Emerald Bio and three other Pacific Northwest institutions that together make up the Seattle Structural Genomics Center for Infectious Disease (SSGCID). The SSGCID is dedicated to providing the scientific community with three-dimensional protein structures of NIAID category A-C pathogens. Making such structural information available to the scientific community serves to accelerate structure-based drug design.
Structure-based drug design plays an important role in drug development. Pursuing multiple targets in parallel greatly increases the chance of success for new lead discovery by targeting a pathway or an entire protein family. Emerald Bio has developed a high-throughput, multi-target parallel processing pipeline (MTPP) for gene-to-structure determination to support the consortium. Here we describe the protocols used to determine the structure of the PB2 subunit from four different influenza A strains.
Infection, Issue 76, Structural Biology, Virology, Genetics, Medicine, Biomedical Engineering, Molecular Biology, Infectious Diseases, Microbiology, Genomics, high throughput, multi-targeting, structural genomics, protein crystallization, purification, protein production, X-ray crystallography, Gene Composer, Protein Maker, expression, E. coli, fermentation, influenza, virus, vector, plasmid, cell, cell culture, PCR, sequencing
The Logic, Experimental Steps, and Potential of Heterologous Natural Product Biosynthesis Featuring the Complex Antibiotic Erythromycin A Produced Through E. coli
Institutions: State University of New York at Buffalo, Massachusetts Institute of Technology.
The heterologous production of complex natural products is an approach designed to address current limitations and future possibilities. It is particularly useful for those compounds which possess therapeutic value but cannot be sufficiently produced or would benefit from an improved form of production. The experimental procedures involved can be subdivided into three components: 1) genetic transfer; 2) heterologous reconstitution; and 3) product analysis. Each experimental component is under continual optimization to meet the challenges and anticipate the opportunities associated with this emerging approach.
Heterologous biosynthesis begins with the identification of a genetic sequence responsible for a valuable natural product. Transferring this sequence to a heterologous host is complicated by the biosynthetic pathway complexity responsible for product formation. The antibiotic erythromycin A is a good example. Twenty genes (totaling >50 kb) are required for eventual biosynthesis. In addition, three of these genes encode megasynthases, multi-domain enzymes each ~300 kDa in size. This genetic material must be designed and transferred to E. coli
for reconstituted biosynthesis. The use of PCR isolation, operon construction, multi-cystronic plasmids, and electro-transformation will be described in transferring the erythromycin A genetic cluster to E. coli
Once transferred, the E. coli
cell must support eventual biosynthesis. This process is also challenging given the substantial differences between E. coli
and most original hosts responsible for complex natural product formation. The cell must provide necessary substrates to support biosynthesis and coordinately express the transferred genetic cluster to produce active enzymes. In the case of erythromycin A, the E. coli
cell had to be engineered to provide the two precursors (propionyl-CoA and (2S)-methylmalonyl-CoA) required for biosynthesis. In addition, gene sequence modifications, plasmid copy number, chaperonin co-expression, post-translational enzymatic modification, and process temperature were also required to allow final erythromycin A formation.
Finally, successful production must be assessed. For the erythromycin A case, we will present two methods. The first is liquid chromatography-mass spectrometry (LC-MS) to confirm and quantify production. The bioactivity of erythromycin A will also be confirmed through use of a bioassay in which the antibiotic activity is tested against Bacillus subtilis
. The assessment assays establish erythromycin A biosynthesis from E. coli
and set the stage for future engineering efforts to improve or diversify production and for the production of new complex natural compounds using this approach.
Biomedical Engineering, Issue 71, Chemical Engineering, Bioengineering, Molecular Biology, Cellular Biology, Microbiology, Basic Protocols, Biochemistry, Biotechnology, Heterologous biosynthesis, natural products, antibiotics, erythromycin A, metabolic engineering, E. coli
Using RNA-mediated Interference Feeding Strategy to Screen for Genes Involved in Body Size Regulation in the Nematode C. elegans
Institutions: Borough of Manhattan Community College, City Universtiy of New York (CUNY), Queens College, The City University of New York (CUNY), Queens College, The City University of New York (CUNY).
Double-strand RNA-mediated interference (RNAi) is an effective strategy to knock down target gene expression1-3
. It has been applied to many model systems including plants, invertebrates and vertebrates. There are various methods to achieve RNAi in vivo4,5
. For example, the target gene may be transformed into an RNAi vector, and then either permanently or transiently transformed into cell lines or primary cells to achieve gene knockdown effects; alternatively synthesized double-strand oligonucleotides from specific target genes (RNAi oligos) may be transiently transformed into cell lines or primary cells to silence target genes; or synthesized double-strand RNA molecules may be microinjected into an organism. Since the nematode C. elegans
uses bacteria as a food source, feeding the animals with bacteria expressing double-strand RNA against target genes provides a viable strategy6
. Here we present an RNAi feeding method to score body size phenotype. Body size in C. elegans
is regulated primarily by the TGF- β - like ligand DBL-1, so this assay is appropriate for identification of TGF-β signaling components7
. We used different strains including two RNAi hypersensitive strains to repeat the RNAi feeding experiments. Our results showed that rrf-3
strain gave us the best expected RNAi phenotype. The method is easy to perform, reproducible, and easily quantified. Furthermore, our protocol minimizes the use of specialized equipment, so it is suitable for smaller laboratories or those at predominantly undergraduate institutions.
Developmental Biology, Issue 72, Genetics, Cellular Biology, Molecular Biology, Biochemistry, Basic Protocols, RNAi feeding technique, genetic screen, TGF-beta, body size, C. elegans, Caenorhabditis elegans, RNA-mediated Interference, RNAi, RNA, DNA, gene expression knock down, animal model
RNA-seq Analysis of Transcriptomes in Thrombin-treated and Control Human Pulmonary Microvascular Endothelial Cells
Institutions: Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri-Kansas City.
The characterization of gene expression in cells via measurement of mRNA levels is a useful tool in determining how the transcriptional machinery of the cell is affected by external signals (e.g.
drug treatment), or how cells differ between a healthy state and a diseased state. With the advent and continuous refinement of next-generation DNA sequencing technology, RNA-sequencing (RNA-seq) has become an increasingly popular method of transcriptome analysis to catalog all species of transcripts, to determine the transcriptional structure of all expressed genes and to quantify the changing expression levels of the total set of transcripts in a given cell, tissue or organism1,2
. RNA-seq is gradually replacing DNA microarrays as a preferred method for transcriptome analysis because it has the advantages of profiling a complete transcriptome, providing a digital type datum (copy number of any transcript) and not relying on any known genomic sequence3
Here, we present a complete and detailed protocol to apply RNA-seq to profile transcriptomes in human pulmonary microvascular endothelial cells with or without thrombin treatment. This protocol is based on our recent published study entitled "RNA-seq Reveals Novel Transcriptome of Genes and Their Isoforms in Human Pulmonary Microvascular Endothelial Cells Treated with Thrombin,"4
in which we successfully performed the first complete transcriptome analysis of human pulmonary microvascular endothelial cells treated with thrombin using RNA-seq. It yielded unprecedented resources for further experimentation to gain insights into molecular mechanisms underlying thrombin-mediated endothelial dysfunction in the pathogenesis of inflammatory conditions, cancer, diabetes, and coronary heart disease, and provides potential new leads for therapeutic targets to those diseases.
The descriptive text of this protocol is divided into four parts. The first part describes the treatment of human pulmonary microvascular endothelial cells with thrombin and RNA isolation, quality analysis and quantification. The second part describes library construction and sequencing. The third part describes the data analysis. The fourth part describes an RT-PCR validation assay. Representative results of several key steps are displayed. Useful tips or precautions to boost success in key steps are provided in the Discussion section. Although this protocol uses human pulmonary microvascular endothelial cells treated with thrombin, it can be generalized to profile transcriptomes in both mammalian and non-mammalian cells and in tissues treated with different stimuli or inhibitors, or to compare transcriptomes in cells or tissues between a healthy state and a disease state.
Genetics, Issue 72, Molecular Biology, Immunology, Medicine, Genomics, Proteins, RNA-seq, Next Generation DNA Sequencing, Transcriptome, Transcription, Thrombin, Endothelial cells, high-throughput, DNA, genomic DNA, RT-PCR, PCR
Using SCOPE to Identify Potential Regulatory Motifs in Coregulated Genes
Institutions: Dartmouth College.
SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference1
. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data1
. In this article, we utilize a web version of SCOPE2
to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs3,4
and has been used in other studies5-8
The three algorithms that comprise SCOPE are BEAM9
, which finds non-degenerate motifs (ACCGGT), PRISM10
, which finds degenerate motifs (ASCGWT), and SPACER11
, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well.
Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor.
Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run.
Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail1,2,9-11
Genetics, Issue 51, gene regulation, computational biology, algorithm, promoter sequence motif
A Strategy to Identify de Novo Mutations in Common Disorders such as Autism and Schizophrenia
Institutions: Universite de Montreal, Universite de Montreal, Universite de Montreal.
There are several lines of evidence supporting the role of de novo
mutations as a mechanism for common disorders, such as autism and schizophrenia. First, the de novo
mutation rate in humans is relatively high, so new mutations are generated at a high frequency in the population. However, de novo
mutations have not been reported in most common diseases. Mutations in genes leading to severe diseases where there is a strong negative selection against the phenotype, such as lethality in embryonic stages or reduced reproductive fitness, will not be transmitted to multiple family members, and therefore will not be detected by linkage gene mapping or association studies. The observation of very high concordance in monozygotic twins and very low concordance in dizygotic twins also strongly supports the hypothesis that a significant fraction of cases may result from new mutations. Such is the case for diseases such as autism and schizophrenia. Second, despite reduced reproductive fitness1
and extremely variable environmental factors, the incidence of some diseases is maintained worldwide at a relatively high and constant rate. This is the case for autism and schizophrenia, with an incidence of approximately 1% worldwide. Mutational load can be thought of as a balance between selection for or against a deleterious mutation and its production by de novo
mutation. Lower rates of reproduction constitute a negative selection factor that should reduce the number of mutant alleles in the population, ultimately leading to decreased disease prevalence. These selective pressures tend to be of different intensity in different environments. Nonetheless, these severe mental disorders have been maintained at a constant relatively high prevalence in the worldwide population across a wide range of cultures and countries despite a strong negative selection against them2
. This is not what one would predict in diseases with reduced reproductive fitness, unless there was a high new mutation rate. Finally, the effects of paternal age: there is a significantly increased risk of the disease with increasing paternal age, which could result from the age related increase in paternal de novo
mutations. This is the case for autism and schizophrenia3
. The male-to-female ratio of mutation rate is estimated at about 4–6:1, presumably due to a higher number of germ-cell divisions with age in males. Therefore, one would predict that de novo
mutations would more frequently come from males, particularly older males4
. A high rate of new mutations may in part explain why genetic studies have so far failed to identify many genes predisposing to complexes diseases genes, such as autism and schizophrenia, and why diseases have been identified for a mere 3% of genes in the human genome. Identification for de novo
mutations as a cause of a disease requires a targeted molecular approach, which includes studying parents and affected subjects. The process for determining if the genetic basis of a disease may result in part from de novo
mutations and the molecular approach to establish this link will be illustrated, using autism and schizophrenia as examples.
Medicine, Issue 52, de novo mutation, complex diseases, schizophrenia, autism, rare variations, DNA sequencing
Expression of Recombinant Proteins in the Methylotrophic Yeast Pichia pastoris
Institutions: University of British Columbia - UBC.
Protein expression in the microbial eukaryotic host Pichia pastoris
offers the possibility to generate high amounts of recombinant protein in a fast and easy to use expression system.
As a single-celled microorganism P. pastoris
is easy to manipulate and grows rapidly on inexpensive media at high cell densities. Being a eukaryote, P. pastoris
is able to perform many of the post-translational modifications performed by higher eukaryotic cells and the obtained recombinant proteins undergo protein folding, proteolytic processing, disulfide bond formation and glycosylation .
As a methylotrophic yeast P. pastoris
is capable of metabolizing methanol as its sole carbon source. The strong promoter for alcohol oxidase, AOX1,
is tightly regulated and induced by methanol and it is used for the expression of the gene of interest. Accordingly, the expression of the foreign protein can be induced by adding methanol to the growth medium [2; 3].
Another important advantage is the secretion of the recombinant protein into the growth medium, using a signal sequence to target the foreign protein to the secretory pathway of P. pastoris. With only low levels of endogenous protein secreted to the media by the yeast itself and no added proteins to the media, a heterologous protein builds the majority of the total protein in the medium and facilitates following protein purification steps [3; 4].
The vector used here (pPICZαA) contains the AOX1
promoter for tightly regulated, methanol-induced expression of the gene of interest; the α-factor secretion signal for secretion of the recombinant protein, a Zeocin resistance gene for selection in both E. coli
and a C-terminal peptide containing the c-myc
epitope and a polyhistidine (6xHis) tag for detection and purification of a recombinant protein. We also show western blot analysis of the recombinant protein using the specific Anti-myc
-HRP antibody recognizing the c-myc
epitope on the parent vector.
Microbiology, Issue 36, protein expression, recombinant protein, methylotrophic, yeast, Pichia pastoris, western blot, yeast DNA purification, protein purification