Escherichia coli (E. coli) is the most widely used expression system for the production of recombinant proteins for structural and functional studies. However, purifying proteins is sometimes challenging since many proteins are expressed in an insoluble form. When working with difficult or multiple targets it is therefore recommended to use high throughput (HTP) protein expression screening on a small scale (1-4 ml cultures) to quickly identify conditions for soluble expression. To cope with the various structural genomics programs of the lab, a quantitative (within a range of 0.1-100 mg/L culture of recombinant protein) and HTP protein expression screening protocol was implemented and validated on thousands of proteins. The protocols were automated with the use of a liquid handling robot but can also be performed manually without specialized equipment.
Disulfide-rich venom proteins are gaining increasing recognition for their potential as therapeutic drug leads. They can be highly potent and selective, but their complex disulfide bond networks make them challenging to produce. As a member of the FP7 European Venomics project (www.venomics.eu), our challenge is to develop successful production strategies with the aim of producing thousands of novel venom proteins for functional characterization. Aided by the redox properties of disulfide bond isomerase DsbC, we adapted our HTP production pipeline for the expression of oxidized, functional venom peptides in the E. coli cytoplasm. The protocols are also applicable to the production of diverse disulfide-rich proteins. Here we demonstrate our pipeline applied to the production of animal venom proteins. With the protocols described herein it is likely that soluble disulfide-rich proteins will be obtained in as little as a week. Even from a small scale, there is the potential to use the purified proteins for validating the oxidation state by mass spectrometry, for characterization in pilot studies, or for sensitive micro-assays.
23 Related JoVE Articles!
Preparation of the Mgm101 Recombination Protein by MBP-based Tagging Strategy
Institutions: State University of New York Upstate Medical University.
gene was identified 20 years ago for its role in the maintenance of mitochondrial DNA. Studies from several groups have suggested that the Mgm101 protein is involved in the recombinational repair of mitochondrial DNA. Recent investigations have indicated that Mgm101 is related to the Rad52-type recombination protein family. These proteins form large oligomeric rings and promote the annealing of homologous single stranded DNA molecules. However, the characterization of Mgm101 has been hindered by the difficulty in producing the recombinant protein. Here, a reliable procedure for the preparation of recombinant Mgm101 is described. Maltose Binding Protein (MBP)-tagged Mgm101 is first expressed in Escherichia coli
. The fusion protein is initially purified by amylose affinity chromatography. After being released by proteolytic cleavage, Mgm101 is separated from MBP by cationic exchange chromatography. Monodispersed Mgm101 is then obtained by size exclusion chromatography. A yield of ~0.87 mg of Mgm101 per liter of bacterial culture can be routinely obtained. The recombinant Mgm101 has minimal contamination of DNA. The prepared samples are successfully used for biochemical, structural and single particle image analyses of Mgm101. This protocol may also be used for the preparation of other large oligomeric DNA-binding proteins that may be misfolded and toxic to bacterial cells.
Biochemistry, Issue 76, Genetics, Molecular Biology, Cellular Biology, Microbiology, Bacteria, Proteins, Mgm101, Rad52, mitochondria, recombination, mtDNA, maltose-binding protein, MBP, E. coli., yeast, Saccharomyces cerevisiae, chromatography, electron microscopy, cell culture
Using SecM Arrest Sequence as a Tool to Isolate Ribosome Bound Polypeptides
Institutions: Cleveland State University.
Extensive research has provided ample evidences suggesting that protein folding in the cell is a co-translational process1-5
. However, the exact pathway that polypeptide chain follows during co-translational folding to achieve its functional form is still an enigma. In order to understand this process and to determine the exact conformation of the co-translational folding intermediates, it is essential to develop techniques that allow the isolation of RNCs carrying nascent chains of predetermined sizes to allow their further structural analysis.
SecM (secretion monitor) is a 170 amino acid E. coli
protein that regulates expression of the downstream SecA (secretion driving) ATPase in the secM-secA
. Nakatogawa and Ito originally found that a 17 amino acid long sequence (150-FSTPVWISQAQGIRAG
P-166) in the C-terminal region of the SecM protein is sufficient and necessary to cause stalling of SecM elongation at Gly165, thereby producing peptidyl-glycyl-tRNA stably bound to the ribosomal P-site7-9
. More importantly, it was found that this 17 amino acid long sequence can be fused to the C-terminus of virtually any full-length and/or truncated protein thus allowing the production of RNCs carrying nascent chains of predetermined sizes7
. Thus, when fused or inserted into the target protein, SecM stalling sequence produces arrest of the polypeptide chain elongation and generates stable RNCs both in vivo
in E. coli
cells and in vitro
in a cell-free system. Sucrose gradient centrifugation is further utilized to isolate RNCs.
The isolated RNCs can be used to analyze structural and functional features of the co-translational folding intermediates. Recently, this technique has been successfully used to gain insights into the structure of several ribosome bound nascent chains10,11
. Here we describe the isolation of bovine Gamma-B Crystallin RNCs fused to SecM and generated in an in vitro
Molecular Biology, Issue 64, Ribosome, nascent polypeptides, co-translational protein folding, translational arrest, in vitro translation
High Throughput Screening of Fungal Endoglucanase Activity in Escherichia coli
Institutions: California Institute of Technology, California Institute of Technology.
Cellulase enzymes (endoglucanases, cellobiohydrolases, and β-glucosidases) hydrolyze cellulose into component sugars, which in turn can be converted into fuel alcohols1
. The potential for enzymatic hydrolysis of cellulosic biomass to provide renewable energy has intensified efforts to engineer cellulases for economical fuel production2
. Of particular interest are fungal cellulases3-8
, which are already being used industrially for foods and textiles processing.
Identifying active variants among a library of mutant cellulases is critical to the engineering process; active mutants can be further tested for improved properties and/or subjected to additional mutagenesis. Efficient engineering of fungal cellulases has been hampered by a lack of genetic tools for native organisms and by difficulties in expressing the enzymes in heterologous hosts. Recently, Morikawa and coworkers developed a method for expressing in E. coli
the catalytic domains of endoglucanases from H. jecorina3,9
, an important industrial fungus with the capacity to secrete cellulases in large quantities. Functional E. coli
expression has also been reported for cellulases from other fungi, including Macrophomina phaseolina10
and Phanerochaete chrysosporium11-12
We present a method for high throughput screening of fungal endoglucanase activity in E. coli
. (Fig 1
) This method uses the common microbial dye Congo Red (CR) to visualize enzymatic degradation of carboxymethyl cellulose (CMC) by cells growing on solid medium. The activity assay requires inexpensive reagents, minimal manipulation, and gives unambiguous results as zones of degradation (“halos”) at the colony site. Although a quantitative measure of enzymatic activity cannot be determined by this method, we have found that halo size correlates with total enzymatic activity in the cell. Further characterization of individual positive clones will determine , relative protein fitness.
Traditional bacterial whole cell CMC/CR activity assays13
involve pouring agar containing CMC onto colonies, which is subject to cross-contamination, or incubating cultures in CMC agar wells, which is less amenable to large-scale experimentation. Here we report an improved protocol that modifies existing wash methods14
for cellulase activity: cells grown on CMC agar plates are removed prior to CR staining. Our protocol significantly reduces cross-contamination and is highly scalable, allowing the rapid screening of thousands of clones. In addition to H. jecorina enzymes
, we have expressed and screened endoglucanase variants from the Thermoascus aurantiacus
and Penicillium decumbens
(shown in Figure 2
), suggesting that this protocol is applicable to enzymes from a range of organisms.
Molecular Biology, Issue 54, cellulase, endoglucanase, CMC, Congo Red
Purification of the M. magneticum Strain AMB-1 Magnetosome Associated Protein MamAΔ41
Institutions: Ben-Gurion University.
Magnetotactic bacteria comprise a diverse group of aquatic microorganisms that are able to orientate themselves along geomagnetic fields. This behavior is believed to aid their search for suitable environments (1)
. This capability is conferred by the magnetosome, a subcellular organelle that consists of a linear-chain assembly of lipid vesicles each able to biomineralize and enclose a ~50-nm crystal of magnetite or greigite. A principle component of the magnetosome that was shown to be required for the formation of functional vesicles is MamA. MamA is a highly abundant magnetosome-associated protein which is one of the most characterized magnetosome-associated proteins in vivo (2-6)
. This article focuses on the purification of MamA, which despite being studied in vivo
, no clear functional or structural details have been identified for it. Bioinformatics analysis suggested that MamA is a tetra-tricopeptide repeat (TPR) containing protein. TPR is a structural motif found as such or forming part of a bigger fold in a wide range of proteins, it serves as a template for protein-protein interactions and mediates multi-protein complexes (7)
. TPRs are involved in many crucial tasks in eukaryotic cell organelle processes and many bacterial pathways (8-14).
In order to understand MamA, a unique TPR containing protein, highly purified protein is required as a first step. In this article, we present the purification protocol for a stable MamA deletion mutant (MamAΔ41) from M. magneticum
Cellular Biology, Issue 37, Recombinant protein purification, magnetotactic bacteria, magnetosome, MamA
GST-His purification: A Two-step Affinity Purification Protocol Yielding Full-length Purified Proteins
Institutions: Hôtel-Dieu de Québec.
Key assays in enzymology for the biochemical characterization of proteins in vitro
necessitate high concentrations of the purified protein of interest. Protein purification protocols should combine efficiency, simplicity and cost effectiveness1
. Here, we describe the GST-His method as a new small-scale affinity purification system for recombinant proteins, based on a N-terminal Glutathione Sepharose Tag (GST)2,3
and a C-terminal 10xHis tag4
, which are both fused to the protein of interest. The latter construct is used to generate baculoviruses, for infection of Sf9 infected cells for protein expression5
. GST is a rather long tag (29 kDa) which serves to ensure purification efficiency. However, it might influence physiological properties of the protein. Hence, it is subsequently cleaved off the protein using the PreScission enzyme6
. In order to ensure maximum purity and to remove the cleaved GST, we added a second affinity purification step based on the comparatively small His-Tag. Importantly, our technique is based on two different tags flanking the two ends of the protein, which is an efficient tool to remove degraded proteins and, therefore, enriches full-length proteins. The method presented here does not require an expensive instrumental setup, such as FPLC. Additionally, we incorporated MgCl2
and ATP washes to remove heat shock protein impurities and nuclease treatment to abolish contaminating nucleic acids. In summary, the combination of two different tags flanking the N- and the C-terminal and the capability to cleave off one of the tags, guaranties the recovery of a highly purified and full-length protein of interest.
Biochemistry, Issue 80, Genetics, Molecular Biology, Proteins, Proteomics, recombinant protein, affinity purification, Glutathione Sepharose Tag, Talon metal affinity resin
Optimization and Utilization of Agrobacterium-mediated Transient Protein Production in Nicotiana
Institutions: Fraunhofer USA Center for Molecular Biotechnology.
-mediated transient protein production in plants is a promising approach to produce vaccine antigens and therapeutic proteins within a short period of time. However, this technology is only just beginning to be applied to large-scale production as many technological obstacles to scale up are now being overcome. Here, we demonstrate a simple and reproducible method for industrial-scale transient protein production based on vacuum infiltration of Nicotiana
plants with Agrobacteria
carrying launch vectors. Optimization of Agrobacterium
cultivation in AB medium allows direct dilution of the bacterial culture in Milli-Q water, simplifying the infiltration process. Among three tested species of Nicotiana
, N. excelsiana
× N. excelsior
) was selected as the most promising host due to the ease of infiltration, high level of reporter protein production, and about two-fold higher biomass production under controlled environmental conditions. Induction of Agrobacterium
harboring pBID4-GFP (Tobacco mosaic virus
-based) using chemicals such as acetosyringone and monosaccharide had no effect on the protein production level. Infiltrating plant under 50 to 100 mbar for 30 or 60 sec resulted in about 95% infiltration of plant leaf tissues. Infiltration with Agrobacterium
laboratory strain GV3101 showed the highest protein production compared to Agrobacteria
laboratory strains LBA4404 and C58C1 and wild-type Agrobacteria
strains at6, at10, at77 and A4. Co-expression of a viral RNA silencing suppressor, p23 or p19, in N. benthamiana
resulted in earlier accumulation and increased production (15-25%) of target protein (influenza virus hemagglutinin).
Plant Biology, Issue 86, Agroinfiltration, Nicotiana benthamiana, transient protein production, plant-based expression, viral vector, Agrobacteria
Drug-induced Sensitization of Adenylyl Cyclase: Assay Streamlining and Miniaturization for Small Molecule and siRNA Screening Applications
Institutions: Purdue University, Eli Lilly and Company.
Sensitization of adenylyl cyclase (AC) signaling has been implicated in a variety of neuropsychiatric and neurologic disorders including substance abuse and Parkinson's disease. Acute activation of Gαi/o-linked receptors inhibits AC activity, whereas persistent activation of these receptors results in heterologous sensitization of AC and increased levels of intracellular cAMP. Previous studies have demonstrated that this enhancement of AC responsiveness is observed both in vitro
and in vivo
following the chronic activation of several types of Gαi/o-linked receptors including D2
dopamine and μ opioid receptors. Although heterologous sensitization of AC was first reported four decades ago, the mechanism(s) that underlie this phenomenon remain largely unknown. The lack of mechanistic data presumably reflects the complexity involved with this adaptive response, suggesting that nonbiased approaches could aid in identifying the molecular pathways involved in heterologous sensitization of AC. Previous studies have implicated kinase and Gbγ signaling as overlapping components that regulate the heterologous sensitization of AC. To identify unique and additional overlapping targets associated with sensitization of AC, the development and validation of a scalable cAMP sensitization assay is required for greater throughput. Previous approaches to study sensitization are generally cumbersome involving continuous cell culture maintenance as well as a complex methodology for measuring cAMP accumulation that involves multiple wash steps. Thus, the development of a robust cell-based assay that can be used for high throughput screening (HTS) in a 384 well format would facilitate future studies. Using two D2
dopamine receptor cellular models (i.e
), we have converted our 48-well sensitization assay (>20 steps 4-5 days) to a five-step, single day assay in 384-well format. This new format is amenable to small molecule screening, and we demonstrate that this assay design can also be readily used for reverse transfection of siRNA in anticipation of targeted siRNA library screening.
Bioengineering, Issue 83, adenylyl cyclase, cAMP, heterologous sensitization, superactivation, D2 dopamine, μ opioid, siRNA
Optimized Transfection Strategy for Expression and Electrophysiological Recording of Recombinant Voltage-Gated Ion Channels in HEK-293T Cells
Institutions: University of Waterloo.
The in vitro
expression and electrophysiological recording of recombinant voltage-gated ion channels in cultured human embryonic kidney cells (HEK-293T) is a ubiquitous research strategy. HEK-293T cells must be plated onto glass coverslips at low enough density so that they are not in contact with each other in order to allow for electrophysiological recording without confounding effects due to contact with adjacent cells. Transfected channels must also express with high efficiency at the plasma membrane for whole-cell patch clamp recording of detectable currents above noise levels. Heterologous ion channels often require long incubation periods at 28°C after transfection in order to achieve adequate membrane expression, but there are increasing losses of cell-coverslip adhesion and membrane stability at this temperature. To circumvent this problem, we developed an optimized strategy to transfect and plate HEK-293T cells. This method requires that cells be transfected at a relatively high confluency, and incubated at 28°C for varying incubation periods post-transfection to allow for adequate ion channel protein expression. Transfected cells are then plated onto glass coverslips and incubated at 37°C for several hours, which allows for rigid cell attachment to the coverslips and membrane restabilization. Cells can be recorded shortly after plating, or can be transferred to 28°C for further incubation. We find that the initial incubation at 28°C, after transfection but before plating, is key for the efficient expression of heterologous ion channels that normally do not express well at the plasma membrane. Positively transfected, cultured cells are identified by co-expressed eGFP or eGFP expressed from a bicistronic vector (e.g. pIRES2-EGFP) containing the recombinant ion channel cDNA just upstream of an internal ribosome entry site and an eGFP coding sequence. Whole-cell patch clamp recording requires specialized equipment, plus the crafting of polished recording electrodes and L-shaped ground electrodes from borosilicate glass. Drug delivery to study the pharmacology of ion channels can be achieved by directly micropipetting drugs into the recording dish, or by using microperfusion or gravity flow systems that produce uninterrupted streams of drug solution over recorded cells.
Neuroscience, Issue 47, brain, invertebrate, calcium channel, electrophysiology, voltage-gated
Transient Gene Expression in Tobacco using Gibson Assembly and the Gene Gun
Institutions: Harvard University, Harvard Medical School, Delft University of Technology.
In order to target a single protein to multiple subcellular organelles, plants typically duplicate the relevant genes, and express each gene separately using complex regulatory strategies including differential promoters and/or signal sequences. Metabolic engineers and synthetic biologists interested in targeting enzymes to a particular organelle are faced with a challenge: For a protein that is to be localized to more than one organelle, the engineer must clone the same gene multiple times. This work presents a solution to this strategy: harnessing alternative splicing of mRNA. This technology takes advantage of established chloroplast and peroxisome targeting sequences and combines them into a single mRNA that is alternatively spliced. Some splice variants are sent to the chloroplast, some to the peroxisome, and some to the cytosol. Here the system is designed for multiple-organelle targeting with alternative splicing. In this work, GFP was expected to be expressed in the chloroplast, cytosol, and peroxisome by a series of rationally designed 5’ mRNA tags. These tags have the potential to reduce the amount of cloning required when heterologous genes need to be expressed in multiple subcellular organelles. The constructs were designed in previous work11
, and were cloned using Gibson assembly, a ligation independent cloning method that does not require restriction enzymes. The resultant plasmids were introduced into Nicotiana benthamiana
epidermal leaf cells with a modified Gene Gun protocol. Finally, transformed leaves were observed with confocal microscopy.
Environmental Sciences, Issue 86, Plant Leaves, Synthetic Biology, Plants, Genetically Modified, DNA, Plant, RNA, Gene Targeting, Plant Physiological Processes, Genes, Gene gun, Gibson assembly, Nicotiana benthamiana, Alternative splicing, confocal microscopy, chloroplast, peroxisome
Expression of Recombinant Cellulase Cel5A from Trichoderma reesei in Tobacco Plants
Institutions: RWTH Aachen University, Fraunhofer Institute for Molecular Biology and Applied Ecology.
Cellulose degrading enzymes, cellulases, are targets of both research and industrial interests. The preponderance of these enzymes in difficult-to-culture organisms, such as hyphae-building fungi and anaerobic bacteria, has hastened the use of recombinant technologies in this field. Plant expression methods are a desirable system for large-scale production of enzymes and other industrially useful proteins. Herein, methods for the transient expression of a fungal endoglucanase, Trichoderma reesei
Cel5A, in Nicotiana tabacum
are demonstrated. Successful protein expression is shown, monitored by fluorescence using an mCherry-enzyme fusion protein. Additionally, a set of basic tests are used to examine the activity of transiently expressed T. reesei
Cel5A, including SDS-PAGE, Western blotting, zymography, as well as fluorescence and dye-based substrate degradation assays. The system described here can be used to produce an active cellulase in a short time period, so as to assess the potential for further production in plants through constitutive or inducible expression systems.
Environmental Sciences, Issue 88, heterologous expression, endoplasmic reticulum, endoglucanase, cellulose, glycosyl-hydrolase, fluorescence, cellulase, Trichoderma reesei, tobacco plants
Orthogonal Protein Purification Facilitated by a Small Bispecific Affinity Tag
Institutions: Royal Institute of Technology.
Due to the high costs associated with purification of recombinant proteins the protocols need to be rationalized. For high-throughput efforts there is a demand for general methods that do not require target protein specific optimization1
. To achieve this, purification tags that genetically can be fused to the gene of interest are commonly used2
. The most widely used affinity handle is the hexa-histidine tag, which is suitable for purification under both native and denaturing conditions3
. The metabolic burden for producing the tag is low, but it does not provide as high specificity as competing affinity chromatography based strategies1,2
Here, a bispecific purification tag with two different binding sites on a 46 amino acid, small protein domain has been developed. The albumin-binding domain is derived from Streptococcal protein G and has a strong inherent affinity to human serum albumin (HSA). Eleven surface-exposed amino acids, not involved in albumin-binding4
, were genetically randomized to produce a combinatorial library. The protein library with the novel randomly arranged binding surface (Figure 1) was expressed on phage particles to facilitate selection of binders by phage display technology. Through several rounds of biopanning against a dimeric Z-domain derived from Staphylococcal protein A5
, a small, bispecific molecule with affinity for both HSA and the novel target was identified6
The novel protein domain, referred to as ABDz1, was evaluated as a purification tag for a selection of target proteins with different molecular weight, solubility and isoelectric point. Three target proteins were expressed in Escherishia coli
with the novel tag fused to their N-termini and thereafter affinity purified. Initial purification on either a column with immobilized HSA or Z-domain resulted in relatively pure products. Two-step affinity purification with the bispecific tag resulted in substantial improvement of protein purity. Chromatographic media with the Z-domain immobilized, for example MabSelect SuRe, are readily available for purification of antibodies and HSA can easily be chemically coupled to media to provide the second matrix.
This method is especially advantageous when there is a high demand on purity of the recovered target protein. The bifunctionality of the tag allows two different chromatographic steps to be used while the metabolic burden on the expression host is limited due to the small size of the tag. It provides a competitive alternative to so called combinatorial tagging where multiple tags are used in combination1,7
Molecular Biology, Issue 59, Affinity chromatography, albumin-binding domain, human serum albumin, Z-domain
A Toolkit to Enable Hydrocarbon Conversion in Aqueous Environments
Institutions: Delft University of Technology, Delft University of Technology.
This work puts forward a toolkit that enables the conversion of alkanes by Escherichia coli
and presents a proof of principle of its applicability. The toolkit consists of multiple standard interchangeable parts (BioBricks)9
addressing the conversion of alkanes, regulation of gene expression and survival in toxic hydrocarbon-rich environments.
A three-step pathway for alkane degradation was implemented in E. coli
to enable the conversion of medium- and long-chain alkanes to their respective alkanols, alkanals and ultimately alkanoic-acids. The latter were metabolized via the native β-oxidation pathway. To facilitate the oxidation of medium-chain alkanes (C5-C13) and cycloalkanes (C5-C8), four genes (alkB2
) of the alkane hydroxylase system from Gordonia
were transformed into E. coli
. For the conversion of long-chain alkanes (C15-C36), theladA
gene from Geobacillus thermodenitrificans
was implemented. For the required further steps of the degradation process, ADH
and ALDH (
originating from G. thermodenitrificans
) were introduced10,11
. The activity was measured by resting cell assays. For each oxidative step, enzyme activity was observed.
To optimize the process efficiency, the expression was only induced under low glucose conditions: a substrate-regulated promoter, pCaiF, was used. pCaiF is present in E. coli
K12 and regulates the expression of the genes involved in the degradation of non-glucose carbon sources.
The last part of the toolkit - targeting survival - was implemented using solvent tolerance genes, PhPFDα and β, both from Pyrococcus horikoshii
OT3. Organic solvents can induce cell stress and decreased survivability by negatively affecting protein folding. As chaperones, PhPFDα and β improve the protein folding process e.g.
under the presence of alkanes. The expression of these genes led to an improved hydrocarbon tolerance shown by an increased growth rate (up to 50%) in the presences of 10% n
-hexane in the culture medium were observed.
Summarizing, the results indicate that the toolkit enables E. coli
to convert and tolerate hydrocarbons in aqueous environments. As such, it represents an initial step towards a sustainable solution for oil-remediation using a synthetic biology approach.
Bioengineering, Issue 68, Microbiology, Biochemistry, Chemistry, Chemical Engineering, Oil remediation, alkane metabolism, alkane hydroxylase system, resting cell assay, prefoldin, Escherichia coli, synthetic biology, homologous interaction mapping, mathematical model, BioBrick, iGEM
Expression, Isolation, and Purification of Soluble and Insoluble Biotinylated Proteins for Nerve Tissue Regeneration
Institutions: University of Akron.
Recombinant protein engineering has utilized Escherichia coli (E. coli)
expression systems for nearly 4 decades, and today E. coli
is still the most widely used host organism. The flexibility of the system allows for the addition of moieties such as a biotin tag (for streptavidin interactions) and larger functional proteins like green fluorescent protein or cherry red protein. Also, the integration of unnatural amino acids like metal ion chelators, uniquely reactive functional groups, spectroscopic probes, and molecules imparting post-translational modifications has enabled better manipulation of protein properties and functionalities. As a result this technique creates customizable fusion proteins that offer significant utility for various fields of research. More specifically, the biotinylatable protein sequence has been incorporated into many target proteins because of the high affinity interaction between biotin with avidin and streptavidin. This addition has aided in enhancing detection and purification of tagged proteins as well as opening the way for secondary applications such as cell sorting. Thus, biotin-labeled molecules show an increasing and widespread influence in bioindustrial and biomedical fields. For the purpose of our research we have engineered recombinant biotinylated fusion proteins containing nerve growth factor (NGF) and semaphorin3A (Sema3A) functional regions. We have reported previously how these biotinylated fusion proteins, along with other active protein sequences, can be tethered to biomaterials for tissue engineering and regenerative purposes. This protocol outlines the basics of engineering biotinylatable proteins at the milligram scale, utilizing a T7 lac
inducible vector and E. coli
expression hosts, starting from transformation to scale-up and purification.
Bioengineering, Issue 83, protein engineering, recombinant protein production, AviTag, BirA, biotinylation, pET vector system, E. coli, inclusion bodies, Ni-NTA, size exclusion chromatography
Multi-target Parallel Processing Approach for Gene-to-structure Determination of the Influenza Polymerase PB2 Subunit
Institutions: Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio, Emerald Bio.
Pandemic outbreaks of highly virulent influenza strains can cause widespread morbidity and mortality in human populations worldwide. In the United States alone, an average of 41,400 deaths and 1.86 million hospitalizations are caused by influenza virus infection each year 1
. Point mutations in the polymerase basic protein 2 subunit (PB2) have been linked to the adaptation of the viral infection in humans 2
. Findings from such studies have revealed the biological significance of PB2 as a virulence factor, thus highlighting its potential as an antiviral drug target.
The structural genomics program put forth by the National Institute of Allergy and Infectious Disease (NIAID) provides funding to Emerald Bio and three other Pacific Northwest institutions that together make up the Seattle Structural Genomics Center for Infectious Disease (SSGCID). The SSGCID is dedicated to providing the scientific community with three-dimensional protein structures of NIAID category A-C pathogens. Making such structural information available to the scientific community serves to accelerate structure-based drug design.
Structure-based drug design plays an important role in drug development. Pursuing multiple targets in parallel greatly increases the chance of success for new lead discovery by targeting a pathway or an entire protein family. Emerald Bio has developed a high-throughput, multi-target parallel processing pipeline (MTPP) for gene-to-structure determination to support the consortium. Here we describe the protocols used to determine the structure of the PB2 subunit from four different influenza A strains.
Infection, Issue 76, Structural Biology, Virology, Genetics, Medicine, Biomedical Engineering, Molecular Biology, Infectious Diseases, Microbiology, Genomics, high throughput, multi-targeting, structural genomics, protein crystallization, purification, protein production, X-ray crystallography, Gene Composer, Protein Maker, expression, E. coli, fermentation, influenza, virus, vector, plasmid, cell, cell culture, PCR, sequencing
The Logic, Experimental Steps, and Potential of Heterologous Natural Product Biosynthesis Featuring the Complex Antibiotic Erythromycin A Produced Through E. coli
Institutions: State University of New York at Buffalo, Massachusetts Institute of Technology.
The heterologous production of complex natural products is an approach designed to address current limitations and future possibilities. It is particularly useful for those compounds which possess therapeutic value but cannot be sufficiently produced or would benefit from an improved form of production. The experimental procedures involved can be subdivided into three components: 1) genetic transfer; 2) heterologous reconstitution; and 3) product analysis. Each experimental component is under continual optimization to meet the challenges and anticipate the opportunities associated with this emerging approach.
Heterologous biosynthesis begins with the identification of a genetic sequence responsible for a valuable natural product. Transferring this sequence to a heterologous host is complicated by the biosynthetic pathway complexity responsible for product formation. The antibiotic erythromycin A is a good example. Twenty genes (totaling >50 kb) are required for eventual biosynthesis. In addition, three of these genes encode megasynthases, multi-domain enzymes each ~300 kDa in size. This genetic material must be designed and transferred to E. coli
for reconstituted biosynthesis. The use of PCR isolation, operon construction, multi-cystronic plasmids, and electro-transformation will be described in transferring the erythromycin A genetic cluster to E. coli
Once transferred, the E. coli
cell must support eventual biosynthesis. This process is also challenging given the substantial differences between E. coli
and most original hosts responsible for complex natural product formation. The cell must provide necessary substrates to support biosynthesis and coordinately express the transferred genetic cluster to produce active enzymes. In the case of erythromycin A, the E. coli
cell had to be engineered to provide the two precursors (propionyl-CoA and (2S)-methylmalonyl-CoA) required for biosynthesis. In addition, gene sequence modifications, plasmid copy number, chaperonin co-expression, post-translational enzymatic modification, and process temperature were also required to allow final erythromycin A formation.
Finally, successful production must be assessed. For the erythromycin A case, we will present two methods. The first is liquid chromatography-mass spectrometry (LC-MS) to confirm and quantify production. The bioactivity of erythromycin A will also be confirmed through use of a bioassay in which the antibiotic activity is tested against Bacillus subtilis
. The assessment assays establish erythromycin A biosynthesis from E. coli
and set the stage for future engineering efforts to improve or diversify production and for the production of new complex natural compounds using this approach.
Biomedical Engineering, Issue 71, Chemical Engineering, Bioengineering, Molecular Biology, Cellular Biology, Microbiology, Basic Protocols, Biochemistry, Biotechnology, Heterologous biosynthesis, natural products, antibiotics, erythromycin A, metabolic engineering, E. coli
Isolation of Ribosome Bound Nascent Polypeptides in vitro to Identify Translational Pause Sites Along mRNA
Institutions: Cleveland State University.
The rate of translational elongation is non-uniform. mRNA secondary structure, codon usage and mRNA associated proteins may alter ribosome movement on the messagefor review see 1
. However, it's now widely accepted that synonymous codon usage is the primary cause of non-uniform translational elongation rates1
. Synonymous codons are not used with identical frequency. A bias exists in the use of synonymous codons with some codons used more frequently than others2
. Codon bias is organism as well as tissue specific2,3
. Moreover, frequency of codon usage is directly proportional to the concentrations of cognate tRNAs4
. Thus, a frequently used codon will have higher multitude of corresponding tRNAs, which further implies that a frequent codon will be translated faster than an infrequent one. Thus, regions on mRNA enriched in rare codons (potential pause sites) will as a rule slow down ribosome movement on the message and cause accumulation of nascent peptides of the respective sizes5-8
. These pause sites can have functional impact on the protein expression, mRNA stability and protein foldingfor review see 9
. Indeed, it was shown that alleviation of such pause sites can alter ribosome movement on mRNA and subsequently may affect the efficiency of co-translational (in vivo
) protein folding1,7,10,11
. To understand the process of protein folding in vivo
, in the cell, that is ultimately coupled to the process of protein synthesis it is essential to gain comprehensive insights into the impact of codon usage/tRNA content on the movement of ribosomes along mRNA during translational elongation.
Here we describe a simple technique that can be used to locate major translation pause sites for a given mRNA translated in various cell-free systems6-8
. This procedure is based on isolation of nascent polypeptides accumulating on ribosomes during in vitro
translation of a target mRNA. The rationale is that at low-frequency codons, the increase in the residence time of the ribosomes results in increased amounts of nascent peptides of the corresponding sizes. In vitro
transcribed mRNA is used for in vitro
translational reactions in the presence of radioactively labeled amino acids to allow the detection of the nascent chains. In order to isolate ribosome bound nascent polypeptide complexes the translation reaction is layered on top of 30% glycerol solution followed by centrifugation. Nascent polypeptides in polysomal pellet are further treated with ribonuclease A and resolved by SDS PAGE. This technique can be potentially used for any protein and allows analysis of ribosome movement along mRNA and the detection of the major pause sites. Additionally, this protocol can be adapted to study factors and conditions that can alter ribosome movement and thus potentially can also alter the function/conformation of the protein.
Genetics, Issue 65, Molecular Biology, Ribosome, Nascent polypeptide, Co-translational protein folding, Synonymous codon usage, gene regulation
A Restriction Enzyme Based Cloning Method to Assess the In vitro Replication Capacity of HIV-1 Subtype C Gag-MJ4 Chimeric Viruses
Institutions: Emory University, Emory University.
The protective effect of many HLA class I alleles on HIV-1 pathogenesis and disease progression is, in part, attributed to their ability to target conserved portions of the HIV-1 genome that escape with difficulty. Sequence changes attributed to cellular immune pressure arise across the genome during infection, and if found within conserved regions of the genome such as Gag, can affect the ability of the virus to replicate in vitro
. Transmission of HLA-linked polymorphisms in Gag to HLA-mismatched recipients has been associated with reduced set point viral loads. We hypothesized this may be due to a reduced replication capacity of the virus. Here we present a novel method for assessing the in vitro
replication of HIV-1 as influenced by the gag
gene isolated from acute time points from subtype C infected Zambians. This method uses restriction enzyme based cloning to insert the gag
gene into a common subtype C HIV-1 proviral backbone, MJ4. This makes it more appropriate to the study of subtype C sequences than previous recombination based methods that have assessed the in vitro
replication of chronically derived gag-pro
sequences. Nevertheless, the protocol could be readily modified for studies of viruses from other subtypes. Moreover, this protocol details a robust and reproducible method for assessing the replication capacity of the Gag-MJ4 chimeric viruses on a CEM-based T cell line. This method was utilized for the study of Gag-MJ4 chimeric viruses derived from 149 subtype C acutely infected Zambians, and has allowed for the identification of residues in Gag that affect replication. More importantly, the implementation of this technique has facilitated a deeper understanding of how viral replication defines parameters of early HIV-1 pathogenesis such as set point viral load and longitudinal CD4+ T cell decline.
Infectious Diseases, Issue 90, HIV-1, Gag, viral replication, replication capacity, viral fitness, MJ4, CEM, GXR25
Protocols for Implementing an Escherichia coli Based TX-TL Cell-Free Expression System for Synthetic Biology
Institutions: California Institute of Technology, California Institute of Technology, Massachusetts Institute of Technology, University of Minnesota.
Ideal cell-free expression systems can theoretically emulate an in vivo
cellular environment in a controlled in vitro
This is useful for expressing proteins and genetic circuits in a controlled manner as well as for providing a prototyping environment for synthetic biology.2,3
To achieve the latter goal, cell-free expression systems that preserve endogenous Escherichia coli transcription-translation mechanisms are able to more accurately reflect in vivo
cellular dynamics than those based on T7 RNA polymerase transcription. We describe the preparation and execution of an efficient endogenous E. coli
based transcription-translation (TX-TL) cell-free expression system that can produce equivalent amounts of protein as T7-based systems at a 98% cost reduction to similar commercial systems.4,5
The preparation of buffers and crude cell extract are described, as well as the execution of a three tube TX-TL reaction. The entire protocol takes five days to prepare and yields enough material for up to 3000 single reactions in one preparation. Once prepared, each reaction takes under 8 hr from setup to data collection and analysis. Mechanisms of regulation and transcription exogenous to E. coli
, such as lac/tet repressors and T7 RNA polymerase, can be supplemented.6
Endogenous properties, such as mRNA and DNA degradation rates, can also be adjusted.7
The TX-TL cell-free expression system has been demonstrated for large-scale circuit assembly, exploring biological phenomena, and expression of proteins under both T7- and endogenous promoters.6,8
Accompanying mathematical models are available.9,10
The resulting system has unique applications in synthetic biology as a prototyping environment, or "TX-TL biomolecular breadboard."
Cellular Biology, Issue 79, Bioengineering, Synthetic Biology, Chemistry Techniques, Synthetic, Molecular Biology, control theory, TX-TL, cell-free expression, in vitro, transcription-translation, cell-free protein synthesis, synthetic biology, systems biology, Escherichia coli cell extract, biological circuits, biomolecular breadboard
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
Institutions: Princeton University.
The aim of de novo
protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo
protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity.
To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Genetics, Issue 77, Molecular Biology, Bioengineering, Biochemistry, Biomedical Engineering, Chemical Engineering, Computational Biology, Genomics, Proteomics, Protein, Protein Binding, Computational Biology, Drug Design, optimization (mathematics), Amino Acids, Peptides, and Proteins, De novo protein and peptide design, Drug design, In silico sequence selection, Optimization, Fold specificity, Binding affinity, sequencing
In Vitro Reconstitution of Light-harvesting Complexes of Plants and Green Algae
Institutions: VU University Amsterdam.
In plants and green algae, light is captured by the light-harvesting complexes (LHCs), a family of integral membrane proteins that coordinate chlorophylls and carotenoids. In vivo
, these proteins are folded with pigments to form complexes which are inserted in the thylakoid membrane of the chloroplast. The high similarity in the chemical and physical properties of the members of the family, together with the fact that they can easily lose pigments during isolation, makes their purification in a native state challenging. An alternative approach to obtain homogeneous preparations of LHCs was developed by Plumley and Schmidt in 19871
, who showed that it was possible to reconstitute these complexes in vitro
starting from purified pigments and unfolded apoproteins, resulting in complexes with properties very similar to that of native complexes. This opened the way to the use of bacterial expressed recombinant proteins for in vitro
reconstitution. The reconstitution method is powerful for various reasons: (1) pure preparations of individual complexes can be obtained, (2) pigment composition can be controlled to assess their contribution to structure and function, (3) recombinant proteins can be mutated to study the functional role of the individual residues (e.g.,
pigment binding sites) or protein domain (e.g.,
protein-protein interaction, folding). This method has been optimized in several laboratories and applied to most of the light-harvesting complexes. The protocol described here details the method of reconstituting light-harvesting complexes in vitro
currently used in our laboratory,
and examples describing applications of the method are provided.
Biochemistry, Issue 92, Reconstitution, Photosynthesis, Chlorophyll, Carotenoids, Light Harvesting Protein, Chlamydomonas reinhardtii, Arabidopsis thaliana
Characterization of Complex Systems Using the Design of Experiments Approach: Transient Protein Expression in Tobacco as a Case Study
Institutions: RWTH Aachen University, Fraunhofer Gesellschaft.
Plants provide multiple benefits for the production of biopharmaceuticals including low costs, scalability, and safety. Transient expression offers the additional advantage of short development and production times, but expression levels can vary significantly between batches thus giving rise to regulatory concerns in the context of good manufacturing practice. We used a design of experiments (DoE) approach to determine the impact of major factors such as regulatory elements in the expression construct, plant growth and development parameters, and the incubation conditions during expression, on the variability of expression between batches. We tested plants expressing a model anti-HIV monoclonal antibody (2G12) and a fluorescent marker protein (DsRed). We discuss the rationale for selecting certain properties of the model and identify its potential limitations. The general approach can easily be transferred to other problems because the principles of the model are broadly applicable: knowledge-based parameter selection, complexity reduction by splitting the initial problem into smaller modules, software-guided setup of optimal experiment combinations and step-wise design augmentation. Therefore, the methodology is not only useful for characterizing protein expression in plants but also for the investigation of other complex systems lacking a mechanistic description. The predictive equations describing the interconnectivity between parameters can be used to establish mechanistic models for other complex systems.
Bioengineering, Issue 83, design of experiments (DoE), transient protein expression, plant-derived biopharmaceuticals, promoter, 5'UTR, fluorescent reporter protein, model building, incubation conditions, monoclonal antibody
Production of Disulfide-stabilized Transmembrane Peptide Complexes for Structural Studies
Institutions: The Walter and Eliza Hall Institute of Medical Research, The University of Melbourne.
Physical interactions among the lipid-embedded alpha-helical domains of membrane proteins play a crucial role in folding and assembly of membrane protein complexes and in dynamic processes such as transmembrane (TM) signaling and regulation of cell-surface protein levels. Understanding the structural features driving the association of particular sequences requires sophisticated biophysical and biochemical analyses of TM peptide complexes. However, the extreme hydrophobicity of TM domains makes them very difficult to manipulate using standard peptide chemistry techniques, and production of suitable study material often proves prohibitively challenging. Identifying conditions under which peptides can adopt stable helical conformations and form complexes spontaneously
adds a further level of difficulty. Here we present a procedure for the production of homo- or hetero-dimeric TM peptide complexes from materials that are expressed in E. coli
, thus allowing incorporation of stable isotope labels for nuclear magnetic resonance (NMR) or non-natural amino acids for other applications relatively inexpensively. The key innovation in this method is that TM complexes are produced and purified as covalently associated
(disulfide-crosslinked) assemblies that can form stable, stoichiometric and homogeneous structures when reconstituted into detergent, lipid or other membrane-mimetic materials. We also present carefully optimized procedures for expression and purification that are equally applicable whether producing single TM domains or crosslinked complexes and provide advice for adapting these methods to new TM sequences.
Biochemistry, Issue 73, Structural Biology, Chemistry, Chemical Engineering, Biophysics, Genetics, Molecular Biology, Membrane Proteins, Proteins, Molecular Structure, transmembrane domain, peptide chemistry, membrane protein structure, immune receptors, reversed-phase HPLC, HPLC, peptides, lipids, protein, cloning, TFA Elution, CNBr Digestion, NMR, expression, cell culture
Genetically-encoded Molecular Probes to Study G Protein-coupled Receptors
Institutions: The Rockefeller University.
To facilitate structural and dynamic studies of G protein-coupled receptor (GPCR) signaling complexes, new approaches are required to introduce informative probes or labels into expressed receptors that do not perturb receptor function. We used amber codon suppression technology to genetically-encode the unnatural amino acid, p
-azido-L-phenylalanine (azF) at various targeted positions in GPCRs heterologously expressed in mammalian cells. The versatility of the azido group is illustrated here in different applications to study GPCRs in their native cellular environment or under detergent solubilized conditions. First, we demonstrate a cell-based targeted photocrosslinking technology to identify the residues in the ligand-binding pocket of GPCR where a tritium-labeled small-molecule ligand is crosslinked to a genetically-encoded azido amino acid. We then demonstrate site-specific modification of GPCRs by the bioorthogonal Staudinger-Bertozzi ligation reaction that targets the azido group using phosphine derivatives. We discuss a general strategy for targeted peptide-epitope tagging of expressed membrane proteins in-culture and its detection using a whole-cell-based ELISA approach. Finally, we show that azF-GPCRs can be selectively tagged with fluorescent probes. The methodologies discussed are general, in that they can in principle be applied to any amino acid position in any expressed GPCR to interrogate active signaling complexes.
Genetics, Issue 79, Receptors, G-Protein-Coupled, Protein Engineering, Signal Transduction, Biochemistry, Unnatural amino acid, site-directed mutagenesis, G protein-coupled receptor, targeted photocrosslinking, bioorthogonal labeling, targeted epitope tagging