The Journal of Visualized Experiments (JoVE) is a peer reviewed, PubMed-indexed video journal. Our mission is to increase the productivity of scientific research.

Recommend to Librarian

In JoVE (1)

Other Publications (78)

Articles by Paul D. Adams in JoVE

 JoVE General

Label-free in situ Imaging of Lignification in Plant Cell Walls


JoVE 2064 11/01/2010

1Energy Biosciences Institute, University of California, Berkeley, 2Molecular Foundry, Lawrence Berkeley National Laboratory, 3Physical Biosciences Division, Lawrence Berkeley National Laboratory

A method based on confocal Raman microscopy is presented that affords label-free visualization of lignin in plant cell walls and comparison of lignification in different tissues, samples or species.

Other articles by Paul D. Adams on PubMed

Molecular Dynamics Applied to X-ray Structure Refinement

Simulated annealing, in the form of temperature-controlled molecular dynamics, has been successfully applied to macromolecular X-ray structure optimization. The theory and practice of the method are reviewed, and some recent improvements are described.

Transmembrane Signal Transduction of the Alpha(IIb)beta(3) Integrin

Integrins are composed of noncovalently bound dimers of an alpha- and a beta-subunit. They play an important role in cell-matrix adhesion and signal transduction through the cell membrane. Signal transduction can be initiated by the binding of intracellular proteins to the integrin. Binding leads to a major conformational change. The change is passed on to the extracellular domain through the membrane. The affinity of the extracellular domain to certain ligands increases; thus at least two states exist, a low-affinity and a high-affinity state. The conformations and conformational changes of the transmembrane (TM) domain are the focus of our interest. We show by a global search of helix-helix interactions that the TM section of the family of integrins are capable of adopting a structure similar to the structure of the homodimeric TM protein Glycophorin A. For the alpha(IIb)beta(3) integrin, this structural motif represents the high-affinity state. A second conformation of the TM domain of alpha(IIb)beta(3) is identified as the low-affinity state by known mutational and nuclear magnetic resonance (NMR) studies. A transition between these two states was determined by molecular dynamics (MD) calculations. On the basis of these calculations, we propose a three-state mechanism.

Intramolecular Quenching of Tryptophan Fluorescence by the Peptide Bond in Cyclic Hexapeptides

Intramolecular quenching of tryptophan fluorescence by protein functional groups was studied in a series of rigid cyclic hexapeptides containing a single tryptophan. The solution structure of the canonical peptide c[D-PpYTFWF] (pY, phosphotyrosine) was determined in aqueous solution by 1D- and 2D-(1)H NMR techniques. The peptide backbone has a single predominant conformation. The tryptophan side chain has three chi(1) rotamers: a major chi(1) = -60 degrees rotamer with a population of 0.67, and two minor rotamers of equal population. The peptides have three fluorescence lifetimes of about 3.8, 1.8, and 0.3 ns with relative amplitudes that agree with the chi(1) rotamer populations determined by NMR. The major 3.8-ns lifetime component is assigned to the chi(1) = -60 degrees rotamer. The multiple fluorescence lifetimes are attributed to differences among rotamers in the rate of excited-state electron transfer to peptide bonds. Electron-transfer rates were calculated for the six preferred side chain rotamers using Marcus theory. A simple model with reasonable assumptions gives excellent agreement between observed and calculated lifetimes for the 3.8- and 1.8-ns lifetimes and assigns the 1.8-ns lifetime component to the chi(1) = 180 degrees rotamer. Substitution of phenylalanine by lysine on either side of tryptophan has no effect on fluorescence quantum yield or lifetime, indicating that intramolecular excited-state proton transfer catalyzed by the epsilon-ammonium does not occur in these peptides.

PHENIX: Building New Software for Automated Crystallographic Structure Determination

Structural genomics seeks to expand rapidly the number of protein structures in order to extract the maximum amount of information from genomic sequence databases. The advent of several large-scale projects worldwide leads to many new challenges in the field of crystallographic macromolecular structure determination. A novel software package called PHENIX (Python-based Hierarchical ENvironment for Integrated Xtallography) is therefore being developed. This new software will provide the necessary algorithms to proceed from reduced intensity data to a refined molecular model and to facilitate structure solution for both the novice and expert crystallographer.

Fluorescence of Cis-1-amino-2-(3-indolyl)cyclohexane-1-carboxylic Acid: a Single Tryptophan Chi(1) Rotamer Model

A constrained derivative, cis-1-amino-2-(3-indolyl)cyclohexane-1-carboxylic acid, cis-W3, was designed to test the rotamer model of tryptophan photophysics. The conformational constraint enforces a single chi(1) conformation, analogous to the chi(1) = 60 degrees rotamer of tryptophan. The side-chain torsion angles in the X-ray structure of cis-W3 were chi(1) = 58.5 degrees and chi(2) = -88.7 degrees. Molecular mechanics calculations suggested two chi(2) rotamers for cis-W3 in solution, -100 degrees and 80 degrees, analogous to the chi(2) = +/-90 degrees rotamers of tryptophan. The fluorescence decay of the cis-W3 zwitterion was biexponential with lifetimes of 3.1 and 0.3 ns at 25 degrees C. The relative amplitudes of the lifetime components match the chi(2) rotamer populations predicted by molecular mechanics. The longer lifetime represents the major chi(2) = -100 degrees rotamer. The shorter lifetime represents the minor chi(2) = 80 degrees rotamer having the ammonium group closer to C4 of the indole ring (labeled C5 in the cis-W3 X-ray structure). Intramolecular excited-state proton transfer occurs at indole C4 in the tryptophan zwitterion (Saito, I.; Sugiyama, H.; Yamamoto, A.; Muramatsu, S.; Matsuura,T. J. Am. Chem. Soc. 1984, 106, 4286-4287). Photochemical isotope exchange experiments showed that H-D exchange occurs exclusively at C5 in the cis-W3 zwitterion, consistent with the presence of the chi(2) = 80 degrees rotamer in solution. The rates of two nonradiative processes, excited-state proton and electron transfer, were measured for individual chi(2) rotamers. The excited-state proton-transfer rate was determined from H-D exchange and fluorescence lifetime data. The excited-state electron-transfer rate was determined from the temperature dependence of the fluorescence lifetime. The major quenching process in the -100 degrees rotamer is electron transfer from the excited indole to carboxylate. Electron transfer also occurs in the 80 degrees rotamer, but the major quenching process is intramolecular proton transfer. Both quenching processes are suppressed by deprotonation of the amino group. The results for cis-W3 provide compelling evidence that the complex fluorescence decay of the tryptophan zwitterion originates in ground-state heterogeneity with the different lifetimes primarily reflecting different intramolecular excited-state proton- and electron-transfer rates in various rotamers.

Computational Aspects of High-throughput Crystallographic Macromolecular Structure Determination

High-resolution Structure of RNase P Protein from Thermotoga Maritima

The structure of RNase P protein from the hyperthermophilic bacterium Thermotoga maritima was determined at 1.2-A resolution by using x-ray crystallography. This protein structure is from an ancestral-type RNase P and bears remarkable similarity to the recently determined structures of RNase P proteins from bacteria that have the distinct, Bacillus type of RNase P. These two types of protein span the extent of bacterial RNase P diversity, so the results generalize the structure of the bacterial RNase P protein. The broad phylogenetic conservation of structure and distribution of potential RNA-binding elements in the RNase P proteins indicate that all of these homologous proteins bind to their cognate RNAs primarily by interaction with the phylogenetically conserved core of the RNA. The protein is found to dimerize through an extensive, well-ordered interface. This dimerization may reflect a mechanism of thermal stability of the protein before assembly with the RNA moiety of the holoenzyme.

X-ray Crystallographic and Kinetic Studies of Human Sorbitol Dehydrogenase

Sorbitol dehydrogenase (hSDH) and aldose reductase form the polyol pathway that interconverts glucose and fructose. Redox changes from overproduction of the coenzyme NADH by SDH may play a role in diabetes-induced dysfunction in sensitive tissues, making SDH a therapeutic target for diabetic complications. We have purified and determined the crystal structures of human SDH alone, SDH with NAD(+), and SDH with NADH and an inhibitor that is competitive with fructose. hSDH is a tetramer of identical, catalytically active subunits. In the apo and NAD(+) complex, the catalytic zinc is coordinated by His69, Cys44, Glu70, and a water molecule. The inhibitor coordinates the zinc through an oxygen and a nitrogen atom with the concomitant dissociation of Glu70. The inhibitor forms hydrophobic interactions to NADH and likely sterically occludes substrate binding. The structure of the inhibitor complex provides a framework for developing more potent inhibitors of hSDH.

Role of the Gamma-phosphate of ATP in Triggering Protein Folding by GroEL-GroES: Function, Structure and Energetics

Productive cis folding by the chaperonin GroEL is triggered by the binding of ATP but not ADP, along with cochaperonin GroES, to the same ring as non-native polypeptide, ejecting polypeptide into an encapsulated hydrophilic chamber. We examined the specific contribution of the gamma-phosphate of ATP to this activation process using complexes of ADP and aluminium or beryllium fluoride. These ATP analogues supported productive cis folding of the substrate protein, rhodanese, even when added to already-formed, folding-inactive cis ADP ternary complexes, essentially introducing the gamma-phosphate of ATP in an independent step. Aluminium fluoride was observed to stabilize the association of GroES with GroEL, with a substantial release of free energy (-46 kcal/mol). To understand the basis of such activation and stabilization, a crystal structure of GroEL-GroES-ADP.AlF3 was determined at 2.8 A. A trigonal AlF3 metal complex was observed in the gamma-phosphate position of the nucleotide pocket of the cis ring. Surprisingly, when this structure was compared with that of the previously determined GroEL-GroES-ADP complex, no other differences were observed. We discuss the likely basis of the ability of gamma-phosphate binding to convert preformed GroEL-GroES-ADP-polypeptide complexes into the folding-active state.

Automatic Solution of Heavy-atom Substructures

Backbone Dynamics of an Oncogenic Mutant of Cdc42Hs Shows Increased Flexibility at the Nucleotide-binding Site

Cdc42Hs, a member of the Ras superfamily of GTP-binding signal transduction proteins, binds guanine nucleotides, and acts as a molecular-timing switch in multiple signal transduction pathways. The structure of the wild-type protein has been solved (Feltham et al. (1997) Biochemistry 36, 8755-8766), and the backbone dynamics have been characterized by NMR spectroscopy (Loh et al. (1999) Biochemistry 38, 12547-12557). The F28L mutation of Cdc42Hs is characterized by an increased rate of cycling between the GTP and GDP-bound forms leading to cell transformation (Lin et al. (1997) Curr. Biol. 7, 794-797). Here, we describe the backbone dynamics of Cdc42Hs(F28L)-GDP using 1H-15N NMR measurements of T1, T1rho, and steady-state NOE at two magnetic field strengths. Residue-specific values of the generalized order parameters (Ss2 and Sf2), local correlation time (tau(e)), and exchange rate (R(ex)) were obtained using the Lipari-Szabo formalism. Chemical-shift perturbation analysis suggested that very little structural change was evident outside of the nucleotide-binding site. However, residues comprising the nucleotide-binding site, as well as the nucleotide itself, exhibit increased dynamics over a wide range of time scales in Cdc42Hs(F28L) relative to the wild type. In addition to changes in dynamics measured by relaxation methods, hydrogen-deuterium exchange indicated a substantial disruption of the hydrogen-bonding network within the nucleotide-binding site. Thus, local dynamic changes introduced by a single-point mutation can affect important aspects of signaling processes without disrupting the conformation of the whole protein.

Exploring the Structural Dynamics of the E.coli Chaperonin GroEL Using Translation-libration-screw Crystallographic Refinement of Intermediate States

Large rigid-body domain movements are critical to GroEL-mediated protein folding, especially apical domain elevation and twist associated with the formation of a folding chamber upon binding ATP and co-chaperonin GroES. Here, we have modeled the anisotropic displacements of GroEL domains from various crystallized states, unliganded GroEL, ATPgammaS-bound, ADP-AlFx/GroES-bound, and ADP/GroES bound, using translation-libration-screw (TLS) analysis. Remarkably, the TLS results show that the inherent motions of unliganded GroEL, a polypeptide-accepting state, are biased along the transition pathway that leads to the folding-active state. In the ADP-AlFx/GroES-bound folding-active state the dynamic modes of the apical domains become reoriented and coupled to the motions of bound GroES. The ADP/GroES complex exhibits these same motions, but they are increased in magnitude, potentially reflecting the decreased stability of the complex after nucleotide hydrolysis. Our results have allowed the visualization of the anisotropic molecular motions that link the static conformations previously observed by X-ray crystallography. Application of the same analyses to other macromolecules where rigid body motions occur may give insight into the large scale dynamics critical for function and thus has the potential to extend our fundamental understanding of molecular machines.

Recent Developments in the PHENIX Software for Automated Crystallographic Structure Determination

A new software system called PHENIX (Python-based Hierarchical ENvironment for Integrated Xtallography) is being developed for the automation of crystallographic structure solution. This will provide the necessary algorithms to proceed from reduced intensity data to a refined molecular model, and facilitate structure solution for both the novice and expert crystallographer. Here, the features of PHENIXare reviewed and the recent advances in infrastructure and algorithms are briefly described.

Crystal Structures of the Rhodococcus Proteasome with and Without Its Pro-peptides: Implications for the Role of the Pro-peptide in Proteasome Assembly

To understand the role of the pro-peptide in proteasome assembly, we have determined structures of the Rhodococcus proteasome and a mutant form that prevents the autocatalytic removal of its pro-peptides. The structures reveal that the pro-peptide acts as an assembly-promoting factor by linking its own beta-subunit to two adjacent alpha-subunits, thereby providing a molecular explanation for the observed kinetics of proteasome assembly. The Rhodococcus proteasome has been found to have a substantially smaller contact region between alpha-subunits compared to those regions in the proteasomes of Thermoplasma, yeast, and mammalian cells, suggesting that a smaller contact area between alpha-subunits is likely the structural basis for the Rhodococcus alpha-subunits not assembling into alpha-rings when expressed alone. Analysis of all available beta-subunit structures shows that the contact area between beta-subunits within a beta-ring is not sufficient for beta-ring self-assembly without the additional contact provided by the alpha-ring. This appears to be a fail-safe mechanism ensuring that the active sites on the beta-subunits are activated only after proteasome assembly is complete.

Modeling Membrane Proteins Utilizing Information from Silent Amino Acid Substitutions

This unit describes predicting the structure of simple transmembrane alpha-helical bundles. The protocol is based on a global molecular dynamics search (GMDS) of the configuration space of the helical bundle, yielding several candidates structures. The correct structure amongst these candidates is selected using information from silent amino acid substitutions, employing the following premise: Only the correct structure must (by definition) accept all of the silent amino acid substitutions. Thus, the correct structure is found by repeating the GMDS for several close homologues and selecting the structure that persists in all of the trials.

Robust Indexing for Automatic Data Collection

Improved methods for indexing diffraction patterns from macromolecular crystals are presented. The novel procedures include a more robust way to verify the position of the incident X-ray beam on the detector, an algorithm to verify that the deduced lattice basis is consistent with the observations, and an alternative approach to identify the metric symmetry of the lattice. These methods help to correct failures commonly experienced during indexing, and increase the overall success rate of the process. Rapid indexing, without the need for visual inspection, will play an important role as beamlines at synchrotron sources prepare for high-throughput automation.

Crystal Structure of a PhoU Protein Homologue: a New Class of Metalloprotein Containing Multinuclear Iron Clusters

PhoU proteins are known to play a role in the regulation of phosphate uptake. In Thermotoga maritima, two PhoU homologues have been identified bioinformatically. Here we report the crystal structure of one of the PhoU homologues at 2.0 A resolution. The structure of the PhoU protein homologue contains a highly symmetric new structural fold composed of two repeats of a three-helix bundle. The structure unexpectedly revealed a trinuclear and a tetranuclear iron cluster that were found to be bound on the surface. Each of the two multinuclear iron clusters is coordinated by a conserved E(D)XXXD motif pair. Our structure reveals a new class of metalloprotein containing multinuclear iron clusters. The possible functional implication based on the structure are discussed.

Crystal Structure of DNA Sequence Specificity Subunit of a Type I Restriction-modification Enzyme and Its Functional Implications

Type I restriction-modification enzymes are differentiated from type II and type III enzymes by their recognition of two specific dsDNA sequences separated by a given spacer and cleaving DNA randomly away from the recognition sites. They are oligomeric proteins formed by three subunits: a specificity subunit, a methylation subunit, and a restriction subunit. We solved the crystal structure of a specificity subunit from Methanococcus jannaschii at 2.4-A resolution. Two highly conserved regions (CRs) in the middle and at the C terminus form a coiled-coil of long antiparallel alpha-helices. Two target recognition domains form globular structures with almost identical topologies and two separate DNA binding clefts with a modeled DNA helix axis positioned across the CR helices. The structure suggests that the coiled-coil CRs act as a molecular ruler for the separation between two recognized DNA sequences. Furthermore, the relative orientation of the two DNA binding clefts suggests kinking of bound dsDNA and exposing of target adenines from the recognized DNA sequences.

Crystal Structure of the "PhoU-like" Phosphate Uptake Regulator from Aquifex Aeolicus

The phoU gene of Aquifex aeolicus encodes a protein called PHOU_AQUAE with sequence similarity to the PhoU protein of Escherichia coli. Despite the fact that there is a large number of family members (more than 300) attributed to almost all known bacteria and despite PHOU_AQUAE's association with the regulation of genes for phosphate metabolism, the nature of its regulatory function is not well understood. Nearly one-half of these PhoU-like proteins, including both PHOU_AQUAE and the one from E. coli, form a subfamily with an apparent dimer structure of two PhoU domains on the basis of their amino acid sequence. The crystal structure of PHOU_AQUAE (a 221-amino-acid protein) reveals two similar coiled-coil PhoU domains, each forming a three-helix bundle. The structures of PHOU_AQUAE proteins from both a soluble fraction and refolded inclusion bodies (at resolutions of 2.8 and 3.2A, respectively) showed no significant differences. The folds of the PhoU domain and Bag domains (for a class of cofactors of the eukaryotic chaperone Hsp70 family) are similar. Accordingly, we propose that gene regulation by PhoU may occur by association of PHOU_AQUAE with the ATPase domain of the histidine kinase PhoR, promoting release of its substrate PhoB. Other proteins that share the PhoU domain fold include the coiled-coil domains of the STAT protein, the ribosome-recycling factor, and structural proteins like spectrin.

Crystal Structure of a Heat-inducible Transcriptional Repressor HrcA from Thermotoga Maritima: Structural Insight into DNA Binding and Dimerization

All cells have a defense mechanism against a sudden heat-shock stress. Commonly, they express a set of proteins that protect cellular proteins from being denatured by heat. Among them, GroE and DnaK chaperones are representative defending systems, and their transcription is regulated by a heat-shock repressor protein HrcA. HrcA repressor controls the transcription of groE and dnaK operons by binding the palindromic CIRCE element, presumably as a dimer, and the activity of HrcA repressor is modulated by GroE chaperones. Here, we report the first crystal structure of a heat-inducible transcriptional repressor, HrcA, from Thermotoga maritima at 2.2A resolution. The Tm_HrcA protein crystallizes as a dimer. The monomer is composed of three domains: an N-terminal winged helix-turn-helix domain (WH), a GAF-like domain, and an inserted dimerizing domain (IDD). The IDD shows a unique structural fold with an anti-parallel beta-sheet composed of three beta-strands sided by four alpha-helices. The Tm_HrcA dimer structure is formed through hydrophobic contact between the IDDs and a limited contact that involves conserved residues between the GAF-like domains. In the overall dimer structure, the two WH domains are exposed, but the conformation of these two domains seems to be incompatible with DNA binding. We suggest that our structure may represent an inactive form of the HrcA repressor. Structural implication on how the inactive form of HrcA may be converted to the active form by GroEL binding to a conserved C-terminal sequence region of HrcA is discussed.

A Robust Bulk-solvent Correction and Anisotropic Scaling Procedure

A reliable method for the determination of bulk-solvent model parameters and an overall anisotropic scale factor is of increasing importance as structure determination becomes more automated. Current protocols require the manual inspection of refinement results in order to detect errors in the calculation of these parameters. Here, a robust method for determining bulk-solvent and anisotropic scaling parameters in macromolecular refinement is described. The implementation of a maximum-likelihood target function for determining the same parameters is also discussed. The formulas and corresponding derivatives of the likelihood function with respect to the solvent parameters and the components of anisotropic scale matrix are presented. These algorithms are implemented in the CCTBX bulk-solvent correction and scaling module.

Crystal Structure of a Bacterial Ribonuclease P RNA

The x-ray crystal structure of a 417-nt ribonuclease P RNA from Bacillus stearothermophilus was solved to 3.3-A resolution. This RNA enzyme is constructed from a number of coaxially stacked helical domains joined together by local and long-range interactions. These helical domains are arranged to form a remarkably flat surface, which is implicated by a wealth of biochemical data in the binding and cleavage of the precursors of transfer RNA substrate. Previous photoaffinity crosslinking data are used to position the substrate on the crystal structure and to identify the chemically active site of the ribozyme. This site is located in a highly conserved core structure formed by intricately interlaced long-range interactions between interhelical sequences.

Automated Crystallographic Ligand Building Using the Medial Axis Transform of an Electron-density Isosurface

Automatic fitting methods that build molecules into electron-density maps usually fail below 3.5 A resolution. As a first step towards addressing this problem, an algorithm has been developed using an approximation of the medial axis to simplify an electron-density isosurface. This approximation captures the central axis of the isosurface with a graph which is then matched against a graph of the molecular model. One of the first applications of the medial axis to X-ray crystallography is presented here. When applied to ligand fitting, the method performs at least as well as methods based on selecting peaks in electron-density maps. Generalization of the method to recognition of common features across multiple contour levels could lead to powerful automatic fitting methods that perform well even at low resolution.

Structural Genomics of Minimal Organisms and Protein Fold Space

The initial aim of the Berkeley Structural Genomics Center is to obtain a near-complete structural complement of two minimal organisms, closely related pathogens Mycoplasma genitalium and M. pneumoniae. The former has fewer than 500 genes and the latter fewer than 700 genes. To achieve this goal, the current protein targets have been selected starting with those predicted to be most tractable and likely to yield new structural and functional information. During the past 3 years, the semi-automated structural genomics pipeline has been set up from cloning, expression, purification, and ultimately to structural determination. The results from the pipeline substantially increased the coverage of the protein fold space of M. pneumoniae and M. genitalium. Furthermore, about 1/2 of the structures of 'unique' protein sequences revealed new and novel folds, and over 2/3 of the structures of previously annotated 'hypothetical proteins' inferred their molecular functions.

Crystal Structures of an NAD Kinase from Archaeoglobus Fulgidus in Complex with ATP, NAD, or NADP

NAD kinase is a ubiquitous enzyme that catalyzes the phosphorylation of NAD to NADP using ATP or inorganic polyphosphate (poly(P)) as phosphate donor, and is regarded as the only enzyme responsible for the synthesis of NADP. We present here the crystal structures of an NAD kinase from the archaeal organism Archaeoglobus fulgidus in complex with its phosphate donor ATP at 1.7 A resolution, with its substrate NAD at 3.05 A resolution, and with the product NADP in two different crystal forms at 2.45 A and 2.0 A resolution, respectively. In the ATP bound structure, the AMP portion of the ATP molecule is found to use the same binding site as the nicotinamide ribose portion of NAD/NADP in the NAD/NADP bound structures. A magnesium ion is found to be coordinated to the phosphate tail of ATP as well as to a pyrophosphate group. The conserved GGDG loop forms hydrogen bonds with the pyrophosphate group in the ATP-bound structure and the 2' phosphate group of the NADP in the NADP-bound structures. A possible phosphate transfer mechanism is proposed on the basis of the structures presented.

Structural Basis for Double-stranded RNA Processing by Dicer

The specialized ribonuclease Dicer initiates RNA interference by cleaving double-stranded RNA (dsRNA) substrates into small fragments about 25 nucleotides in length. In the crystal structure of an intact Dicer enzyme, the PAZ domain, a module that binds the end of dsRNA, is separated from the two catalytic ribonuclease III (RNase III) domains by a flat, positively charged surface. The 65 angstrom distance between the PAZ and RNase III domains matches the length spanned by 25 base pairs of RNA. Thus, Dicer itself is a molecular ruler that recognizes dsRNA and cleaves a specified distance from the helical end.

Solution Structure of an Oncogenic Mutant of Cdc42Hs

Cdc42Hs(F28L) is a single-point mutant of Cdc42Hs, a member of the Ras superfamily of GTP-binding proteins, that facilitates cellular transformation brought about by an increased rate of cycling between GTP and GDP [Lin, R., et al. (1997) Curr. Biol. 7, 794-797]. Dynamics studies of Cdc42Hs(F28L)-GDP have shown increased flexibility for several residues at the nucleotide-binding site [Adams, P. D., et al. (2004) Biochemistry 43, 9968-9977]. The solution structure of Cdc42Hs-GDP (wild type) has previously been determined by NMR spectroscopy [Feltham, J. L., et al. (1997) Biochemistry 36, 8755-8766]. Here, we describe the solution structure of Cdc42Hs(F28L)-GDP, which provides insight into the structural basis for the change in affinity for GDP. Heteronuclear NMR experiments were performed to assign resonances in the protein, and distance, hydrogen bonding, residual dipolar coupling, and dihedral angle constraints were used to calculate a set of low-energy structures using distance geometry and simulated annealing refinement protocols. The overall structure of Cdc42Hs(F28L)-GDP is very similar to that of wild-type Cdc42Hs, consisting of a centrally located six-stranded beta-sheet structure surrounding the C-terminal alpha-helix [Feltham, J. L., et al. (1997) Biochemistry 36, 8755-8766]. In addition, the same three regions in wild-type Cdc42Hs that show structural disorder (Switch I, Switch II, and the Insert region) are disordered in F28L as well. Although the structure of Cdc42Hs(F28L)-GDP is very similar to that of the wild type, interactions with the nucleotide and hydrogen bonding within the nucleotide binding site are altered, and the region surrounding L28 is substantially more disordered.

Automated Ligand Fitting by Core-fragment Fitting and Extension into Density

A procedure for fitting of ligands to electron-density maps by first fitting a core fragment of the ligand to density and then extending the remainder of the ligand into density is presented. The approach was tested by fitting 9327 ligands over a wide range of resolutions (most are in the range 0.8-4.8 A) from the Protein Data Bank (PDB) into (Fo - Fc)exp(i phi(c)) difference density calculated using entries from the PDB without these ligands. The procedure was able to place 58% of these 9327 ligands within 2 A (r.m.s.d.) of the coordinates of the atoms in the original PDB entry for that ligand. The success of the fitting procedure was relatively insensitive to the size of the ligand in the range 10-100 non-H atoms and was only moderately sensitive to resolution, with the percentage of ligands placed near the coordinates of the original PDB entry for fits in the range 58-73% over all resolution ranges tested.

SPARX, a New Environment for Cryo-EM Image Processing

SPARX (single particle analysis for resolution extension) is a new image processing environment with a particular emphasis on transmission electron microscopy (TEM) structure determination. It includes a graphical user interface that provides a complete graphical programming environment with a novel data/process-flow infrastructure, an extensive library of Python scripts that perform specific TEM-related computational tasks, and a core library of fundamental C++ image processing functions. In addition, SPARX relies on the EMAN2 library and cctbx, the open-source computational crystallography library from PHENIX. The design of the system is such that future inclusion of other image processing libraries is a straightforward task. The SPARX infrastructure intelligently handles retention of intermediate values, even those inside programming structures such as loops and function calls. SPARX and all dependencies are free for academic use and available with complete source.

Ligand Identification Using Electron-density Map Correlations

A procedure for the identification of ligands bound in crystal structures of macromolecules is described. Two characteristics of the density corresponding to a ligand are used in the identification procedure. One is the correlation of the ligand density with each of a set of test ligands after optimization of the fit of that ligand to the density. The other is the correlation of a fingerprint of the density with the fingerprint of model density for each possible ligand. The fingerprints consist of an ordered list of correlations of each the test ligands with the density. The two characteristics are scored using a Z-score approach in which the correlations are normalized to the mean and standard deviation of correlations found for a variety of mismatched ligand-density pairs, so that the Z scores are related to the probability of observing a particular value of the correlation by chance. The procedure was tested with a set of 200 of the most commonly found ligands in the Protein Data Bank, collectively representing 57% of all ligands in the Protein Data Bank. Using a combination of these two characteristics of ligand density, ranked lists of ligand identifications were made for representative (F(o) - F(c))exp(i(phi)c) difference density from entries in the Protein Data Bank. In 48% of the 200 cases, the correct ligand was at the top of the ranked list of ligands. This approach may be useful in identification of unknown ligands in new macromolecular structures as well as in the identification of which ligands in a mixture have bound to a macromolecule.

Interpretation of Ensembles Created by Multiple Iterative Rebuilding of Macromolecular Models

Automation of iterative model building, density modification and refinement in macromolecular crystallography has made it feasible to carry out this entire process multiple times. By using different random seeds in the process, a number of different models compatible with experimental data can be created. Sets of models were generated in this way using real data for ten protein structures from the Protein Data Bank and using synthetic data generated at various resolutions. Most of the heterogeneity among models produced in this way is in the side chains and loops on the protein surface. Possible interpretations of the variation among models created by repetitive rebuilding were investigated. Synthetic data were created in which a crystal structure was modelled as the average of a set of ;perfect' structures and the range of models obtained by rebuilding a single starting model was examined. The standard deviations of coordinates in models obtained by repetitive rebuilding at high resolution are small, while those obtained for the same synthetic crystal structure at low resolution are large, so that the diversity within a group of models cannot generally be a quantitative reflection of the actual structures in a crystal. Instead, the group of structures obtained by repetitive rebuilding reflects the precision of the models, and the standard deviation of coordinates of these structures is a lower bound estimate of the uncertainty in coordinates of the individual models.

On Macromolecular Refinement at Subatomic Resolution with Interatomic Scatterers

A study of the accurate electron-density distribution in molecular crystals at subatomic resolution (better than approximately 1.0 A) requires more detailed models than those based on independent spherical atoms. A tool that is conventionally used in small-molecule crystallography is the multipolar model. Even at upper resolution limits of 0.8-1.0 A, the number of experimental data is insufficient for full multipolar model refinement. As an alternative, a simpler model composed of conventional independent spherical atoms augmented by additional scatterers to model bonding effects has been proposed. Refinement of these mixed models for several benchmark data sets gave results that were comparable in quality with the results of multipolar refinement and superior to those for conventional models. Applications to several data sets of both small molecules and macromolecules are shown. These refinements were performed using the general-purpose macromolecular refinement module phenix.refine of the PHENIX package.

Phaser Crystallographic Software

Phaser is a program for phasing macromolecular crystal structures by both molecular replacement and experimental phasing methods. The novel phasing algorithms implemented in Phaser have been developed using maximum likelihood and multivariate statistics. For molecular replacement, the new algorithms have proved to be significantly better than traditional methods in discriminating correct solutions from noise, and for single-wavelength anomalous dispersion experimental phasing, the new algorithms, which account for correlations between F(+) and F(-), give better phases (lower mean phase error with respect to the phases given by the refined structure) than those that use mean F and anomalous differences DeltaF. One of the design concepts of Phaser was that it be capable of a high degree of automation. To this end, Phaser (written in C++) can be called directly from Python, although it can also be called using traditional CCP4 keyword-style input. Phaser is a platform for future development of improved phasing methods and their release, including source code, to the crystallographic community.

NMR Assignment of Cdc42(T35A), an Active Switch I Mutant of Cdc42

Cdc42(T35A) is an active construct of Cdc42, a Ras GTPase involved in signal transduction, containing a single-point mutation in an important effector-binding region. We determined the backbone and side chain resonance assignments of (13)C,(15)N-labelled Cdc42(T35A) from E. coli.

Iterative-build OMIT Maps: Map Improvement by Iterative Model Building and Refinement Without Model Bias

A procedure for carrying out iterative model building, density modification and refinement is presented in which the density in an OMIT region is essentially unbiased by an atomic model. Density from a set of overlapping OMIT regions can be combined to create a composite 'iterative-build' OMIT map that is everywhere unbiased by an atomic model but also everywhere benefiting from the model-based information present elsewhere in the unit cell. The procedure may have applications in the validation of specific features in atomic models as well as in overall model validation. The procedure is demonstrated with a molecular-replacement structure and with an experimentally phased structure and a variation on the method is demonstrated by removing model bias from a structure from the Protein Data Bank.

Automated Structure Solution with the PHENIX Suite

Significant time and effort are often required to solve and complete a macromolecular crystal structure. The development of automated computational methods for the analysis, solution, and completion of crystallographic structures has the potential to produce minimally biased models in a short time without the need for manual intervention. The PHENIX software suite is a highly automated system for macromolecular structure determination that can rapidly arrive at an initial partial model of a structure without significant human intervention, given moderate resolution, and good quality data. This achievement has been made possible by the development of new algorithms for structure determination, maximum-likelihood molecular replacement (PHASER), heavy-atom search (HySS), template- and pattern-based automated model-building (RESOLVE, TEXTAL), automated macromolecular refinement (phenix. refine), and iterative model-building, density modification and refinement that can operate at moderate resolution (RESOLVE, AutoBuild). These algorithms are based on a highly integrated and comprehensive set of crystallographic libraries that have been built and made available to the community. The algorithms are tightly linked and made easily accessible to users through the PHENIX Wizards and the PHENIX GUI.

Iterative Model Building, Structure Refinement and Density Modification with the PHENIX AutoBuild Wizard

The PHENIX AutoBuild wizard is a highly automated tool for iterative model building, structure refinement and density modification using RESOLVE model building, RESOLVE statistical density modification and phenix.refine structure refinement. Recent advances in the AutoBuild wizard and phenix.refine include automated detection and application of NCS from models as they are built, extensive model-completion algorithms and automated solvent-molecule picking. Model-completion algorithms in the AutoBuild wizard include loop building, crossovers between chains in different models of a structure and side-chain optimization. The AutoBuild wizard has been applied to a set of 48 structures at resolutions ranging from 1.1 to 3.2 A, resulting in a mean R factor of 0.24 and a mean free R factor of 0.29. The R factor of the final model is dependent on the quality of the starting electron density and is relatively independent of resolution.

Surprises and Pitfalls Arising from (pseudo)symmetry

It is not uncommon for protein crystals to crystallize with more than a single molecule per asymmetric unit. When more than a single molecule is present in the asymmetric unit, various pathological situations such as twinning, modulated crystals and pseudo translational or rotational symmetry can arise. The presence of pseudosymmetry can lead to uncertainties about the correct space group, especially in the presence of twinning. The background to certain common pathologies is presented and a new notation for space groups in unusual settings is introduced. The main concepts are illustrated with several examples from the literature and the Protein Data Bank.

Addressing the Need for Alternative Transportation Fuels: the Joint BioEnergy Institute

A Method for the Prevention of Thrombin-induced Degradation of Recombinant Proteins

A new strategy to prevent degradation of recombinant proteins caused by non-specific cleavage by thrombin is described. We demonstrate that degradation due to non-specific cleavage of recombinant protein mediated by thrombin can be completely prevented by separation of thrombin from the recombinant protein on spin columns packed with heparin-sepharose. This method is generally applicable to all recombinant proteins that require the thrombin for the cleavage of affinity tags for purification. To our knowledge, this is the first report of an efficient and reliable method for the separation of residual thrombin from purified recombinant proteins.

Protein Structures by Spallation Neutron Crystallography

The Protein Crystallography Station at Los Alamos Neutron Science Center is a high-performance beamline that forms the core of a capability for neutron macromolecular structure and function determination. This capability also includes the Macromolecular Neutron Crystallography (MNC) consortium between Los Alamos (LANL) and Lawrence Berkeley National Laboratories for developing computational tools for neutron protein crystallography, a biological deuteration laboratory, the National Stable Isotope Production Facility, and an MNC drug design consortium between LANL and Case Western Reserve University.

The Protein Structure Initiative Structural Genomics Knowledgebase

The Protein Structure Initiative Structural Genomics Knowledgebase (PSI SGKB, http://kb.psi-structuralgenomics.org) has been created to turn the products of the PSI structural genomics effort into knowledge that can be used by the biological research community to understand living systems and disease. This resource provides central access to structures in the Protein Data Bank (PDB), along with functional annotations, associated homology models, worldwide protein target tracking information, available protocols and the potential to obtain DNA materials for many of the targets. It also offers the ability to search all of the structural and methodological publications and the innovative technologies that were catalyzed by the PSI's high-throughput research efforts. In collaboration with the Nature Publishing Group, the PSI SGKB provides a research library, editorials about new research advances, news and an events calendar to present a broader view of structural biology and structural genomics. By making these resources freely available, the PSI SGKB serves as a bridge to connect the structural biology and the greater biomedical communities.

Crystallographic Model Quality at a Glance

A crystallographic macromolecular model is typically characterized by a list of quality criteria, such as R factors, deviations from ideal stereochemistry and average B factors, which are usually provided as tables in publications or in structural databases. In order to facilitate a quick model-quality evaluation, a graphical representation is proposed. Each key parameter such as R factor or bond-length deviation from ;ideal values' is shown graphically as a point on a ;ruler'. These rulers are plotted as a set of lines with the same origin, forming a hub and spokes. Different parts of the rulers are coloured differently to reflect the frequency (red for a low frequency, blue for a high frequency) with which the corresponding values are observed in a reference set of structures determined previously. The points for a given model marked on these lines are connected to form a polygon. A polygon that is strongly compressed or dilated along some axes reveals unusually low or high values of the corresponding characteristics. Polygon vertices in ;red zones' indicate parameters which lie outside typical values.

Generalized X-ray and Neutron Crystallographic Analysis: More Accurate and Complete Structures for Biological Macromolecules

X-ray and neutron crystallographic techniques provide complementary information on the structure and function of biological macromolecules. X-ray and neutron (XN) crystallographic data have been combined in a joint structure-refinement procedure that has been developed using recent advances in modern computational methodologies, including cross-validated maximum-likelihood target functions with gradient-based optimization and simulated annealing. The XN approach for complete (including hydrogen) macromolecular structure analysis provides more accurate and complete structures, as demonstrated for diisopropyl fluorophosphatase, photoactive yellow protein and human aldose reductase. Furthermore, this method has several practical advantages, including the easier determination of the orientation of water molecules, hydroxyl groups and some amino-acid side chains.

Decision-making in Structure Solution Using Bayesian Estimates of Map Quality: the PHENIX AutoSol Wizard

Estimates of the quality of experimental maps are important in many stages of structure determination of macromolecules. Map quality is defined here as the correlation between a map and the corresponding map obtained using phases from the final refined model. Here, ten different measures of experimental map quality were examined using a set of 1359 maps calculated by re-analysis of 246 solved MAD, SAD and MIR data sets. A simple Bayesian approach to estimation of map quality from one or more measures is presented. It was found that a Bayesian estimator based on the skewness of the density values in an electron-density map is the most accurate of the ten individual Bayesian estimators of map quality examined, with a correlation between estimated and actual map quality of 0.90. A combination of the skewness of electron density with the local correlation of r.m.s. density gives a further improvement in estimating map quality, with an overall correlation coefficient of 0.92. The PHENIX AutoSol wizard carries out automated structure solution based on any combination of SAD, MAD, SIR or MIR data sets. The wizard is based on tools from the PHENIX package and uses the Bayesian estimates of map quality described here to choose the highest quality solutions after experimental phasing.

Structure of Endoglucanase Cel9A from the Thermoacidophilic Alicyclobacillus Acidocaldarius

The production of biofuels using biomass is an alternative route to support the growing global demand for energy and to also reduce the environmental problems caused by the burning of fossil fuels. Cellulases are likely to play an important role in the degradation of biomass and the production of sugars for subsequent fermentation to fuel. Here, the crystal structure of an endoglucanase, Cel9A, from Alicyclobacillus acidocaldarius (Aa_Cel9A) is reported which displays a modular architecture composed of an N-terminal Ig-like domain connected to the catalytic domain. This paper describes the overall structure and the detailed contacts between the two modules. Analysis suggests that the interaction involving the residues Gln13 (from the Ig-like module) and Phe439 (from the catalytic module) is important in maintaining the correct conformation of the catalytic module required for protein activity. Moreover, the Aa_Cel9A structure shows three metal-binding sites that are associated with the thermostability and/or substrate affinity of the enzyme.

Automatic Multiple-zone Rigid-body Refinement with a Large Convergence Radius

Rigid-body refinement is the constrained coordinate refinement of one or more groups of atoms that each move (rotate and translate) as a single body. The goal of this work was to establish an automatic procedure for rigid-body refinement which implements a practical compromise between runtime requirements and convergence radius. This has been achieved by analysis of a large number of trial refinements for 12 classes of random rigid-body displacements (that differ in magnitude of introduced errors), using both least-squares and maximum-likelihood target functions. The results of these tests led to a multiple-zone protocol. The final parameterization of this protocol was optimized empirically on the basis of a second large set of test refinements. This multiple-zone protocol is implemented as part of the phenix.refine program.

Averaged Kick Maps: Less Noise, More Signal... and Probably Less Bias

Use of reliable density maps is crucial for rapid and successful crystal structure determination. Here, the averaged kick (AK) map approach is investigated, its application is generalized and it is compared with other map-calculation methods. AK maps are the sum of a series of kick maps, where each kick map is calculated from atomic coordinates modified by random shifts. As such, they are a numerical analogue of maximum-likelihood maps. AK maps can be unweighted or maximum-likelihood (sigma(A)) weighted. Analysis shows that they are comparable and correspond better to the final model than sigma(A) and simulated-annealing maps. The AK maps were challenged by a difficult structure-validation case, in which they were able to clarify the problematic region in the density without the need for model rebuilding. The conclusion is that AK maps can be useful throughout the entire progress of crystal structure determination, offering the possibility of improved map interpretation.

Recent Developments in Phasing and Structure Refinement for Macromolecular Crystallography

Central to crystallographic structure solution is obtaining accurate phases in order to build a molecular model, ultimately followed by refinement of that model to optimize its fit to the experimental diffraction data and prior chemical knowledge. Recent advances in phasing and model refinement and validation algorithms make it possible to arrive at better electron density maps and more accurate models.

Electronic Ligand Builder and Optimization Workbench (eLBOW): a Tool for Ligand Coordinate and Restraint Generation

The electronic Ligand Builder and Optimization Workbench (eLBOW) is a program module of the PHENIX suite of computational crystallographic software. It is designed to be a flexible procedure that uses simple and fast quantum-chemical techniques to provide chemically accurate information for novel and known ligands alike. A variety of input formats and options allow the attainment of a number of diverse goals including geometry optimization and generation of restraints.

A Rapid and Inexpensive Labeling Method for Microarray Gene Expression Analysis

Global gene expression profiling by DNA microarrays is an invaluable tool in biological research. However, existing labeling methods are time consuming and costly and therefore often limit the scale of microarray experiments and sample throughput. Here we introduce a new, fast, inexpensive method for direct random-primed fluorescent labeling of eukaryotic cDNA for gene expression analysis and compare the results obtained on the NimbleGen microarray platform with two other widely-used labeling methods, namely the NimbleGen-recommended double-stranded cDNA protocol and the indirect (aminoallyl) method.

On the Use of Logarithmic Scales for Analysis of Diffraction Data

Predictions of the possible model parameterization and of the values of model characteristics such as R factors are important for macromolecular refinement and validation protocols. One of the key parameters defining these and other values is the resolution of the experimentally measured diffraction data. The higher the resolution, the larger the number of diffraction data N(ref), the larger its ratio to the number N(at) of non-H atoms, the more parameters per atom can be used for modelling and the more precise and detailed a model can be obtained. The ratio N(ref)/N(at) was calculated for models deposited in the Protein Data Bank as a function of the resolution at which the structures were reported. The most frequent values for this distribution depend essentially linearly on resolution when the latter is expressed on a uniform logarithmic scale. This defines simple analytic formulae for the typical Matthews coefficient and for the typically allowed number of parameters per atom for crystals diffracting to a given resolution. This simple dependence makes it possible in many cases to estimate the expected resolution of the experimental data for a crystal with a given Matthews coefficient. When expressed using the same logarithmic scale, the most frequent values for R and R(free) factors and for their difference are also essentially linear across a large resolution range. The minimal R-factor values are practically constant at resolutions better than 3 A, below which they begin to grow sharply. This simple dependence on the resolution allows the prediction of expected R-factor values for unknown structures and may be used to guide model refinement and validation.

PHENIX: a Comprehensive Python-based System for Macromolecular Structure Solution

Macromolecular X-ray crystallography is routinely applied to understand biological processes at a molecular level. However, significant time and effort are still required to solve and complete many of these structures because of the need for manual interpretation of complex numerical data using many software packages and the repeated use of interactive three-dimensional graphics. PHENIX has been developed to provide a comprehensive system for macromolecular crystallographic structure solution with an emphasis on the automation of all procedures. This has relied on the development of algorithms that minimize or eliminate subjective input, the development of algorithms that automate procedures that are traditionally performed by hand and, finally, the development of a framework that allows a tight integration between the algorithms.

Crystal Structure of the Transcriptional Activator HlyU from Vibrio Vulnificus CMCP6

HlyU is a transcription factor of the ArsR/SmtB family and activates the expression of the pathogenic Vibrio vulnificus RTX toxin. In contrast to the other metal-responding ArsR/SmtB proteins, HlyU does not sense metal ions. To provide its structural information, we elucidated the crystal structure of HlyU from V. vulnificus CMCP6 (HlyU_Vv). The monomeric HlyU_Vv architecture of five alpha-helices and two beta-strands, some of which constitute a typical DNA-binding winged helix-turn-helix (wHTH) motif, is very similar to that of other transcription regulators. Nonetheless, the homo-dimeric HlyU_Vv structure shows several different, three-dimensional features in the spatial position and the detailed dimeric interaction, which were not observed in the modeling study based on the same protein family and sequence similarity.

Raman Imaging of Cell Wall Polymers in Arabidopsis Thaliana

We present chemical images of Arabidopsis thaliana stem cross-sections acquired by confocal Raman microscopy. Using green light (532 nm) from a continuous wave laser, the spatial distributions of cell wall polymers in Arabidopsis are visualized for the first time with lateral resolution that is sub-mum. Our results facilitate the label-free in situ characterization and screening of cell wall composition in this plant biology and genetics model organism, contributing ultimately towards an understanding of the molecular biology of many plant traits.

Crystal Structures of a Group II Chaperonin Reveal the Open and Closed States Associated with the Protein Folding Cycle

Chaperonins are large protein complexes consisting of two stacked multisubunit rings, which open and close in an ATP-dependent manner to create a protected environment for protein folding. Here, we describe the first crystal structure of a group II chaperonin in an open conformation. We have obtained structures of the archaeal chaperonin from Methanococcus maripaludis in both a peptide acceptor (open) state and a protein folding (closed) state. In contrast with group I chaperonins, in which the equatorial domains share a similar conformation between the open and closed states and the largest motions occurs at the intermediate and apical domains, the three domains of the archaeal chaperonin subunit reorient as a single rigid body. The large rotation observed from the open state to the closed state results in a 65% decrease of the folding chamber volume and creates a highly hydrophilic surface inside the cage. These results suggest a completely distinct closing mechanism in the group II chaperonins as compared with the group I chaperonins.

Biochemical Characterization and Crystal Structure of Endoglucanase Cel5A from the Hyperthermophilic Thermotoga Maritima

Tm_Cel5A, which belongs to family 5 of the glycoside hydrolases, is an extremely stable enzyme among the endo-acting glycosidases present in the hyperthermophilic organism Thermotoga maritima. Members of GH5 family shows a common (β/α)(8) TIM-barrel fold in which the catalytic acid/base and nucleophile are located on strands β-4 and β-7 of the barrel fold. Thermally resistant cellulases are desirable for lignocellulosic biofuels production and the Tm_Cel5A is an excellent candidate for use in the degradation of polysaccharides present on biomass. This paper describes two Tm_Cel5A structures (crystal forms I and II) solved at 2.20 and 1.85Å resolution, respectively. Our analyses of the Tm_Cel5A structure and comparison to a mesophilic GH5 provides a basis for the thermostability associated with Tm_Cel5A. Furthermore, both crystal forms of Tm_Cel5A possess a cadmium (Cd(2+)) ion bound between the two catalytic residues. Activity assays of Tm_Cel5A confirmed a strong inhibition effect in the presence of Cd(2+) metal ions demonstrating competition with the natural substrate for the active site. Based on the structural information we have obtained for Tm_Cel5A, protein bioengineering can be used to potentially increase the thermostability of mesophilic cellulase enzymes.

Molecular Simulations Provide New Insights into the Role of the Accessory Immunoglobulin-like Domain of Cel9A

Cel9A from the thermoacidophilic bacterium Alicyclobacillus acidocaldarius belongs to the subfamily E1 of family 9 glycoside hydrolases, many members of which have an N-terminal Ig-like domain followed by the catalytic domain. The Ig-like domain is not directly involved in either carbohydrate binding or biocatalysis; however, deletion of the Ig-domain promotes loss of enzymatic activity. We have investigated the functional role of the Ig-like domain using molecular dynamics simulations. Our simulations indicate that residues within the Ig-like domain are dynamically correlated with residues in the carbohydrate-binding pocket and with key catalytic residues of Cel9A. Free energy perturbation simulations indicate that the Ig-like domain stabilizes the catalytic domain and may be responsible for the enhanced thermostability of Cel9A.

Phenix.model_vs_data: a High-level Tool for the Calculation of Crystallographic Model and Data Statistics

phenix.model_vs_data is a high-level command-line tool for the computation of crystallographic model and data statistics, and the evaluation of the fit of the model to data. Analysis of all Protein Data Bank structures that have experimental data available shows that in most cases the reported statistics, in particular R factors, can be reproduced within a few percentage points. However, there are a number of outliers where the recomputed R values are significantly different from those originally reported. The reasons for these discrepancies are discussed.

A Microscale Platform for Integrated Cell-free Expression and Activity Screening of Cellulases

Recent advances in production of cellulases by genetic engineering and isolation from natural microbial communities have necessitated the development of high-throughput analytical technologies for cellulase expression and screening. We have developed a novel cost-effective microscale approach based on in vitro protein synthesis, which seamlessly integrates cellulase expression with activity screening without the need for any protein purification procedures. Our platform achieves the entire process of transcription, translation, and activity screening within 2-3 hours in microwell arrays compared with days needed for conventional cell-based cellulase expression, purification, and activity screening. Highly sensitive fluorescence-based detection permits activity screening in volumes as low as 2-3 μL with minimal evaporation (even at temperatures as high as 95 °C) leading to two orders of magnitude reduction in reagent usage and cost. The platform was used for rapid expression and screening of β-glucosidases (BGs) and cellobiohydrolases (CBHs) isolated from thermophilic microorganisms. Furthermore, it was also used to determine optimum temperatures for BG and CBH activities and to study product inhibition of CBHs. The approach described here is well suited for first-pass screening of large libraries to identify cellulases with desired properties that can subsequently be produced on a large scale for detailed structural and functional characterization.

Evidence of Functional Protein Dynamics from X-ray Crystallographic Ensembles

It is widely recognized that representing a protein as a single static conformation is inadequate to describe the dynamics essential to the performance of its biological function. We contrast the amino acid displacements below and above the protein dynamical transition temperature, T(D)∼215K, of hen egg white lysozyme using X-ray crystallography ensembles that are analyzed by molecular dynamics simulations as a function of temperature. We show that measuring structural variations across an ensemble of X-ray derived models captures the activation of conformational states that are of functional importance just above T(D), and they remain virtually identical to structural motions measured at 300K. Our results highlight the ability to observe functional structural variations across an ensemble of X-ray crystallographic data, and that residue fluctuations measured in MD simulations at room temperature are in quantitative agreement with the experimental observable.

Coupling of Receptor Conformation and Ligand Orientation Determine Graded Activity

Small molecules stabilize specific protein conformations from a larger ensemble, enabling molecular switches that control diverse cellular functions. We show here that the converse also holds true: the conformational state of the estrogen receptor can direct distinct orientations of the bound ligand. 'Gain-of-allostery' mutations that mimic the effects of ligand in driving protein conformation allowed crystallization of the partial agonist ligand WAY-169916 with both the canonical active and inactive conformations of the estrogen receptor. The intermediate transcriptional activity induced by WAY-169916 is associated with the ligand binding differently to the active and inactive conformations of the receptor. Analyses of a series of chemical derivatives demonstrated that altering the ensemble of ligand binding orientations changes signaling output. The coupling of different ligand binding orientations to distinct active and inactive protein conformations defines a new mechanism for titrating allosteric signaling activity.

Microfluidic Glycosyl Hydrolase Screening for Biomass-to-biofuel Conversion

The hydrolysis of biomass to fermentable sugars using glycosyl hydrolases such as cellulases and hemicellulases is a limiting and costly step in the conversion of biomass to biofuels. Enhancement in hydrolysis efficiency is necessary and requires improvement in both enzymes and processing strategies. Advances in both areas in turn strongly depend on the progress in developing high-throughput assays to rapidly and quantitatively screen a large number of enzymes and processing conditions. For example, the characterization of various cellodextrins and xylooligomers produced during the time course of saccharification is important in the design of suitable reactors, enzyme cocktail compositions, and biomass pretreatment schemes. We have developed a microfluidic-chip-based assay for rapid and precise characterization of glycans and xylans resulting from biomass hydrolysis. The technique enables multiplexed separation of soluble cellodextrins and xylose oligomers in around 1 min (10-fold faster than HPLC). The microfluidic device was used to elucidate the mode of action of Tm_Cel5A, a novel cellulase from hyperthermophile Thermotoga maritima . The results demonstrate that the cellulase is active at 80 °C and effectively hydrolyzes cellodextrins and ionic-liquid-pretreated switchgrass and Avicel to glucose, cellobiose, and cellotriose. The proposed microscale approach is ideal for quantitative large-scale screening of enzyme libraries for biomass hydrolysis, for development of energy feedstocks, and for polysaccharide sequencing.

Joint X-ray and Neutron Refinement with Phenix.refine

Approximately 85% of the structures deposited in the Protein Data Bank have been solved using X-ray crystallography, making it the leading method for three-dimensional structure determination of macromolecules. One of the limitations of the method is that the typical data quality (resolution) does not allow the direct determination of H-atom positions. Most hydrogen positions can be inferred from the positions of other atoms and therefore can be readily included into the structure model as a priori knowledge. However, this may not be the case in biologically active sites of macromolecules, where the presence and position of hydrogen is crucial to the enzymatic mechanism. This makes the application of neutron crystallography in biology particularly important, as H atoms can be clearly located in experimental neutron scattering density maps. Without exception, when a neutron structure is determined the corresponding X-ray structure is also known, making it possible to derive the complete structure using both data sets. Here, the implementation of crystallographic structure-refinement procedures that include both X-ray and neutron data (separate or jointly) in the PHENIX system is described.

Targeted Proteomics for Metabolic Pathway Optimization: Application to Terpene Production

Successful metabolic engineering relies on methodologies that aid assembly and optimization of novel pathways in microbes. Many different factors may contribute to pathway performance, and problems due to mRNA abundance, protein abundance, or enzymatic activity may not be evident by monitoring product titers. To this end, synthetic biologists and metabolic engineers utilize a variety of analytical methods to identify the parts of the pathway that limit production. In this study, targeted proteomics, via selected-reaction monitoring (SRM) mass spectrometry, was used to measure protein levels in Escherichia coli strains engineered to produce the sesquiterpene, amorpha-4,11-diene. From this analysis, two mevalonate pathway proteins, mevalonate kinase (MK) and phosphomevalonate kinase (PMK) from Saccharomyces cerevisiae, were identified as potential bottlenecks. Codon-optimization of the genes encoding MK and PMK and expression from a stronger promoter led to significantly improved MK and PMK protein levels and over three-fold improved final amorpha-4,11-diene titer (>500 mg/L).

The Structural Biology Knowledgebase: a Portal to Protein Structures, Sequences, Functions, and Methods

The Protein Structure Initiative's Structural Biology Knowledgebase (SBKB, URL: http://sbkb.org ) is an open web resource designed to turn the products of the structural genomics and structural biology efforts into knowledge that can be used by the biological community to understand living systems and disease. Here we will present examples on how to use the SBKB to enable biological research. For example, a protein sequence or Protein Data Bank (PDB) structure ID search will provide a list of related protein structures in the PDB, associated biological descriptions (annotations), homology models, structural genomics protein target status, experimental protocols, and the ability to order available DNA clones from the PSI:Biology-Materials Repository. A text search will find publication and technology reports resulting from the PSI's high-throughput research efforts. Web tools that aid in research, including a system that accepts protein structure requests from the community, will also be described. Created in collaboration with the Nature Publishing Group, the Structural Biology Knowledgebase monthly update also provides a research library, editorials about new research advances, news, and an events calendar to present a broader view of structural genomics and structural biology.

Exact Direct-space Asymmetric Units for the 230 Crystallographic Space Groups

It is well known that the direct-space asymmetric unit definitions found in the International Tables for Crystallography, Volume A, are inexact at the borders. Face- and edge-specific sub-conditions have to be added to remove parts redundant under symmetry. This paper introduces a concise geometric notation for asymmetric unit conditions. The notation is the foundation for a reference table of exact direct-space asymmetric unit definitions for the 230 crystallographic space-group types. The change-of-basis transformation law for the conditions is derived, which allows the information from the reference table to be used for any space-group setting. We also show how the vertices of an asymmetric unit can easily be computed from the information in the reference table.

A Switch I Mutant of Cdc42 Exhibits Less Conformational Freedom

Cdc42 is a Ras-related small G-protein and functions as a molecular switch in signal transduction pathways linked with cell growth and differentiation. It is controlled by cycling between GTP-bound (active) and GDP-bound (inactive) forms. Nucleotide binding and hydrolysis are modulated by interactions with effectors and/or regulatory proteins. These interactions are centralized in two relatively flexible "Switch" regions as characterized by internal dynamics on multiple time scales [Loh, A. P., et al. (2001) Biochemistry 40, 4590-4600], and this flexibility may be essential for protein interactions. In the Switch I region, Thr(35) seems to be critical for function, as it is completely invariant in Ras-related proteins. To investigate the importance of conformational flexibility in Switch I of Cdc42, we mutated threonine to alanine, determined the solution structure, and characterized the backbone dynamics of the single-point mutant protein, Cdc42(T35A). Backbone dynamics data suggest that the mutation changes the time scale of the internal motions of several residues, with several resonances not being discernible in wild-type Cdc42 [Adams, P. D., and Oswald, R. E. (2007) Biomol. NMR Assignments 1, 225-227]. The mutation does not appear to affect the thermal stability of Cdc42, and chymotrypsin digestion data further suggest that changes in the conformational flexibility of Switch I slow proteolytic cleavage relative to that of the wild type. In vitro binding assays show less binding of Cdc42(T35A), relative to that of wild type, to a GTPase binding protein that inhibits GTP hydrolysis in Cdc42. These results suggest that the mutation of T(35) leads to the loss of conformational freedom in Switch I that could affect effector-regulatory protein interactions.

The Phenix Software for Automated Determination of Macromolecular Structures

X-ray crystallography is a critical tool in the study of biological systems. It is able to provide information that has been a prerequisite to understanding the fundamentals of life. It is also a method that is central to the development of new therapeutics for human disease. Significant time and effort are required to determine and optimize many macromolecular structures because of the need for manual interpretation of complex numerical data, often using many different software packages, and the repeated use of interactive three-dimensional graphics. The Phenix software package has been developed to provide a comprehensive system for macromolecular crystallographic structure solution with an emphasis on automation. This has required the development of new algorithms that minimize or eliminate subjective input in favor of built-in expert-systems knowledge, the automation of procedures that are traditionally performed by hand, and the development of a computational framework that allows a tight integration between the algorithms. The application of automated methods is particularly appropriate in the field of structural proteomics, where high throughput is desired. Features in Phenix for the automation of experimental phasing with subsequent model building, molecular replacement, structure refinement and validation are described and examples given of running Phenix from both the command line and graphical user interface.

Blind Image Analysis for the Compositional and Structural Characterization of Plant Cell Walls

A new image analysis strategy is introduced to determine the composition and the structural characteristics of plant cell walls by combining Raman microspectroscopy and unsupervised data mining methods. The proposed method consists of three main steps: spectral preprocessing, spatial clustering of the image and finally estimation of spectral profiles of pure components and their weights. Point spectra of Raman maps of cell walls were preprocessed to remove noise and fluorescence contributions and compressed with PCA. Processed spectra were then subjected to k-means clustering to identify spatial segregations in the images. Cell wall images were reconstructed with cluster identities and each cluster was represented by the average spectrum of all the pixels in the cluster. Pure components spectra were estimated by spectral entropy minimization criteria with simulated annealing optimization. Two pure spectral estimates that represent lignin and carbohydrates were recovered and their spatial distributions were calculated. Our approach partitioned the cell walls into many sublayers, based on their composition, thus enabling composition analysis at subcellular levels. It also overcame the well known problem that native lignin spectra in lignocellulosics have high spectral overlap with contributions from cellulose and hemicelluloses, thus opening up new avenues for microanalyses of monolignol composition of native lignin and carbohydrates without chemical or mechanical extraction of the cell wall materials.

A New Generation of Crystallographic Validation Tools for the Protein Data Bank

This report presents the conclusions of the X-ray Validation Task Force of the worldwide Protein Data Bank (PDB). The PDB has expanded massively since current criteria for validation of deposited structures were adopted, allowing a much more sophisticated understanding of all the components of macromolecular crystals. The size of the PDB creates new opportunities to validate structures by comparison with the existing database, and the now-mandatory deposition of structure factors creates new opportunities to validate the underlying diffraction data. These developments highlighted the need for a new assessment of validation criteria. The Task Force recommends that a small set of validation data be presented in an easily understood format, relative to both the full PDB and the applicable resolution class, with greater detail available to interested users. Most importantly, we recommend that referees and editors judging the quality of structural experiments have access to a concise summary of well-established quality indicators.

High-throughput Enzymatic Hydrolysis of Lignocellulosic Biomass Via In-situ Regeneration

The high cost of lignocellulolytic enzymes is one of the main barriers towards the development of economically competitive biorefineries. Enzyme engineering can be used to significantly increase the production rate as well as specific activity of enzymes. However, the success of enzyme optimization efforts is currently limited by a lack of robust high-throughput (HTP) cellulase screening platforms for insoluble pretreated lignocellulosic substrates. We have developed a cost-effective microplate based HTP enzyme-screening platform for ionic liquid (IL) pretreated lignocellulose. By performing in-situ biomass regeneration in micro-volumes, we can volumetrically meter biomass (sub-mg loading) and also precisely control the amount of residual IL for engineering novel IL-tolerant cellulases. Our platform only requires straightforward liquid-handling steps and allows the integration of biomass regeneration, washing, saccharification, and imaging steps in a single microtiter plate. The proposed method can be used to screen individual cellulases as well as to develop novel cellulase cocktails.

Structure of a Three-domain Sesquiterpene Synthase: a Prospective Target for Advanced Biofuels Production

The sesquiterpene bisabolene was recently identified as a biosynthetic precursor to bisabolane, an advanced biofuel with physicochemical properties similar to those of D2 diesel. High-titer microbial bisabolene production was achieved using Abies grandis α-bisabolene synthase (AgBIS). Here, we report the structure of AgBIS, a three-domain plant sesquiterpene synthase, crystallized in its apo form and bound to five different inhibitors. Structural and biochemical characterization of the AgBIS terpene synthase Class I active site leads us to propose a catalytic mechanism for the cyclization of farnesyl diphosphate into bisabolene via a bisabolyl cation intermediate. Further, we describe the nonfunctional AgBIS Class II active site whose high similarity to bifunctional diterpene synthases makes it an important link in understanding terpene synthase evolution. Practically, the AgBIS crystal structure is important in future protein engineering efforts to increase the microbial production of bisabolene.

Iotbx.cif: a Comprehensive CIF Toolbox

iotbx.cif is a new software module for the development of applications that make use of the CIF format. Comprehensive tools are provided for input, output and validation of CIFs, as well as for interconversion with high-level cctbx [Grosse-Kunstleve, Sauter, Moriarty & Adams (2002). J. Appl. Cryst.35, 126-136] crystallographic objects. The interface to the library is written in Python, whilst parsing is carried out using a compiled parser, combining the performance of a compiled language (C++) with the benefits of using an interpreted language.

Supplementation of Intracellular XylR Leads to Co-utilization of Hemicellulose Sugars

E. coli has the potential to be a powerful biocatalyst for the conversion of lignocellulosic biomass into useful materials such as biofuels and polymers. One important challenge in using E. coli for the transformation of biomass sugars is diauxie or sequential utilization of different types of sugars. We demonstrate that by increasing the intracellular levels of the transcription factor XylR, the preferential consumption of arabinose before xylose can be eliminated. In addition, XylR augmentation must be finely tuned for robust co-utilization of these two hemicellulosic sugars. Using a novel technique for scarless gene insertion, an additional copy of xylR was inserted into the araBAD operon. The resulting strain was superior at co-metabolizing mixtures of arabinose and xylose and was able to produce at least 36% more ethanol compared to wild type strains. This strain will be a useful starting point for the development of an E. coli biocatalyst that can simultaneously convert all biomass sugars.

Encoding Substrates with Mass Tags to Resolve Stereospecific Reactions Using Nimzyme

The nanostructure-initiator mass spectrometry based enzyme assay (Nimzyme) provides a rapid method for screening glycan modifying reactions. However, this approach cannot resolve stereospecific reactions which are common in glycobiology and are typically assayed using lower-throughput methods (gas chromatography/mass spectrometry (GC/MS) or liquid chromatography/tandem mass spectrometry (LC/MS/MS) analysis) often in conjunction with stable isotopically labeled reactants. However, in many applications, library size necessitates the development of higher-throughput screening approaches of stereospecific reactions from crude sample preparations. Therefore, here we test the approach of utilizing Nimzyme linkers with unique masses to encode substrate identity such that this assay can resolve stereospecific reactions.

Mechanism of Nucleotide Sensing in Group II Chaperonins

Group II chaperonins mediate protein folding in an ATP-dependent manner in eukaryotes and archaea. The binding of ATP and subsequent hydrolysis promotes the closure of the multi-subunit rings where protein folding occurs. The mechanism by which local changes in the nucleotide-binding site are communicated between individual subunits is unknown. The crystal structure of the archaeal chaperonin from Methanococcus maripaludis in several nucleotides bound states reveals the local conformational changes associated with ATP hydrolysis. Residue Lys-161, which is extremely conserved among group II chaperonins, forms interactions with the γ-phosphate of ATP but shows a different orientation in the presence of ADP. The loss of the ATP γ-phosphate interaction with Lys-161 in the ADP state promotes a significant rearrangement of a loop consisting of residues 160-169. We propose that Lys-161 functions as an ATP sensor and that 160-169 constitutes a nucleotide-sensing loop (NSL) that monitors the presence of the γ-phosphate. Functional analysis using NSL mutants shows a significant decrease in ATPase activity, suggesting that the NSL is involved in timing of the protein folding cycle.

Raman-spectroscopy-based Noninvasive Microanalysis of Native Lignin Structure

A new robust, noninvasive, Raman microspectroscopic method is introduced to analyze the structure of native lignin. Lignin spectra of poplar, Arabidopsis, and Miscanthus were recovered and structural differences were unambiguously detected. Compositional analysis of 4-coumarate-CoA ligase suppressed transgenic poplar showed that the syringyl-to-guaiacyl ratio decreased by 35% upon the mutation. A cell-specific compositional analysis of basal stems of Arabidopsis showed similar distributions of S and G monolignols in xylary fiber cells and interfascicular cells.

Waiting
simple hit counter