$$\rightleftharpoonup{xx}$$
$$\longleftharp{xx}$$,
$$\longrightharp{xx}$$,
Macromolecular X-ray crystallography (MX) is, by far, the most used method for gaining atomic resolution insight into the three-dimensional structures of biological macromolecules. However, a major bottle necks is the requirement for relatively large, well diffracting crystals.
Often, and particularly when crystallizing membrane proteins, only very small crystals of a few microns in the largest dimension can be obtained. Radiation damage effects limit the resolution of a complete diffraction data set that can be collected from a single micro crystal2, and very often, it is necessary to improve the signal to noise ratio and hence data set resolution, by merging several partial diffraction data sets from different, but isomorphic crystals. The increases in flux density of X-ray beams at synchrotron sources and elsewhere (e.g. X-ray free-electron lasers (X-FELs)), have meant that useful partial diffraction data sets can be collected from even very small crystals of biological macromolecules. This, in turn, has led to the development of new techniques for the collection and merging of partial diffraction data sets collected from many different crystals in order to produce a complete data set for structure solution. Such techniques are commonly referred to as serial crystallography (SX)3,4,5,6,7,8. A prototypical example of SX is the use of injector devices to introduce a narrow stream of a crystal slurry into the X-ray beam3,4,5. A diffraction pattern is recorded every time a crystal is exposed to X-rays leading to the collection, from many thousands of individual crystals, of 'still' diffraction images, information which is then merged to produce a complete data set. However, a considerable disadvantage of this type of serial data collection is that the processing of still images can be problematic. The data quality is considerably improved if crystals can be rotated and/or several diffraction images are collected from the same crystal during serial crystallography experiments6.
MeshAndCollect1 was developed with the aim of combining SX with 'standard' MX rotation data collection and allows, in an automatic fashion, experimenters to collect partial diffraction data sets from numerous crystals of the same macromolecular target mounted on the same or different sample holders. A complete diffraction data set is then obtained by merging the most isomorphous of the partial data sets collected. MeshAndCollect is compatible with any state-of-the-art synchrotron X-ray beamline for MX (ideally an insertion device facility with a relatively small (20 µm or less) beam size at the sample position). In addition to the compilation of complete data sets from a series of small, well-diffracting crystals, the method is also very suitable for the initial experimental assessment of the diffraction quality of micro-crystals and for the processing of opaque samples, e.g., in meso grown microcrystals of membrane proteins9.
At the start of a MeshAndCollect experiment, the positions, in two dimensions, of each of the many crystal contained in a single sample holder are determined using a low dose X-ray scan. The diffraction images collected during this scan are automatically analyzed by the program DOZOR1, which sorts the positions of the crystals on the sample holder according to their respective diffraction strength. Positions for the collection of partial data sets are assigned automatically based on a diffraction strength cut-off and, in the last step, small wedges of diffraction data, typically ±5° of rotation, are collected from each chosen position. Experience has shown that this rotation range provides a sufficient amount of reflections per crystal for partial data set scaling purposes, while at the same time, reducing possible crystal centering issues and the chance of exposing multiple crystals in a particularly crowded support1. The individual diffraction data wedges (partial data sets) are then processed either manually or using automated data processing pipelines10,11,12,13. For downstream structure determination it is then necessary to find the best combination of partial data sets to be merged14,15,16 after which the resulting complete data set can be treated in the same way as one originating from a single crystal experiment.
As an example of MeshAndCollect in practice, we present here the solution of the crystal structure of the Cyan Fluorescent Protein (CFP) Cerulean, using a diffraction data set constructed from the combination of partial data sets collected from a series of microcrystals mounted on the same sample support. Cerulean has been engineered from the Green Fluorescent Protein (GFP) from the jellyfish Aequorea victoria17, whose fluorescent chromophore is autocatalytically formed from the cyclisation of three consecutive amino acid residues. Cerulean is obtained from GFP by mutating the first and second residues of the chromophore, a serine and a tyrosine, to threonine (S65T) and tryptophan (Y66W) respectively and adapting the chromophore environment with further mutations (Y145A, N146I, H148D, M153T and V163A) to produce a significant, yet suboptimal fluorescence level of QY = 0.4918,19,20. The suboptimal fluorescent properties of Cerulean have been proposed to be linked to complex protein dynamics involving the imperfect stabilization of one of the eleven β-strands of the protein21 and to the accommodation of two different chromophore isomers depending on the pH and irradiation conditions22. We chose to work with Cerulean as a model protein illustrating the use of the MeshAndCollect protocol due to the relatively ease of tuning crystal size depending on the crystallization. The structure of Cerulean is very similar to that of its parent protein GFP, as it is constituted of a β-barrel formed of eleven β-strands surrounding an α-helix, which bears the chromophore.