RESUMO
Cryo-EM images have extremely low signal-to-noise levels because biological macromolecules are highly radiation-sensitive, requiring low-dose imaging, and because the molecules are poor in contrast. Confident recovery of the signal requires the averaging of many images, the iterative optimization of parameters and the introduction of much prior information. Poor parameter estimates, overfitting and variations in signal strength and resolution across the resulting reconstructions remain frequent issues. Because biological samples are real-space phenomena, exhibiting local variations, real-space measures can be both more reliable and more appropriate than Fourier-space measures. Real-space measures can be calculated separately over each differing region of an image or volume. Real-space filters can be applied according to the local need. Powerful prior information, not available in Fourier space, can be introduced in real space. Priors can be applied in real space in ways that Fourier space precludes. The treatment of biological phenomena remains highly dependent on spatial frequency, however, which would normally be handled in Fourier space. We believe that measures and filters based around real-space operations on extracted frequency bands, i.e. a series of band-pass filtered real-space volumes, and over real-space densities of striding (sequentially increasing or decreasing) resolution through Fourier space are the best way to address this and will perform better than global Fourier-space-based approaches. Future developments in image processing within the field are generally expected to be based on a mixture of both rationally designed and deep-learning approaches, and to incorporate novel prior information from developments such as AlphaFold. Regardless of approach, it is clear that `locality', through real-space measures, filters and processing, will become central to image processing.
Assuntos
Algoritmos , Processamento de Imagem Assistida por Computador , Microscopia Crioeletrônica/métodos , Processamento de Imagem Assistida por Computador/métodosRESUMO
In 2020, cryo-EM single-particle analysis achieved true atomic resolution thanks to technological developments in hardware and software. The number of high-resolution reconstructions continues to grow, increasing the importance of the accurate determination of atomic coordinates. Here, a new Python package and program called Servalcat is presented that is designed to facilitate atomic model refinement. Servalcat implements a refinement pipeline using the program REFMAC5 from the CCP4 package. After the refinement, Servalcat calculates a weighted Fo - Fc difference map, which is derived from Bayesian statistics. This map helps manual and automatic model building in real space, as is common practice in crystallography. The Fo - Fc map helps in the visualization of weak features including hydrogen densities. Although hydrogen densities are weak, they are stronger than in the electron-density maps produced by X-ray crystallography, and some H atoms are even visible at â¼1.8â Å resolution. Servalcat also facilitates atomic model refinement under symmetry constraints. If point-group symmetry has been applied to the map during reconstruction, the asymmetric unit model is refined with the appropriate symmetry constraints.
Assuntos
Algoritmos , Microscopia Crioeletrônica/métodos , Hidrogênio/química , Substâncias Macromoleculares/química , Imagem Individual de Molécula/métodos , Software , Modelos MolecularesRESUMO
This paper describes outcomes of the 2019 Cryo-EM Model Challenge. The goals were to (1) assess the quality of models that can be produced from cryogenic electron microscopy (cryo-EM) maps using current modeling software, (2) evaluate reproducibility of modeling results from different software developers and users and (3) compare performance of current metrics used for model evaluation, particularly Fit-to-Map metrics, with focus on near-atomic resolution. Our findings demonstrate the relatively high accuracy and reproducibility of cryo-EM models derived by 13 participating teams from four benchmark maps, including three forming a resolution series (1.8 to 3.1 Å). The results permit specific recommendations to be made about validating near-atomic cryo-EM structures both in the context of individual experiments and structure data archives such as the Protein Data Bank. We recommend the adoption of multiple scoring parameters to provide full and objective annotation and assessment of the model, reflective of the observed cryo-EM map density.
Assuntos
Microscopia Crioeletrônica/métodos , Modelos Moleculares , Cristalografia por Raios X , Conformação Proteica , Proteínas/químicaRESUMO
Single particle analysis has become a key structural biology technique. Experimental images are extremely noisy, and during iterative refinement it is possible to stably incorporate noise into the reconstruction. Such "over-fitting" can lead to misinterpretation of the structure and flawed biological results. Several strategies are routinely used to prevent over-fitting, the most common being independent refinement of two sides of a split dataset. In this study, we show that over-fitting remains an issue within regions of low local signal-to-noise, despite independent refinement of half datasets. We propose a modification of the refinement process through the application of a local signal-to-noise filter: SIDESPLITTER. We show that our approach can reduce over-fitting for both idealised and experimental data while maintaining independence between the two sides of a split refinement. SIDESPLITTER refinement leads to improved density, and can also lead to improvement of the final resolution in extreme cases where datasets are prone to severe over-fitting, such as small membrane proteins.
Assuntos
Imageamento Tridimensional , Proteínas de Membrana/ultraestrutura , Modelos Moleculares , Imagem Individual de Molécula/métodos , Algoritmos , Proteínas de Membrana/química , Razão Sinal-Ruído , SoftwareRESUMO
Confidence maps provide complementary information for interpreting cryo-EM densities as they indicate statistical significance with respect to background noise. They can be thresholded by specifying the expected false-discovery rate (FDR), and the displayed volume shows the parts of the map that have the corresponding level of significance. Here, the basic statistical concepts of confidence maps are reviewed and practical guidance is provided for their interpretation and usage inside the CCP-EM suite. Limitations of the approach are discussed and extensions towards other error criteria such as the family-wise error rate are presented. The observed map features can be rendered at a common isosurface threshold, which is particularly beneficial for the interpretation of weak and noisy densities. In the current article, a practical guide is provided to the recommended usage of confidence maps.
Assuntos
Microscopia Crioeletrônica/métodos , ATPases Bacterianas Próton-Translocadoras/química , Gráficos por Computador , Modelos Moleculares , Conformação Proteica , Ribossomos/química , Eletricidade Estática , Estatística como Assunto , Vírus do Mosaico do Tabaco , Interface Usuário-ComputadorRESUMO
We present LAFTER, an algorithm for de-noising single particle reconstructions from cryo-EM. Single particle analysis entails the reconstruction of high-resolution volumes from tens of thousands of particle images with low individual signal-to-noise. Imperfections in this process result in substantial variations in the local signal-to-noise ratio within the resulting reconstruction, complicating the interpretation of molecular structure. An effective local de-noising filter could therefore improve interpretability and maximise the amount of useful information obtained from cryo-EM maps. LAFTER is a local de-noising algorithm based on a pair of serial real-space filters. It compares independent half-set reconstructions to identify and retain shared features that have power greater than the noise. It is capable of recovering features across a wide range of signal-to-noise ratios, and we demonstrate recovery of the strongest features at Fourier shell correlation (FSC) values as low as 0.144 over a 2563-voxel cube. A fast and computationally efficient implementation of LAFTER is freely available. We also propose a new way to evaluate the effectiveness of real-space filters for noise suppression, based on the correspondence between two FSC curves: 1) the FSC between the filtered and unfiltered volumes, and 2) Cref, the FSC between the unfiltered volume and a hypothetical noiseless volume, which can readily be estimated from the FSC between two half-set reconstructions.
Assuntos
Algoritmos , Microscopia Crioeletrônica/métodos , Microscopia Eletrônica de Transmissão/métodos , Razão Sinal-RuídoRESUMO
Scripting programming languages provide the fastest means of prototyping complex functionality. Those with a syntax and grammar resembling human language also greatly enhance the maintainability of the produced source code. Furthermore, the combination of a powerful, machine-independent scripting language with binary libraries tailored for each computer architecture allows programs to break free from the tight boundaries of efficiency traditionally associated with scripts. In the present work, we describe how an efficient C++ crystallographic library such as Clipper can be wrapped, adapted and generalized for use in both crystallographic and electron cryo-microscopy applications, scripted with the Python language. We shall also place an emphasis on best practices in automation, illustrating how this can be achieved with this new Python module.
Assuntos
Bases de Dados de Proteínas , Linguagens de Programação , Proteínas/química , Microscopia Crioeletrônica , Conformação ProteicaRESUMO
As part of its remit to provide computational support to the cryo-EM community, the Collaborative Computational Project for Electron cryo-Microscopy (CCP-EM) has produced a software framework which enables easy access to a range of programs and utilities. The resulting software suite incorporates contributions from different collaborators by encapsulating them in Python task wrappers, which are then made accessible via a user-friendly graphical user interface as well as a command-line interface suitable for scripting. The framework includes tools for project and data management. An overview of the design of the framework is given, together with a survey of the functionality at different levels. The current CCP-EM suite has particular strength in the building and refinement of atomic models into cryo-EM reconstructions, which is described in detail.
Assuntos
Microscopia Crioeletrônica/métodos , Software , Design de Software , Interface Usuário-ComputadorRESUMO
The use of slab-like flat specimens for electron cryo-tomography restricts the range of viewing angles that can be used. This leads to the "missing wedge" problem, which causes artefacts and anisotropic resolution in reconstructed tomograms. Cylindrical specimens provide a way to eliminate the problem, since they allow imaging from a full range of viewing angles around the tilt axis. Such specimens have been used before for tomography of radiation-insensitive samples at room temperature, but never for frozen-hydrated specimens. Here, we demonstrate the use of thin-walled carbon tubes as specimen holders, allowing the preparation of cylindrical frozen-hydrated samples of ribosomes, liposomes and whole bacterial cells. Images acquired from these cylinders have equal quality at all viewing angles, and the accessible tilt range is restricted only by the physical limits of the microscope. Tomographic reconstructions of these specimens demonstrate that the effects of the missing wedge are substantially reduced, and could be completely eliminated if a full tilt range was used. The overall quality of these tomograms is still lower than that obtained by existing methods, but improvements are likely in future.
Assuntos
Tomografia com Microscopia Eletrônica/instrumentação , Manejo de Espécimes/instrumentação , Manejo de Espécimes/métodos , Carbono/química , Desenho de Equipamento , Escherichia coli/ultraestrutura , Lipossomos/ultraestrutura , Nanopartículas/química , Ribossomos/ultraestruturaRESUMO
The Pi sampling method is derived from the incomplete factorial approach to macromolecular crystallization screen design. The resulting `Pi screens' have a modular distribution of a given set of up to 36 stock solutions. Maximally diverse conditions can be produced by taking into account the properties of the chemicals used in the formulation and the concentrations of the corresponding solutions. The Pi sampling method has been implemented in a web-based application that generates screen formulations and recipes. It is particularly adapted to screens consisting of 96 different conditions. The flexibility and efficiency of Pi sampling is demonstrated by the crystallization of soluble proteins and of an integral membrane-protein sample.
Assuntos
Cristalização/métodos , Proteínas de Membrana/química , Algoritmos , Animais , Humanos , Receptores Acoplados a Proteínas G/química , SoluçõesRESUMO
The pathogenic bacteria Bordetella parapertussis and Bordetella bronchiseptica express a lipopolysaccharide O antigen containing a polymer of 2,3-diacetamido-2,3-dideoxy-l-galacturonic acid. The O-antigen cluster contains three neighbouring genes that encode proteins belonging to the short-chain dehydrogenase/reductase (SDR) family, wbmF, wbmG and wbmH, and we aimed to elucidate their individual functions. Mutation and complementation implicate each gene in O-antigen expression but, as their putative sugar nucleotide substrates are not currently available, biochemical characterisation of WbmF, WbmG and WbmH is impractical at the present time. SDR family members catalyse a wide range of chemical reactions including oxidation, reduction and epimerisation. Because they typically share low sequence conservation, however, catalytic function cannot be predicted from sequence analysis alone. In this context, structural characterisation of the native proteins, co-crystals and small-molecule soaks enables differentiation of the functions of WbmF, WbmG and WbmH. These proteins exhibit typical SDR architecture and coordinate NAD. In the substrate-binding domain, all three enzymes bind uridyl nucleotides. WbmG contains a typical SDR catalytic TYK triad, which is required for oxidoreductase function, but the active site is devoid of additional acid-base functionality. Similarly, WbmH possesses a TYK triad, but an otherwise feature-poor active site. Consequently, 3,5-epimerase function can probably be ruled out for these enzymes. The WbmF active site contains conserved 3,5-epimerase features, namely, a positionally conserved cysteine (Cys133) and basic side chain (His90 or Asn213), but lacks the serine/threonine component of the SDR triad and therefore may not act as an oxidoreductase. The data suggest a pathway for synthesis of the O-antigen precursor UDP-2,3-diacetamido-2,3-dideoxy-l-galacturonic acid and illustrate the usefulness of structural data in predicting protein function.
Assuntos
Bordetella/imunologia , Antígenos O/biossíntese , Oxirredutases/metabolismo , Catálise , Cristalização , Cristalografia por Raios X , Eletroforese em Gel de Poliacrilamida , Oxirredutases/química , Relação Estrutura-Atividade , Especificidade por SubstratoRESUMO
The short-chain dehydrogenase enzymes WbmF, WbmG and WbmH from Bordetella bronchiseptica were cloned into Escherichia coli expression vectors, overexpressed and purified to homogeneity. Crystals of all three wild-type enzymes were obtained using vapour-diffusion crystallization with high-molecular-weight PEGs as a primary precipitant at alkaline pH. Some of the crystallization conditions permitted the soaking of crystals with cofactors and nucleotides or nucleotide sugars, which are possible substrate compounds, and further conditions provided co-complexes of two of the proteins with these compounds. The crystals diffracted to resolutions of between 1.50 and 2.40 A at synchrotron X-ray sources. The synchrotron data obtained were sufficient to determine eight structures of the three enzymes in complex with a variety of cofactors and substrate molecules.