RESUMO
Particle localization (picking) in digital tomograms is a laborious and time-intensive step in cryogenic electron tomography (cryoET) analysis often requiring considerable user involvement, thus becoming a bottleneck for automated cryoET subtomogram averaging (STA) pipelines. In this paper, we introduce a deep learning framework called PickYOLO to tackle this problem. PickYOLO is a super-fast, universal particle detector based on the deep-learning real-time object recognition system YOLO (You Only Look Once), and tested on single particles, filamentous structures, and membrane-embedded particles. After training with the centre coordinates of a few hundred representative particles, the network automatically detects additional particles with high yield and reliability at a rate of 0.24-3.75 s per tomogram. PickYOLO can automatically detect number of particles comparable to those manually selected by experienced microscopists. This makes PickYOLO a valuable tool to substantially reduce the time and manual effort needed to analyse cryoET data for STA, greatly aiding in high-resolution cryoET structure determination.
Assuntos
Aprendizado Profundo , Elétrons , Reprodutibilidade dos Testes , Microscopia Crioeletrônica/métodos , Tomografia com Microscopia Eletrônica/métodos , Processamento de Imagem Assistida por Computador/métodosRESUMO
The Collaborative Computational Project No. 4 (CCP4) is a UK-led international collective with a mission to develop, test, distribute and promote software for macromolecular crystallography. The CCP4 suite is a multiplatform collection of programs brought together by familiar execution routines, a set of common libraries and graphical interfaces. The CCP4 suite has experienced several considerable changes since its last reference article, involving new infrastructure, original programs and graphical interfaces. This article, which is intended as a general literature citation for the use of the CCP4 software suite in structure determination, will guide the reader through such transformations, offering a general overview of the new features and outlining future developments. As such, it aims to highlight the individual programs that comprise the suite and to provide the latest references to them for perusal by crystallographers around the world.
Assuntos
Proteínas , Software , Proteínas/química , Cristalografia por Raios X , Substâncias MacromolecularesRESUMO
Biodegradable magnesium (Mg) alloys can revolutionize osteosynthesis, because they have mechanical properties similar to those of the bone, and degrade over time, avoiding the need of removal surgery. However, they are not yet routinely applied because their degradation behavior is not fully understood. In this study we have investigated and quantified the degradation and osseointegration behavior of two biodegradable Mg alloys based on gadolinium (Gd) at high resolution. Mg-5Gd and Mg-10Gd screws were inserted in rat tibia for 4, 8 and 12 weeks. Afterward, the degradation rate and degradation homogeneity, as well as bone-to-implant interface, were studied with synchrotron radiation micro computed tomography and histology. Titanium (Ti) and polyether ether ketone (PEEK) were used as controls material to evaluate osseointegration. Our results showed that Mg-5Gd degraded faster and less homogeneously than Mg-10Gd. Both alloys gradually form a stable degradation layer at the interface and were surrounded by new bone tissue. The results were correlated to in vitro data obtained from the same material and shape. The average bone-to-implant contact of the Mg-xGd implants was comparable to that of Ti and higher than for PEEK. The results suggest that both Mg-xGd alloys are suitable as materials for bone implants.
RESUMO
Highly accurate segmentation of large 3D volumes is a demanding task. Challenging applications like the segmentation of synchrotron radiation microtomograms (SRµCT) at high-resolution, which suffer from low contrast, high spatial variability and measurement artifacts, readily exceed the capacities of conventional segmentation methods, including the manual segmentation by human experts. The quantitative characterization of the osseointegration and spatio-temporal biodegradation process of bone implants requires reliable, and very precise segmentation. We investigated the scaling of 2D U-net for high resolution grayscale volumes by three crucial model hyper-parameters (i.e., the model width, depth, and input size). To leverage the 3D information of high-resolution SRµCT, common three axes prediction fusing is extended, investigating the effect of adding more than three axes prediction. In a systematic evaluation we compare the performance of scaling the U-net by intersection over union (IoU) and quantitative measurements of osseointegration and degradation parameters. Overall, we observe that a compound scaling of the U-net and multi-axes prediction fusing with soft voting yields the highest IoU for the class "degradation layer". Finally, the quantitative analysis showed that the parameters calculated with model segmentation deviated less from the high quality results than those obtained by a semi-automatic segmentation method.
Assuntos
Biodegradação Ambiental , Síncrotrons , Microtomografia por Raio-X/métodos , Artefatos , Aprendizado Profundo , Reações Falso-Positivas , Humanos , Processamento de Imagem Assistida por Computador , Ciência dos Materiais , Redes Neurais de Computação , Osseointegração , Próteses e Implantes , Reprodutibilidade dos TestesRESUMO
The science of X-ray free-electron lasers (XFELs) critically depends on the performance of the X-ray laser and on the quality of the samples placed into the X-ray beam. The stability of biological samples is limited and key biomolecular transformations occur on short timescales. Experiments in biology require a support laboratory in the immediate vicinity of the beamlines. The XBI BioLab of the European XFEL (XBI denotes XFEL Biology Infrastructure) is an integrated user facility connected to the beamlines for supporting a wide range of biological experiments. The laboratory was financed and built by a collaboration between the European XFEL and the XBI User Consortium, whose members come from Finland, Germany, the Slovak Republic, Sweden and the USA, with observers from Denmark and the Russian Federation. Arranged around a central wet laboratory, the XBI BioLab provides facilities for sample preparation and scoring, laboratories for growing prokaryotic and eukaryotic cells, a Bio Safety Level 2 laboratory, sample purification and characterization facilities, a crystallization laboratory, an anaerobic laboratory, an aerosol laboratory, a vacuum laboratory for injector tests, and laboratories for optical microscopy, atomic force microscopy and electron microscopy. Here, an overview of the XBI facility is given and some of the results of the first user experiments are highlighted.
RESUMO
Recent developments in cryogenic electron microscopy (cryo-EM) have enabled structural studies of large macromolecular complexes at resolutions previously only attainable using macromolecular crystallography. Although a number of methods can already assist in de novo building of models into high-resolution cryo-EM maps, automated and reliable map interpretation remains a challenge. Presented here is a systematic study of the accuracy of models built into cryo-EM maps using ARP/wARP. It is demonstrated that the local resolution is a good indicator of map interpretability, and for the majority of the test cases ARP/wARP correctly builds 90% of main-chain fragments in regions where the local resolution is 4.0â Å or better. It is also demonstrated that the coordinate accuracy for models built into cryo-EM maps is comparable to that of X-ray crystallographic models at similar local cryo-EM and crystallographic resolutions. The model accuracy also correlates with the refined atomic displacement parameters.
Assuntos
Microscopia Crioeletrônica/métodos , Substâncias Macromoleculares/química , Proteínas/química , Bases de Dados de Proteínas , Modelos Moleculares , Conformação Proteica , SoftwareRESUMO
The performance of automated protein model building usually decreases with resolution, mainly owing to the lower information content of the experimental data. This calls for a more elaborate use of the available structural information about macromolecules. Here, a new method is presented that uses structural homologues to improve the quality of protein models automatically constructed using ARP/wARP. The method uses local structural similarity between deposited models and the model being built, and results in longer main-chain fragments that in turn can be more reliably docked to the protein sequence. The application of the homology-based model extension method to the example of a CFA synthase at 2.7â Å resolution resulted in a more complete model with almost all of the residues correctly built and docked to the sequence. The method was also evaluated on 1493 molecular-replacement solutions at a resolution of 4.0â Å and better that were submitted to the ARP/wARP web service for model building. A significant improvement in the completeness and sequence coverage of the built models has been observed.
Assuntos
Cristalografia por Raios X/métodos , Proteínas/química , Algoritmos , Modelos Moleculares , Fragmentos de Peptídeos/química , Conformação Proteica , Homologia Estrutural de ProteínaRESUMO
The West-Life project (https://about.west-life.eu/) is a Horizon 2020 project funded by the European Commission to provide data processing and data management services for the international community of structural biologists, and in particular to support integrative experimental approaches within the field of structural biology. It has developed enhancements to existing web services for structure solution and analysis, created new pipelines to link these services into more complex higher-level workflows, and added new data management facilities. Through this work it has striven to make the benefits of European e-Infrastructures more accessible to life-science researchers in general and structural biologists in particular.
RESUMO
A novel approach to obtaining structural information from macromolecular X-ray data extending to resolutions as low as 20 A is presented. Following a simple map-segmentation procedure, the approximate shapes of the domains forming the structure are identified. A pattern-recognition comparative analysis of these shapes and those derived from the structures of domains from the PDB results in candidate structural models that can be used for a fit into the density map. It is shown that the placed candidate models can be employed for subsequent phase extension to higher resolution.
Assuntos
Cristalografia por Raios X/métodos , Bactérias/química , Proteínas de Bactérias/química , Proteínas de Bactérias/metabolismo , Bases de Dados de Proteínas , Elétrons , Modelos Moleculares , Mutagênicos/química , Multimerização Proteica , Estrutura Quaternária de Proteína , Estrutura Terciária de ProteínaRESUMO
BACKGROUND: Docking algorithms are developed to predict in which orientation two proteins are likely to bind under natural conditions. The currently used methods usually consist of a sampling step followed by a scoring step. We developed a weighted geometric correlation based on optimised atom specific weighting factors and combined them with our previously published amino acid specific scoring and with a comprehensive SVM-based scoring function. RESULTS: The scoring with the atom specific weighting factors yields better results than the amino acid specific scoring. In combination with SVM-based scoring functions the percentage of complexes for which a near native structure can be predicted within the top 100 ranks increased from 14% with the geometric scoring to 54% with the combination of all scoring functions. Especially for the enzyme-inhibitor complexes the results of the ranking are excellent. For half of these complexes a near-native structure can be predicted within the first 10 proposed structures and for more than 86% of all enzyme-inhibitor complexes within the first 50 predicted structures. CONCLUSION: We were able to develop a combination of different scoring schemes which considers a series of previously described and some new scoring criteria yielding a remarkable improvement of prediction quality.
Assuntos
Algoritmos , Modelos Químicos , Modelos Moleculares , Mapeamento de Interação de Proteínas/métodos , Análise de Sequência de Proteína/métodos , Sequência de Aminoácidos , Sítios de Ligação , Simulação por Computador , Dados de Sequência Molecular , Ligação ProteicaRESUMO
BACKGROUND: One of the most challenging aspects of protein-protein docking is the inclusion of flexibility into the docking procedure. We developed a postfilter where the grid-representation of proteins for docking is extended by an optimised weighting factor for each amino acid. RESULTS: For up to 86% of the evaluated complexes a near-native structure was within the top 5% of the ranked prediction output. The weighting factors obtained by the optimisation procedure correlate to a certain extent with the flexibility of the amino acids, their hydrophobicity and with their propensity to be in the interface. CONCLUSION: Use of the optimised amino acid specific parameters yields a strong increase of near-native structures on the first ranks of the prediction.
Assuntos
Aminoácidos/química , Biologia Computacional/métodos , Proteínas/química , Análise de Sequência de Proteína/métodos , Algoritmos , Animais , Simulação por Computador , Bases de Dados de Proteínas , Humanos , Modelos Moleculares , Modelos Estatísticos , Ligação Proteica , Dobramento de Proteína , Reprodutibilidade dos TestesRESUMO
In this work we present two methods for the reranking of protein-protein docking studies. One scoring method searches the InterDom database for domains that are available in the proteins to be docked and evaluates the interaction of these domains in other complexes of known structure. The second one analyzes the interface of each proposed conformation with regard to the conservation of Phe, Met, and Trp and their polar neighbor residues. The special relevance of these residues is based on a publication by Ma et al. (Proc Natl Acad Sci USA 2003;100:5772-5777), who compared the conservation of all residues in the interface region to the conservation on the rest of the protein's surface. The scoring functions were tested on 30 unbound docking test cases. The evaluation of the methods is based on the ability to rerank the output of a Fast Fourier Transformation (FFT) docking. Both were able to improve the ranking of the docking output. The best improvement was achieved for enzyme-inhibitor examples. Especially the domain-based scoring function was successful and able to place a near-native solution on one of the first six ranks for 13 of 17 (76%) enzyme-inhibitor complexes [in 53% (nine complexes) even on the first rank]. The method evaluating residue conservation allowed us to increase the number of good solutions within the first 100 ranks out of approximately 9000 in 82% of the 17 enzyme-inhibitor test cases, and for seven (41%) out of 17 enzyme-inhibitor complexes, a near native solution was placed within the first seven ranks.
Assuntos
Proteínas/química , Proteínas/metabolismo , Sítios de Ligação , Ligação Competitiva , Biologia/métodos , Inibidores Enzimáticos/química , Enzimas/química , Enzimas/metabolismo , Conformação Proteica , Software , Propriedades de SuperfícieRESUMO
The prediction of protein 3D structures close to insertions and deletions or, more generally, loop prediction, is still one of the major challenges in homology modeling projects. In this article, we developed ranking criteria and selection filters to improve knowledge-based loop predictions. These criteria were developed and optimized for a test data set containing 678 insertions and deletions. The examples are, in principle, predictable from the used loop database with an RMSD < 1 A and represent realistic modeling situations. Four noncorrelated criteria for the selection of fragments are evaluated. A fast prefilter compares the distance between the anchor groups in the template protein with the stems of the fragments. The RMSD of the anchor groups is used for fitting and ranking of the selected loop candidates. After fitting, repulsive close contacts of loop candidates with the template protein are used for filtering, and fragments with backbone torsion angles, which are unfavorable according to a knowledge-based potential, are eliminated. By the combined application of these filter criteria to the test set, it was possible to increase the percentage of predictions with a global RMSD < 1 A to over 50% among the first five ranks, with average global RMSD values for the first rank candidate that are between 1.3 and 2.2 A for different loop lengths. Compared to other examples described in the literature, our large numbers of test cases are not self-predictions, where loops are placed in a protein after a peptide loop has been cut out, but are attempts to predict structural changes that occur in evolution when a protein is affected by insertions and deletions.