Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 51
Filtrar
1.
Front Bioinform ; 3: 1311287, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38111685

RESUMO

Recent advances in Artificial Intelligence and Machine Learning (e.g., AlphaFold, RosettaFold, and ESMFold) enable prediction of three-dimensional (3D) protein structures from amino acid sequences alone at accuracies comparable to lower-resolution experimental methods. These tools have been employed to predict structures across entire proteomes and the results of large-scale metagenomic sequence studies, yielding an exponential increase in available biomolecular 3D structural information. Given the enormous volume of this newly computed biostructure data, there is an urgent need for robust tools to manage, search, cluster, and visualize large collections of structures. Equally important is the capability to efficiently summarize and visualize metadata, biological/biochemical annotations, and structural features, particularly when working with vast numbers of protein structures of both experimental origin from the Protein Data Bank (PDB) and computationally-predicted models. Moreover, researchers require advanced visualization techniques that support interactive exploration of multiple sequences and structural alignments. This paper introduces a suite of tools provided on the RCSB PDB research-focused web portal RCSB. org, tailor-made for efficient management, search, organization, and visualization of this burgeoning corpus of 3D macromolecular structure data.

2.
J Mol Biol ; 435(14): 168055, 2023 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-36958605

RESUMO

The human interactome is composed of around half a million interactions according to recent estimations and it is only for a small fraction of those that three-dimensional structural information is available. Indeed, the structural coverage of the human interactome is very low and given the complexity and time-consuming requirements of solving protein structures this problem will remain for the foreseeable future. Structural models, or predictions, of protein complexes can provide valuable information when the experimentally determined 3D structures are not available. Here we present CM2D3, a relational database containing structural models of the whole human interactome derived both from comparative modeling and data-driven docking. Starting from a consensus interactome derived from integrating several interactomics databases, a strategy was devised to derive structural models by computational means. Currently, CM2D3 includes 33338 structural models of which 5121 derived from comparative modeling and the remaining from docking. Of the latter, the structures of 14554 complexes were derived from monomers modeled by M4T while the rest were modeled with structures as predicted by AlphaFold2. Lastly, CM2D3 complements existing resources by focusing on models derived from both free-docking, as opposed to template-based docking, and hence expanding the available structural information on protein complexes to the scientific community. Database URL:http://www.bioinsilico.org/CM2D3.


Assuntos
Bases de Dados de Proteínas , Proteínas , Humanos , Biologia Computacional/métodos , Simulação de Acoplamento Molecular , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas/métodos , Proteínas/química , Software
3.
J Mol Biol ; 435(14): 167994, 2023 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-36738985

RESUMO

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) provides open access to experimentally-determined three-dimensional (3D) structures of biomolecules. The RCSB PDB RCSB.org research-focused web portal is used annually by many millions of users around the world. They access biostructure information, run complex queries utilizing various search services (e.g., full-text, structural and chemical attribute, chemical, sequence, and structure similarity searches), and visualize macromolecules in 3D, all at no charge and with no limitations on data usage. Notwithstanding more than 24,000-fold growth of the PDB over the past five decades, experimentally-determined structures are only available for a small subset of the millions of proteins of known sequence. Recently developed machine learning software tools can predict 3D structures of proteins at accuracies comparable to lower-resolution experimental methods. The RCSB PDB now provides access to ∼1,000,000 Computed Structure Models (CSMs) of proteins coming from AlphaFold DB and the ModelArchive alongside ∼200,000 experimentally-determined PDB structures. Both CSMs and PDB structures are available on RCSB.org and via well-established RCSB PDB Data, Search, and 1D-Coordinates application programming interfaces (APIs). Simultaneous delivery of PDB data and CSMs provides users with access to complementary structural information across the human proteome and those of model organisms and selected pathogens. API enhancements are backwards-compatible and programmatic users can "opt in" to access CSMs with minimal effort. Herein, we describe modifications to RCSB PDB cyberinfrastructure required to support sixfold scaling of 3D biostructure data delivery and lay the groundwork for scaling to accommodate hundreds of millions of CSMs.


Assuntos
Biologia Computacional , Bases de Dados de Proteínas , Humanos , Biologia Computacional/métodos , Conformação Proteica , Proteoma , Software
4.
Nucleic Acids Res ; 51(D1): D488-D508, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36420884

RESUMO

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), founding member of the Worldwide Protein Data Bank (wwPDB), is the US data center for the open-access PDB archive. As wwPDB-designated Archive Keeper, RCSB PDB is also responsible for PDB data security. Annually, RCSB PDB serves >10 000 depositors of three-dimensional (3D) biostructures working on all permanently inhabited continents. RCSB PDB delivers data from its research-focused RCSB.org web portal to many millions of PDB data consumers based in virtually every United Nations-recognized country, territory, etc. This Database Issue contribution describes upgrades to the research-focused RCSB.org web portal that created a one-stop-shop for open access to ∼200 000 experimentally-determined PDB structures of biological macromolecules alongside >1 000 000 incorporated Computed Structure Models (CSMs) predicted using artificial intelligence/machine learning methods. RCSB.org is a 'living data resource.' Every PDB structure and CSM is integrated weekly with related functional annotations from external biodata resources, providing up-to-date information for the entire corpus of 3D biostructure data freely available from RCSB.org with no usage limitations. Within RCSB.org, PDB structures and the CSMs are clearly identified as to their provenance and reliability. Both are fully searchable, and can be analyzed and visualized using the full complement of RCSB.org web portal capabilities.


Assuntos
Inteligência Artificial , Bases de Dados de Proteínas , Proteínas , Aprendizado de Máquina , Conformação Proteica , Proteínas/química , Reprodutibilidade dos Testes
5.
Protein Sci ; 31(12): e4482, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36281733

RESUMO

Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premiere open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.


Assuntos
Biologia Computacional , Proteínas , Humanos , Conformação Proteica , Bases de Dados de Proteínas , Proteínas/química , Biologia Computacional/métodos , Substâncias Macromoleculares/química
6.
Bioinformatics ; 38(12): 3304-3305, 2022 06 13.
Artigo em Inglês | MEDLINE | ID: mdl-35543462

RESUMO

MOTIVATION: Mapping positional features from one-dimensional (1D) sequences onto three-dimensional (3D) structures of biological macromolecules is a powerful tool to show geometric patterns of biochemical annotations and provide a better understanding of the mechanisms underpinning protein and nucleic acid function at the atomic level. RESULTS: We present a new library designed to display fully customizable interactive views between 1D positional features of protein and/or nucleic acid sequences and their 3D structures as isolated chains or components of macromolecular assemblies. AVAILABILITY AND IMPLEMENTATION: https://github.com/rcsb/rcsb-saguaro-3d. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Ácidos Nucleicos , Software , Bases de Dados de Proteínas , Substâncias Macromoleculares/química , Proteínas/química
7.
Protein Sci ; 31(1): 187-208, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34676613

RESUMO

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.


Assuntos
Biologia Computacional/história , Bases de Dados de Proteínas/história , Interface Usuário-Computador , Aniversários e Eventos Especiais , História do Século XX , História do Século XXI
8.
Integr Comp Biol ; 61(6): 2119-2131, 2022 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-34259842

RESUMO

Differences within a biological system are ubiquitous, creating variation in nature. Variation underlies all evolutionary processes and allows persistence and resilience in changing environments; thus, uncovering the drivers of variation is critical. The growing recognition that variation is central to biology presents a timely opportunity for determining unifying principles that drive variation across biological levels of organization. Currently, most studies that consider variation are focused at a single biological level and not integrated into a broader perspective. Here we explain what variation is and how it can be measured. We then discuss the importance of variation in natural systems, and briefly describe the biological research that has focused on variation. We outline some of the barriers and solutions to studying variation and its drivers in biological systems. Finally, we detail the challenges and opportunities that may arise when studying the drivers of variation due to the multi-level nature of biological systems. Examining the drivers of variation will lead to a reintegration of biology. It will further forge interdisciplinary collaborations and open opportunities for training diverse quantitative biologists. We anticipate that these insights will inspire new questions and new analytic tools to study the fundamental questions of what drives variation in biological systems and how variation has shaped life.


Assuntos
Evolução Biológica , Animais
9.
Bioinformatics ; 38(5): 1452-1454, 2022 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-34864908

RESUMO

MOTIVATION: Membrane proteins are encoded by approximately one fifth of human genes but account for more than half of all US FDA approved drug targets. Thanks to new technological advances, the number of membrane proteins archived in the PDB is growing rapidly. However, automatic identification of membrane proteins or inference of membrane location is not a trivial task. RESULTS: We present recent improvements to the RCSB Protein Data Bank web portal (RCSB PDB, rcsb.org) that provide a wealth of new membrane protein annotations integrated from four external resources: OPM, PDBTM, MemProtMD and mpstruc. We have substantially enhanced the presentation of data on membrane proteins. The number of membrane proteins with annotations available on rcsb.org was increased by ∼80%. Users can search for these annotations, explore corresponding tree hierarchies, display membrane segments at the 1D amino acid sequence level, and visualize the predicted location of the membrane layer in 3D. AVAILABILITY AND IMPLEMENTATION: Annotations, search, tree data and visualization are available at our rcsb.org web portal. Membrane visualization is supported by the open-source Mol* viewer (molstar.org and github.com/molstar/molstar). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Proteínas de Membrana , Software , Humanos , Conformação Proteica , Bases de Dados de Proteínas , Sequência de Aminoácidos
10.
J Mol Biol ; 433(11): 166656, 2021 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-32976910

RESUMO

Protein interactions play a crucial role among the different functions of a cell and are central to our understanding of cellular processes both in health and disease. Here we present Galaxy InteractoMIX (http://galaxy.interactomix.com), a platform composed of 13 different computational tools each addressing specific aspects of the study of protein-protein interactions, ranging from large-scale cross-species protein-wide interactomes to atomic resolution level of protein complexes. Galaxy InteractoMIX provides an intuitive interface where users can retrieve consolidated interactomics data distributed across several databases or uncover links between diseases and genes by analyzing the interactomes underlying these diseases. The platform makes possible large-scale prediction and curation protein interactions using the conservation of motifs, interology, or presence or absence of key sequence signatures. The range of structure-based tools includes modeling and analysis of protein complexes, delineation of interfaces and the modeling of peptides acting as inhibitors of protein-protein interactions. Galaxy InteractoMIX includes a range of ready-to-use workflows to run complex analyses requiring minimal intervention by users. The potential range of applications of the platform covers different aspects of life science, biomedicine, biotechnology and drug discovery where protein associations are studied.


Assuntos
Biologia Computacional/métodos , Mapeamento de Interação de Proteínas , Software , Motivos de Aminoácidos , Sequência Conservada , Modelos Moleculares , Interface Usuário-Computador , Fluxo de Trabalho
11.
Bioinformatics ; 36(22-23): 5526-5527, 2021 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-33313665

RESUMO

MOTIVATION: Interoperability between polymer sequences and structural data is essential for providing a complete picture of protein and gene features and helping to understand biomolecular function. RESULTS: Herein, we present two resources designed to improve interoperability between the RCSB Protein Data Bank, the NCBI and the UniProtKB data resources and visualize integrated data therefrom. The underlying tools provide a flexible means of mapping between the different coordinate spaces and an interactive tool allows convenient visualization of the 1-dimensional data over the web. AVAILABILITYAND IMPLEMENTATION: https://1d-coordinates.rcsb.org and https://rcsb.github.io/rcsb-saguaro. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

12.
J Mol Biol ; 433(11): 166704, 2021 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-33186584

RESUMO

The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interfaces (APIs), and FTP downloads. Herein, we present the architectural redesign of RCSB PDB data delivery services that build on existing PDBx/mmCIF data schemas. New data access APIs (data.rcsb.org) enable efficient delivery of all PDB archive data. A novel GraphQL-based API provides flexible, declarative data retrieval along with a simple-to-use REST API. A powerful new search system (search.rcsb.org) seamlessly integrates heterogeneous types of searches across the PDB archive. Searches may combine text attributes, protein or nucleic acid sequences, small-molecule chemical descriptors, 3D macromolecular shapes, and sequence motifs. The new RCSB.org architecture adheres to the FAIR Principles, empowering users to address a wide array of research problems in fundamental biology, biomedicine, biotechnology, bioengineering, and bioenergy.


Assuntos
Biologia Computacional , Bases de Dados de Proteínas , Substâncias Macromoleculares/química , Ferramenta de Busca
13.
Nucleic Acids Res ; 49(D1): D437-D451, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33211854

RESUMO

The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), the US data center for the global PDB archive and a founding member of the Worldwide Protein Data Bank partnership, serves tens of thousands of data depositors in the Americas and Oceania and makes 3D macromolecular structure data available at no charge and without restrictions to millions of RCSB.org users around the world, including >660 000 educators, students and members of the curious public using PDB101.RCSB.org. PDB data depositors include structural biologists using macromolecular crystallography, nuclear magnetic resonance spectroscopy, 3D electron microscopy and micro-electron diffraction. PDB data consumers accessing our web portals include researchers, educators and students studying fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences. During the past 2 years, the research-focused RCSB PDB web portal (RCSB.org) has undergone a complete redesign, enabling improved searching with full Boolean operator logic and more facile access to PDB data integrated with >40 external biodata resources. New features and resources are described in detail using examples that showcase recently released structures of SARS-CoV-2 proteins and host cell proteins relevant to understanding and addressing the COVID-19 global pandemic.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Substâncias Macromoleculares/química , Conformação Proteica , Proteínas/química , Bioengenharia/métodos , Pesquisa Biomédica/métodos , Biotecnologia/métodos , COVID-19/epidemiologia , COVID-19/prevenção & controle , COVID-19/virologia , Humanos , Substâncias Macromoleculares/metabolismo , Pandemias , Proteínas/genética , Proteínas/metabolismo , SARS-CoV-2/genética , SARS-CoV-2/metabolismo , SARS-CoV-2/fisiologia , Software , Proteínas Virais/química , Proteínas Virais/genética , Proteínas Virais/metabolismo
14.
J Struct Biol ; 210(3): 107498, 2020 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-32276087

RESUMO

Cryo-EM Single Particle Analysis workflows require tens of thousands of high-quality particle projections to unveil the three-dimensional structure of macromolecules. Conventional methods for automatic particle picking tend to suffer from high false-positive rates, hampering the reconstruction process. One common cause of this problem is the presence of carbon and different types of high-contrast contaminations. In order to overcome this limitation, we have developed MicrographCleaner, a deep learning package designed to discriminate, in an automated fashion, between regions of micrographs which are suitable for particle picking, and those which are not. MicrographCleaner implements a U-net-like deep learning model trained on a manually curated dataset compiled from over five hundred micrographs. The benchmarking, carried out on approximately one hundred independent micrographs, shows that MicrographCleaner is a very efficient approach for micrograph preprocessing. MicrographCleaner (micrograph_cleaner_em) package is available at PyPI and Anaconda Cloud and also as a Scipion/Xmipp protocol. Source code is available at https://github.com/rsanchezgarc/micrograph_cleaner_em.


Assuntos
Microscopia Crioeletrônica/métodos , Aprendizado Profundo , Algoritmos , Substâncias Macromoleculares/metabolismo , Software
15.
BMC Geriatr ; 20(1): 101, 2020 03 12.
Artigo em Inglês | MEDLINE | ID: mdl-32164542

RESUMO

BACKGROUND: Preventive home visits are suited for patients with reduced mobility, such as older people. Healthcare needs for older patients are expected to increase due to the extended life expectancy estimated in coming years. The implementation of low-cost, patient-centered methodologies may buffer this rise in health care costs without affecting the quality of service. In order to find the best home care model with less investment, this paper describes a study protocol comparing two models of home care for older people. METHODS: We describe a quasi-experimental study that compares the outcome of two different home care models already implemented in two primary care centers in Badalona (Barcelona, Spain). The traditional model (control model) is integrated in the sense that is continuous, the same primary care center team looks after its assigned patients both at the center and in preventive home visits. The new functional home care model (study model), consisting of a highly trained team, is specifically designed to meet patient needs and give total attention to preventive home interventions. The study will start and end on the expected dates, June 2018 to October 2020, and include all patients over 65 years old already enrolled in the home care programs of the primary care centers selected. The primary endpoint assessed will be the difference in hospitalization days between patients included in both home care programs. Other variables regarding health status, quality of care and resource utilization will also be compared between the two models. DISCUSSION: The study in progress will assess whether a functional and highly trained home care team will meet the ever-aging population needs in terms of cost and health outcomes better than a traditional, integrated one. Lessons learned from this pilot study will provide guidelines for a future model of home care based on the IHI Triple Aim: better care, better health, and lower costs. TRIAL REGISTRATION: Registered in ClinicalTrials.gov (Identifier: NCT03461315; March 12, 2018).


Assuntos
Serviços de Assistência Domiciliar , Qualidade de Vida , Idoso , Idoso de 80 Anos ou mais , Protocolos de Ensaio Clínico como Assunto , Feminino , Humanos , Masculino , Projetos Piloto , Equilíbrio Postural , Espanha , Estudos de Tempo e Movimento
16.
Protein Sci ; 29(1): 52-65, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31531901

RESUMO

Analyses of publicly available structural data reveal interesting insights into the impact of the three-dimensional (3D) structures of protein targets important for discovery of new drugs (e.g., G-protein-coupled receptors, voltage-gated ion channels, ligand-gated ion channels, transporters, and E3 ubiquitin ligases). The Protein Data Bank (PDB) archive currently holds > 155,000 atomic-level 3D structures of biomolecules experimentally determined using crystallography, nuclear magnetic resonance spectroscopy, and electron microscopy. The PDB was established in 1971 as the first open-access, digital-data resource in biology, and is now managed by the Worldwide PDB partnership (wwPDB; wwPDB.org). US PDB operations are the responsibility of the Research Collaboratory for Structural Bioinformatics PDB (RCSB PDB). The RCSB PDB serves millions of RCSB.org users worldwide by delivering PDB data integrated with ∼40 external biodata resources, providing rich structural views of fundamental biology, biomedicine, and energy sciences. Recently published work showed that the PDB archival holdings facilitated discovery of ∼90% of the 210 new drugs approved by the US Food and Drug Administration 2010-2016. We review user-driven development of RCSB PDB services, examine growth of the PDB archive in terms of size and complexity, and present examples and opportunities for structure-guided drug discovery for challenging targets (e.g., integral membrane proteins).


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Proteínas/química , Cristalografia , Descoberta de Drogas , Espectroscopia de Ressonância Magnética , Microscopia Eletrônica , Conformação Proteica , Interface Usuário-Computador
17.
Sci Rep ; 9(1): 9487, 2019 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-31263230

RESUMO

Nucleoplasmin (NP) is a pentameric histone chaperone that regulates the condensation state of chromatin in different cellular processes. We focus here on the interaction of NP with the histone octamer, showing that NP could bind sequentially the histone components to assemble an octamer-like particle, and crosslinked octamers with high affinity. The three-dimensional reconstruction of the NP/octamer complex generated by single-particle cryoelectron microscopy, revealed that several intrinsically disordered tail domains of two NP pentamers, facing each other through their distal face, encage the histone octamer in a nucleosome-like conformation and prevent its dissociation. Formation of this complex depended on post-translational modification and exposure of the acidic tract at the tail domain of NP. Finally, NP was capable of transferring the histone octamers to DNA in vitro, assembling nucleosomes. This activity may have biological relevance for processes in which the histone octamer must be rapidly removed from or deposited onto the DNA.


Assuntos
Proteínas Aviárias/química , DNA/química , Histonas/química , Nucleoplasminas/química , Nucleossomos/química , Proteínas de Xenopus/química , Animais , Proteínas Aviárias/metabolismo , Galinhas , DNA/metabolismo , Histonas/metabolismo , Nucleoplasminas/metabolismo , Nucleossomos/metabolismo , Proteínas de Xenopus/metabolismo , Xenopus laevis
18.
Bioinformatics ; 35(18): 3512-3513, 2019 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-30768147

RESUMO

MOTIVATION: Many diseases are associated to single nucleotide polymorphisms that affect critical regions of proteins as binding sites or post translational modifications. Therefore, analysing genomic variants with structural and molecular biology data is a powerful framework in order to elucidate the potential causes of such diseases. RESULTS: A new version of our web framework 3DBIONOTES is presented. This version offers new tools to analyse and visualize protein annotations and genomic variants, including a contingency analysis of variants and amino acid features by means of a Fisher exact test, the integration of a gene annotation viewer to highlight protein features on gene sequences and a protein-protein interaction viewer to display protein annotations at network level. AVAILABILITY AND IMPLEMENTATION: The web server is available at https://3dbionotes.cnb.csic.es. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. CONTACT: Spanish National Institute for Bioinformatics (INB ELIXIR-ES) and Biocomputing Unit, National Centre of Biotechnology (CSIC)/Instruct Image Processing Centre, C/ Darwin nº 3, Campus of Cantoblanco, 28049 Madrid, Spain.


Assuntos
Genômica , Software , Sítios de Ligação , Biologia Computacional , Anotação de Sequência Molecular , Proteínas
19.
Bioinformatics ; 35(3): 470-477, 2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30020406

RESUMO

Motivation: Protein-Protein Interactions (PPI) are essentials for most cellular processes and thus, unveiling how proteins interact is a crucial question that can be better understood by identifying which residues are responsible for the interaction. Computational approaches are orders of magnitude cheaper and faster than experimental ones, leading to proliferation of multiple methods aimed to predict which residues belong to the interface of an interaction. Results: We present BIPSPI, a new machine learning-based method for the prediction of partner-specific PPI sites. Contrary to most binding site prediction methods, the proposed approach takes into account a pair of interacting proteins rather than a single one in order to predict partner-specific binding sites. BIPSPI has been trained employing sequence-based and structural features from both protein partners of each complex compiled in the Protein-Protein Docking Benchmark version 5.0 and in an additional set independently compiled. Also, a version trained only on sequences has been developed. The performance of our approach has been assessed by a leave-one-out cross-validation over different benchmarks, outperforming state-of-the-art methods. Availability and implementation: BIPSPI web server is freely available at http://bipspi.cnb.csic.es. BIPSPI code is available at https://github.com/bioinsilico/BIPSPI. Docker image is available at https://hub.docker.com/r/bioinsilico/bipspi/. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Aprendizado de Máquina , Mapas de Interação de Proteínas , Proteínas/química , Sítios de Ligação , Domínios Proteicos , Análise de Sequência de Proteína
20.
J Struct Biol X ; 1: 100006, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-32647812

RESUMO

The West-Life project (https://about.west-life.eu/) is a Horizon 2020 project funded by the European Commission to provide data processing and data management services for the international community of structural biologists, and in particular to support integrative experimental approaches within the field of structural biology. It has developed enhancements to existing web services for structure solution and analysis, created new pipelines to link these services into more complex higher-level workflows, and added new data management facilities. Through this work it has striven to make the benefits of European e-Infrastructures more accessible to life-science researchers in general and structural biologists in particular.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...