Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 51(D1): D1373-D1380, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36305812

RESUMO

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a popular chemical information resource that serves a wide range of use cases. In the past two years, a number of changes were made to PubChem. Data from more than 120 data sources was added to PubChem. Some major highlights include: the integration of Google Patents data into PubChem, which greatly expanded the coverage of the PubChem Patent data collection; the creation of the Cell Line and Taxonomy data collections, which provide quick and easy access to chemical information for a given cell line and taxon, respectively; and the update of the bioassay data model. In addition, new functionalities were added to the PubChem programmatic access protocols, PUG-REST and PUG-View, including support for target-centric data download for a given protein, gene, pathway, cell line, and taxon and the addition of the 'standardize' option to PUG-REST, which returns the standardized form of an input chemical structure. A significant update was also made to PubChemRDF. The present paper provides an overview of these changes.


Assuntos
Bases de Dados de Compostos Químicos , Descoberta de Drogas , Descoberta de Drogas/métodos , Bioensaio , Proteínas , Quimioinformática
2.
Nucleic Acids Res ; 49(D1): D1388-D1395, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33151290

RESUMO

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a popular chemical information resource that serves the scientific community as well as the general public, with millions of unique users per month. In the past two years, PubChem made substantial improvements. Data from more than 100 new data sources were added to PubChem, including chemical-literature links from Thieme Chemistry, chemical and physical property links from SpringerMaterials, and patent links from the World Intellectual Properties Organization (WIPO). PubChem's homepage and individual record pages were updated to help users find desired information faster. This update involved a data model change for the data objects used by these pages as well as by programmatic users. Several new services were introduced, including the PubChem Periodic Table and Element pages, Pathway pages, and Knowledge panels. Additionally, in response to the coronavirus disease 2019 (COVID-19) outbreak, PubChem created a special data collection that contains PubChem data related to COVID-19 and the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).


Assuntos
COVID-19/prevenção & controle , Bases de Dados de Compostos Químicos , Armazenamento e Recuperação da Informação/estatística & dados numéricos , SARS-CoV-2/isolamento & purificação , Interface Usuário-Computador , COVID-19/epidemiologia , COVID-19/virologia , Descoberta de Drogas/estatística & dados numéricos , Epidemias , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Saúde Pública/estatística & dados numéricos , SARS-CoV-2/fisiologia , Software
3.
Nucleic Acids Res ; 47(D1): D1102-D1109, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30371825

RESUMO

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a key chemical information resource for the biomedical research community. Substantial improvements were made in the past few years. New data content was added, including spectral information, scientific articles mentioning chemicals, and information for food and agricultural chemicals. PubChem released new web interfaces, such as PubChem Target View page, Sources page, Bioactivity dyad pages and Patent View page. PubChem also released a major update to PubChem Widgets and introduced a new programmatic access interface, called PUG-View. This paper describes these new developments in PubChem.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Compostos Químicos , Preparações Farmacêuticas/química , Bibliotecas de Moléculas Pequenas/química , Animais , Bioensaio/métodos , Descoberta de Drogas/métodos , Ensaios de Triagem em Larga Escala/métodos , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Estrutura Molecular , Patentes como Assunto , Relação Estrutura-Atividade
4.
Nucleic Acids Res ; 45(D1): D955-D963, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899599

RESUMO

PubChem's BioAssay database (https://pubchem.ncbi.nlm.nih.gov) has served as a public repository for small-molecule and RNAi screening data since 2004 providing open access of its data content to the community. PubChem accepts data submission from worldwide researchers at academia, industry and government agencies. PubChem also collaborates with other chemical biology database stakeholders with data exchange. With over a decade's development effort, it becomes an important information resource supporting drug discovery and chemical biology research. To facilitate data discovery, PubChem is integrated with all other databases at NCBI. In this work, we provide an update for the PubChem BioAssay database describing several recent development including added sources of research data, redesigned BioAssay record page, new BioAssay classification browser and new features in the Upload system facilitating data sharing.


Assuntos
Bases de Dados de Compostos Químicos , Bases de Dados de Ácidos Nucleicos , Interferência de RNA , Ferramenta de Busca , Bibliotecas de Moléculas Pequenas , Descoberta de Drogas , Regulação da Expressão Gênica/efeitos dos fármacos , Humanos , Software , Interface Usuário-Computador , Navegador
5.
Nucleic Acids Res ; 44(D1): D1202-13, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26400175

RESUMO

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a public repository for information on chemical substances and their biological activities, launched in 2004 as a component of the Molecular Libraries Roadmap Initiatives of the US National Institutes of Health (NIH). For the past 11 years, PubChem has grown to a sizable system, serving as a chemical information resource for the scientific research community. PubChem consists of three inter-linked databases, Substance, Compound and BioAssay. The Substance database contains chemical information deposited by individual data contributors to PubChem, and the Compound database stores unique chemical structures extracted from the Substance database. Biological activity data of chemical substances tested in assay experiments are contained in the BioAssay database. This paper provides an overview of the PubChem Substance and Compound databases, including data sources and contents, data organization, data submission using PubChem Upload, chemical structure standardization, web-based interfaces for textual and non-textual searches, and programmatic access. It also gives a brief description of PubChem3D, a resource derived from theoretical three-dimensional structures of compounds in PubChem, as well as PubChemRDF, Resource Description Framework (RDF)-formatted PubChem data for data sharing, analysis and integration with information contained in other databases.


Assuntos
Bases de Dados de Compostos Químicos , Internet , Estrutura Molecular , Preparações Farmacêuticas/química , Software
6.
Nucleic Acids Res ; 42(Database issue): D1075-82, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24198245

RESUMO

PubChem's BioAssay database (http://pubchem.ncbi.nlm.nih.gov) is a public repository for archiving biological tests of small molecules generated through high-throughput screening experiments, medicinal chemistry studies, chemical biology research and drug discovery programs. In addition, the BioAssay database contains data from high-throughput RNA interference screening aimed at identifying critical genes responsible for a biological process or disease condition. The mission of PubChem is to serve the community by providing free and easy access to all deposited data. To this end, PubChem BioAssay is integrated into the National Center for Biotechnology Information retrieval system, making them searchable by Entrez queries and cross-linked to other biomedical information archived at National Center for Biotechnology Information. Moreover, PubChem BioAssay provides web-based and programmatic tools allowing users to search, access and analyze bioassay test results and metadata. In this work, we provide an update for the PubChem BioAssay resource, such as information content growth, new developments supporting data integration and search, and the recently deployed PubChem Upload to streamline chemical structure and bioassay submissions.


Assuntos
Bases de Dados de Compostos Químicos , Ensaios de Triagem em Larga Escala , Interferência de RNA , Descoberta de Drogas , Genes , Humanos , Internet , Proteínas/genética , Bibliotecas de Moléculas Pequenas , Integração de Sistemas
7.
Biophys J ; 109(6): 1295-306, 2015 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-26213149

RESUMO

Structures of protein complexes provide atomistic insights into protein interactions. Human proteins represent a quarter of all structures in the Protein Data Bank; however, available protein complexes cover less than 10% of the human proteome. Although it is theoretically possible to infer interactions in human proteins based on structures of homologous protein complexes, it is still unclear to what extent protein interactions and binding sites are conserved, and whether protein complexes from remotely related species can be used to infer interactions and binding sites. We considered biological units of protein complexes and clustered protein-protein binding sites into similarity groups based on their structure and sequence, which allowed us to identify unique binding sites. We showed that the growth rate of the number of unique binding sites in the Protein Data Bank was much slower than the growth rate of the number of structural complexes. Next, we investigated the evolutionary roots of unique binding sites and identified the major phyletic branches with the largest expansion in the number of novel binding sites. We found that many binding sites could be traced to the universal common ancestor of all cellular organisms, whereas relatively few binding sites emerged at the major evolutionary branching points. We analyzed the physicochemical properties of unique binding sites and found that the most ancient sites were the largest in size, involved many salt bridges, and were the most compact and least planar. In contrast, binding sites that appeared more recently in the evolution of eukaryotes were characterized by a larger fraction of polar and aromatic residues, and were less compact and more planar, possibly due to their more transient nature and roles in signaling processes.


Assuntos
Sítios de Ligação/genética , Evolução Molecular , Ligação Proteica/genética , Proteínas/genética , Proteínas/metabolismo , Animais , Humanos , Modelos Moleculares
8.
Bull World Health Organ ; 93(8): 566-576D, 2015 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-26478614

RESUMO

OBJECTIVE: To collect, compile and evaluate publicly available national health accounts (NHA) reports produced worldwide between 1996 and 2010. METHODS: We downloaded country-generated NHA reports from the World Health Organization global health expenditure database and the Organisation for Economic Co-operation and Development (OECD) StatExtract website. We also obtained reports from Abt Associates, through contacts in individual countries and through an online search. We compiled data in the four main types used in these reports: (i) financing source; (ii) financing agent; (iii) health function; and (iv) health provider. We combined and adjusted data to conform with OECD's first edition of A system of health accounts manual, (2000). FINDINGS: We identified 872 NHA reports from 117 countries containing a total of 2936 matrices for the four data types. Most countries did not provide complete health expenditure data: only 252 of the 872 reports contained data in all four types. Thirty-eight countries reported an average not-specified-by-kind value greater than 20% for all data types and years. Some countries reported substantial year-on-year changes in both the level and composition of health expenditure that were probably produced by data-generation processes. All study data are publicly available at http://vizhub.healthdata.org/nha/. CONCLUSION: Data from NHA reports on health expenditure are often incomplete and, in some cases, of questionable quality. Better data would help finance ministries allocate resources to health systems, assist health ministries in allocating capital within the health sector and enable researchers to make accurate comparisons between health systems.


Assuntos
Gastos em Saúde/estatística & dados numéricos , Coleta de Dados/métodos , Coleta de Dados/estatística & dados numéricos , Interpretação Estatística de Dados , Bases de Dados Factuais , Saúde Global , Humanos , Organização Mundial da Saúde
9.
EMBO Rep ; 13(3): 266-71, 2012 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-22261719

RESUMO

Although the identification of protein interactions by high-throughput (HTP) methods progresses at a fast pace, 'interactome' data sets still suffer from high rates of false positives and low coverage. To map the human protein interactome, we describe a new framework that uses experimental evidence on structural complexes, the atomic details of binding interfaces and evolutionary conservation. The structurally inferred interaction network is highly modular and more functionally coherent compared with experimental interaction networks derived from multiple literature citations. Moreover, structurally inferred and high-confidence HTP networks complement each other well, allowing us to construct a merged network to generate testable hypotheses and provide valuable experimental leads.


Assuntos
Complexos Multiproteicos/química , Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Sítios de Ligação , Biologia Computacional/métodos , Bases de Dados Genéticas , Humanos , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Software
10.
Nucleic Acids Res ; 40(Database issue): D834-40, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22102591

RESUMO

We have recently developed the Inferred Biomolecular Interaction Server (IBIS) and database, which reports, predicts and integrates different types of interaction partners and locations of binding sites in proteins based on the analysis of homologous structural complexes. Here, we highlight several new IBIS features and options. The server's webpage is now redesigned to allow users easier access to data for different interaction types. An entry page is added to give a quick summary of available results and to now accept protein sequence accessions. To elucidate the formation of protein complexes, not just binary interactions, IBIS currently presents an expandable interaction network. Previously, IBIS provided annotations for four different types of binding partners: proteins, small molecules, nucleic acids and peptides; in the current version a new protein-ion interaction type has been added. Several options provide easy downloads of IBIS data for all Protein Data Bank (PDB) protein chains and the results for each query. In this study, we show that about one-third of all RefSeq sequences can be annotated with IBIS interaction partners and binding sites. The IBIS server is available at http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi and updated biweekly.


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Proteínas/química , Sítios de Ligação , Gráficos por Computador , Íons/química , Anotação de Sequência Molecular , Complexos Multiproteicos/química , Ácidos Nucleicos/química , Peptídeos/química , Análise de Sequência de Proteína , Integração de Sistemas , Interface Usuário-Computador
11.
Nucleic Acids Res ; 40(Database issue): D400-12, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22140110

RESUMO

PubChem (http://pubchem.ncbi.nlm.nih.gov) is a public repository for biological activity data of small molecules and RNAi reagents. The mission of PubChem is to deliver free and easy access to all deposited data, and to provide intuitive data analysis tools. The PubChem BioAssay database currently contains 500,000 descriptions of assay protocols, covering 5000 protein targets, 30,000 gene targets and providing over 130 million bioactivity outcomes. PubChem's bioassay data are integrated into the NCBI Entrez information retrieval system, thus making PubChem data searchable and accessible by Entrez queries. Also, as a repository, PubChem constantly optimizes and develops its deposition system answering many demands of both high- and low-volume depositors. The PubChem information platform allows users to search, review and download bioassay description and data. The PubChem platform also enables researchers to collect, compare and analyze biological test results through web-based and programmatic tools. In this work, we provide an update for the PubChem BioAssay resource, including information content growth, data model extension and new developments of data submission, retrieval, analysis and download tools.


Assuntos
Bases de Dados Factuais , Descoberta de Drogas , Interferência de RNA , Bioensaio , Ensaios de Triagem em Larga Escala , Indicadores e Reagentes , Estrutura Molecular , Software
12.
Acta Ophthalmol ; 102(3): e367-e380, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-37786359

RESUMO

PURPOSE: This study aims to characterize the dependence of measured retinal arterial and venous saturation on vessel diameter and central reflex in retinal oximetry, with an ultimate goal of identifying potential causes and suggesting approaches to improve measurement accuracy. METHODS: In 10 subjects, oxygen saturation, vessel diameter and optical density are obtained using Oxymap Analyzer software without diameter correction. Diameter dependence of saturation is characterized using linear regression between measured values of saturation and diameter. Occurrences of negative values of vessel optical densities (ODs) associated with central vessel reflex are acquired from Oxymap Analyzer. A conceptual model is used to calculate the ratio of optical densities (ODRs) according to retinal reflectance properties and single and double-pass light transmission across fixed path lengths. Model-predicted values are compared with measured oximetry values at different vessel diameters. RESULTS: Venous saturation shows an inverse relationship with vessel diameter (D) across subjects, with a mean slope of -0.180 (SE = 0.022) %/µm (20 < D < 180 µm) and a more rapid saturation increase at small vessel diameters reaching to over 80%. Arterial saturation yields smaller positive and negative slopes in individual subjects, with an average of -0.007 (SE = 0.021) %/µm (20 < D < 200 µm) across all subjects. Measurements where vessel brightness exceeds that of the retinal background result in negative values of optical density, causing an artifactual increase in saturation. Optimization of model reflectance values produces a good fit of the conceptual model to measured ODRs. CONCLUSION: Measurement artefacts in retinal oximetry are caused by strong central vessel reflections, and apparent diameter sensitivity may result from single and double-pass transmission in vessels. Improvement in correction for vessel diameter is indicated for arteries however further study is necessary for venous corrections.


Assuntos
Oximetria , Oxigênio , Humanos , Retina/diagnóstico por imagem , Vasos Retinianos/diagnóstico por imagem , Reflexo
13.
Int J Cardiol Heart Vasc ; 52: 101381, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38854746

RESUMO

Cardiovascular disease is the leading cause of morbidity and mortality in patients with autoimmune rheumatic diseases. Much of this may be attributed to systemic inflammation resulting in coronary atherosclerosis and myocarditis. Cardiac magnetic resonance imaging is the gold standard for the evaluation of cardiac structure and function, including tissue characterization, which allows for detection of myocardial edema, inflammation, and fibrosis. Advances in parametric mapping and coronary flow reserve measurement techniques have the potential to change the diagnosis, risk stratification, and management of patients with autoimmune rheumatic diseases. We provide an overview of the current evidence and suggest potential future roles for the use of comprehensive cardiac magnetic resonance in patients with autoimmune rheumatic diseases in the field of cardio-rheumatology.

14.
Circ Arrhythm Electrophysiol ; 17(1): e012072, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-38099441

RESUMO

Although there is consensus on the management of patients with Brugada Syndrome with high risk for sudden cardiac arrest, asymptomatic or intermediate-risk patients present clinical management challenges. This document explores the management opinions of experts throughout the world for patients with Brugada Syndrome who do not fit guideline recommendations. Four real-world clinical scenarios were presented with commentary from small expert groups for each case. All authors voted on case-specific questions to evaluate the level of consensus among the entire group in nuanced diagnostic and management decisions relevant to each case. Points of agreement, points of controversy, and gaps in knowledge are highlighted.


Assuntos
Síndrome de Brugada , Parada Cardíaca , Humanos , Síndrome de Brugada/diagnóstico , Síndrome de Brugada/terapia , Eletrocardiografia , Parada Cardíaca/diagnóstico , Parada Cardíaca/terapia , Morte Súbita Cardíaca/etiologia , Morte Súbita Cardíaca/prevenção & controle , Consenso
15.
Environ Sci Process Impacts ; 25(11): 1788-1801, 2023 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-37431591

RESUMO

The term "exposome" is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.


Assuntos
Exposição Ambiental , Software , Humanos , Espectrometria de Massas/métodos , Exposição Ambiental/análise , Bases de Dados Factuais
16.
Nucleic Acids Res ; 38(Database issue): D283-7, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19906708

RESUMO

Most of the proteins in a cell assemble into complexes to carry out their function. In this work, we have created a new database (named ComSin) of protein structures in bound (complex) and unbound (single) states to provide a researcher with exhaustive information on structures of the same or homologous proteins in bound and unbound states. From the complete Protein Data Bank (PDB), we selected 24 910 pairs of protein structures in bound and unbound states, and identified regions of intrinsic disorder. For 2448 pairs, the proteins in bound and unbound states are identical, while 7129 pairs have sequence identity 90% or larger. The developed server enables one to search for proteins in bound and unbound states with several options including sequence similarity between the corresponding proteins in bound and unbound states, and validation of interaction interfaces of protein complexes. Besides that, through our web server, one can obtain necessary information for studying disorder-to-order and order-to-disorder transitions upon complex formation, and analyze structural differences between proteins in bound and unbound states. The database is available at http://antares.protres.ru/comsin/.


Assuntos
Proteínas de Bactérias/química , Biologia Computacional/métodos , Bases de Dados Genéticas , Animais , Biologia Computacional/tendências , Bases de Dados de Proteínas , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Modelos Moleculares , Conformação Molecular , Ligação Proteica , Conformação Proteica , Estrutura Terciária de Proteína , Software
17.
Nucleic Acids Res ; 38(Database issue): D255-66, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19933261

RESUMO

The PubChem BioAssay database (http://pubchem.ncbi.nlm.nih.gov) is a public repository for biological activities of small molecules and small interfering RNAs (siRNAs) hosted by the US National Institutes of Health (NIH). It archives experimental descriptions of assays and biological test results and makes the information freely accessible to the public. A PubChem BioAssay data entry includes an assay description, a summary and detailed test results. Each assay record is linked to the molecular target, whenever possible, and is cross-referenced to other National Center for Biotechnology Information (NCBI) database records. 'Related BioAssays' are identified by examining the assay target relationship and activity profile of commonly tested compounds. A key goal of PubChem BioAssay is to make the biological activity information easily accessible through the NCBI information retrieval system-Entrez, and various web-based PubChem services. An integrated suite of data analysis tools are available to optimize the utility of the chemical structure and biological activity information within PubChem, enabling researchers to aggregate, compare and analyze biological test results contributed by multiple organizations. In this work, we describe the PubChem BioAssay database, including data model, bioassay deposition and utilities that PubChem provides for searching, downloading and analyzing the biological activity information contained therein.


Assuntos
Bioensaio , Biologia Computacional/métodos , Bases de Dados Factuais , Dicionários Químicos como Assunto , Animais , Biologia Computacional/tendências , Bases de Dados de Proteínas , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , National Library of Medicine (U.S.) , Preparações Farmacêuticas/química , Farmacologia , Software , Relação Estrutura-Atividade , Estados Unidos
18.
Nucleic Acids Res ; 38(Database issue): D518-24, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19843613

RESUMO

IBIS is the NCBI Inferred Biomolecular Interaction Server. This server organizes, analyzes and predicts interaction partners and locations of binding sites in proteins. IBIS provides annotations for different types of binding partners (protein, chemical, nucleic acid and peptides), and facilitates the mapping of a comprehensive biomolecular interaction network for a given protein query. IBIS reports interactions observed in experimentally determined structural complexes of a given protein, and at the same time IBIS infers binding sites/interacting partners by inspecting protein complexes formed by homologous proteins. Similar binding sites are clustered together based on their sequence and structure conservation. To emphasize biologically relevant binding sites, several algorithms are used for verification in terms of evolutionary conservation, biological importance of binding partners, size and stability of interfaces, as well as evidence from the published literature. IBIS is updated regularly and is freely accessible via http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.html.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas/métodos , Estrutura Terciária de Proteína , Algoritmos , Animais , Sítios de Ligação , Domínio Catalítico , Análise por Conglomerados , Biologia Computacional/tendências , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Proteínas Tirosina Quinases/química , Software
19.
Nat Genet ; 54(3): 263-273, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35256806

RESUMO

Analyses of data from genome-wide association studies on unrelated individuals have shown that, for human traits and diseases, approximately one-third to two-thirds of heritability is captured by common SNPs. However, it is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular whether the causal variants are rare, or whether it is overestimated due to bias in inference from pedigree data. Here we estimated heritability for height and body mass index (BMI) from whole-genome sequence data on 25,465 unrelated individuals of European ancestry. The estimated heritability was 0.68 (standard error 0.10) for height and 0.30 (standard error 0.10) for body mass index. Low minor allele frequency variants in low linkage disequilibrium (LD) with neighboring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection. Our results imply that rare variants, in particular those in regions of low linkage disequilibrium, are a major source of the still missing heritability of complex traits and disease.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , Alelos , Estudo de Associação Genômica Ampla/métodos , Humanos , Desequilíbrio de Ligação , Herança Multifatorial/genética , Polimorfismo de Nucleotídeo Único/genética
20.
BMC Bioinformatics ; 11: 365, 2010 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-20594344

RESUMO

BACKGROUND: The study of protein-small molecule interactions is vital for understanding protein function and for practical applications in drug discovery. To benefit from the rapidly increasing structural data, it is essential to improve the tools that enable large scale binding site prediction with greater emphasis on their biological validity. RESULTS: We have developed a new method for the annotation of protein-small molecule binding sites, using inference by homology, which allows us to extend annotation onto protein sequences without experimental data available. To ensure biological relevance of binding sites, our method clusters similar binding sites found in homologous protein structures based on their sequence and structure conservation. Binding sites which appear evolutionarily conserved among non-redundant sets of homologous proteins are given higher priority. After binding sites are clustered, position specific score matrices (PSSMs) are constructed from the corresponding binding site alignments. Together with other measures, the PSSMs are subsequently used to rank binding sites to assess how well they match the query and to better gauge their biological relevance. The method also facilitates a succinct and informative representation of observed and inferred binding sites from homologs with known three-dimensional structures, thereby providing the means to analyze conservation and diversity of binding modes. Furthermore, the chemical properties of small molecules bound to the inferred binding sites can be used as a starting point in small molecule virtual screening. The method was validated by comparison to other binding site prediction methods and to a collection of manually curated binding site annotations. We show that our method achieves a sensitivity of 72% at predicting biologically relevant binding sites and can accurately discriminate those sites that bind biological small molecules from non-biological ones. CONCLUSIONS: A new algorithm has been developed to predict binding sites with high accuracy in terms of their biological validity. It also provides a common platform for function prediction, knowledge-based docking and for small molecule virtual screening. The method can be applied even for a query sequence without structure. The method is available at http://www.ncbi.nlm.nih.gov/Structure/ibis/ibis.cgi.


Assuntos
Algoritmos , Sítios de Ligação , Proteínas/química , Proteínas/metabolismo , Sequência de Aminoácidos , Análise por Conglomerados , Bases de Conhecimento , Ligação Proteica , Análise de Sequência de Proteína , Homologia Estrutural de Proteína
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA