1.
Am J Hum Genet ; 102(1): 116-132, 2018 Jan 04.
Article in English | MEDLINE | ID: mdl-29290337

ABSTRACT

Whole-exome and targeted sequencing of 13 individuals from 10 unrelated families with overlapping clinical manifestations identified loss-of-function and missense variants in KIAA1109 allowing delineation of an autosomal-recessive multi-system syndrome, which we suggest to name Alkuraya-Kucinskas syndrome (MIM 617822). Shared phenotypic features representing the cardinal characteristics of this syndrome combine brain atrophy with clubfoot and arthrogryposis. Affected individuals present with cerebral parenchymal underdevelopment, ranging from major cerebral parenchymal thinning with lissencephalic aspect to moderate parenchymal rarefaction, severe to mild ventriculomegaly, cerebellar hypoplasia with brainstem dysgenesis, and cardiac and ophthalmologic anomalies, such as microphthalmia and cataract. Severe loss-of-function cases were incompatible with life, whereas those individuals with milder missense variants presented with severe global developmental delay, syndactyly of 2nd and 3rd toes, and severe muscle hypotonia resulting in incapacity to stand without support. Consistent with a causative role for KIAA1109 loss-of-function/hypomorphic variants in this syndrome, knockdowns of the zebrafish orthologous gene resulted in embryos with hydrocephaly and abnormally curved notochords and overall body shape, whereas published knockouts of the fruit fly and mouse orthologous genes resulted in lethality or severe neurological defects reminiscent of the probands' features.


Subjects
Arthrogryposis/genetics , Brain/embryology , Mutation/genetics , Proteins/genetics , Adolescent , Animals , Brain/diagnostic imaging , Brain/pathology , Child , Female , Gene Knockdown Techniques , Humans , Infant , Infant, Newborn , Magnetic Resonance Imaging , Male , Pedigree , Zebrafish , Zebrafish Proteins/genetics
2.
Bioinformatics ; 33(21): 3454-3460, 2017 Nov 01.
Article in English | MEDLINE | ID: mdl-29036270

ABSTRACT

MOTIVATION: Biological knowledgebases, such as UniProtKB/Swiss-Prot, constitute an essential component of daily scientific research by offering distilled, summarized and computable knowledge extracted from the literature by expert curators. While knowledgebases play an increasingly important role in the scientific community, their ability to keep up with the growth of biomedical literature is under scrutiny. Using UniProtKB/Swiss-Prot as a case study, we address this concern via multiple literature triage approaches. RESULTS: With the assistance of the PubTator text-mining tool, we tagged more than 10 000 articles to assess the ratio of papers relevant for curation. We first show that curators read and evaluate many more papers than they curate, and that measuring the number of curated publications is insufficient to provide a complete picture as demonstrated by the fact that 8000-10 000 papers are curated in UniProt each year while curators evaluate 50 000-70 000 papers per year. We show that 90% of the papers in PubMed are out of the scope of UniProt, that a maximum of 2-3% of the papers indexed in PubMed each year are relevant for UniProt curation, and that, despite appearances, expert curation in UniProt is scalable. AVAILABILITY AND IMPLEMENTATION: UniProt is freely available at http://www.uniprot.org/. CONTACT: sylvain.poux@sib.swiss. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
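As a rough illustration of the figures quoted in this abstract, the following minimal Python sketch estimates what share of the papers evaluated by curators is actually curated each year. The variable and function names are illustrative assumptions, not taken from the paper, and the bounds simply combine the extremes of the two quoted ranges.

```python
# Yearly figures quoted in the abstract, as (low, high) ranges.
curated = (8_000, 10_000)      # papers curated in UniProt per year
evaluated = (50_000, 70_000)   # papers evaluated by curators per year


def ratio_bounds(numerator, denominator):
    """Return the (lowest, highest) possible ratio of two (low, high) ranges."""
    lowest = numerator[0] / denominator[1]   # fewest curated, most evaluated
    highest = numerator[1] / denominator[0]  # most curated, fewest evaluated
    return lowest, highest


low, high = ratio_bounds(curated, evaluated)
print(f"Curated share of evaluated papers: {low:.1%} to {high:.1%}")
```

Under these assumptions, roughly 11% to 20% of the evaluated papers end up curated, which is consistent with the abstract's point that counting curated publications alone understates curator workload.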


Subjects
Data Curation , Databases, Protein , Data Curation/statistics & numerical data , Data Mining , Databases, Protein/statistics & numerical data , Humans , Knowledge Bases , PubMed/statistics & numerical data , Review Literature as Topic , Statistics as Topic
3.
Nucleic Acids Res ; 43(Database issue): D479-84, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25313161

ABSTRACT

The IntAct molecular interaction database has created a new, free, open-source, manually curated resource, the Complex Portal (www.ebi.ac.uk/intact/complex), through which protein complexes from major model organisms are being collated and made available for search, viewing and download. It has been built in close collaboration with other bioinformatics services and populated with data from ChEMBL, MatrixDB, PDBe, Reactome and UniProtKB. Each entry contains information about the participating molecules (including small molecules and nucleic acids), their stoichiometry, topology and structural assembly. Complexes are annotated with details about their function, properties and complex-specific Gene Ontology (GO) terms. Consistent nomenclature is used throughout the resource with systematic names, recommended names and a list of synonyms all provided. The use of the Evidence Code Ontology allows us to indicate for which entries direct experimental evidence is available or if the complex has been inferred based on homology or orthology. The data are searchable using standard identifiers, such as UniProt, ChEBI and GO IDs, protein, gene and complex names or synonyms. This reference resource will be maintained and grow to encompass an increasing number of organisms. Input from groups and individuals with specific areas of expertise is welcome.


Subjects
Databases, Protein , Proteins/chemistry , Animals , Binding Sites , Humans , Internet , Macromolecular Substances/chemistry , Mice , Protein Binding , Proteins/genetics , Proteins/metabolism
4.
Nucleic Acids Res ; 42(Database issue): D358-63, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24234451

ABSTRACT

IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).


Subjects
Databases, Protein , Protein Interaction Mapping , Internet , Software
5.
Nat Methods ; 9(4): 345-50, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22453911

ABSTRACT

The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.


Subjects
Databases, Protein , Protein Interaction Mapping , Proteins/metabolism , Periodicals as Topic , Protein Binding , Proteins/chemistry , Quality Control
6.
Nucleic Acids Res ; 40(Database issue): D841-6, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22121220

ABSTRACT

IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275,000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.


Subjects
Databases, Protein , Protein Interaction Mapping , Computer Graphics , Genes , Internet , Molecular Sequence Annotation , Sequence Analysis, Protein , Software
7.
Nucleic Acids Res ; 40(Database issue): D565-70, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22123736

ABSTRACT

The GO annotation dataset provided by the UniProt Consortium (GOA: http://www.ebi.ac.uk/GOA) is a comprehensive set of evidence-based associations between terms from the Gene Ontology resource and UniProtKB proteins. Currently supplying over 100 million annotations to 11 million proteins in more than 360,000 taxa, this resource has increased 2-fold over the last 2 years and has benefited from a wealth of checks to improve annotation correctness and consistency as well as now supplying a greater information content enabled by GO Consortium annotation format developments. Detailed, manual GO annotations obtained from the curation of peer-reviewed papers are directly contributed by all UniProt curators and supplemented with manual and electronic annotations from 36 model organism and domain-focused scientific resources. The inclusion of high-quality, automatic annotation predictions ensures the UniProt GO annotation dataset supplies functional information to a wide range of proteins, including those from poorly characterized, non-model organism species. UniProt GO annotations are freely available in a range of formats accessible by both file downloads and web-based views. In addition, the introduction of a new, normalized file format in 2010 has made for easier handling of the complete UniProt-GOA data set.


Subjects
Databases, Protein , Molecular Sequence Annotation , Vocabulary, Controlled , Molecular Sequence Annotation/standards
8.
Nat Biotechnol ; 22(2): 177-83, 2004 Feb.
Article in English | MEDLINE | ID: mdl-14755292

ABSTRACT

A major goal of proteomics is the complete description of the protein interaction network underlying cell physiology. A large number of small scale and, more recently, large-scale experiments have contributed to expanding our understanding of the nature of the interaction network. However, the necessary data integration across experiments is currently hampered by the fragmentation of publicly available protein interaction data, which exists in different formats in databases, on authors' websites or sometimes only in print publications. Here, we propose a community standard data model for the representation and exchange of protein interaction data. This data model has been jointly developed by members of the Proteomics Standards Initiative (PSI), a work group of the Human Proteome Organization (HUPO), and is supported by major protein interaction data providers, in particular the Biomolecular Interaction Network Database (BIND), Cellzome (Heidelberg, Germany), the Database of Interacting Proteins (DIP), Dana Farber Cancer Institute (Boston, MA, USA), the Human Protein Reference Database (HPRD), Hybrigenics (Paris, France), the European Bioinformatics Institute's (EMBL-EBI, Hinxton, UK) IntAct, the Molecular Interactions (MINT, Rome, Italy) database, the Protein-Protein Interaction Database (PPID, Edinburgh, UK) and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING, EMBL, Heidelberg, Germany).


Subjects
Database Management Systems/standards , Databases, Protein/standards , Information Storage and Retrieval/standards , Protein Interaction Mapping/standards , Proteins/classification , Proteomics/standards , User-Computer Interface , Guidelines as Topic , Information Storage and Retrieval/methods , Internationality , Natural Language Processing , Protein Binding , Protein Interaction Mapping/methods , Proteins/chemistry , Proteins/genetics , Proteins/metabolism , Proteomics/methods , Reference Standards , Software
9.
Nucleic Acids Res ; 32(Database issue): D452-5, 2004 Jan 01.
Article in English | MEDLINE | ID: mdl-14681455

ABSTRACT

IntAct provides an open source database and toolkit for the storage, presentation and analysis of protein interactions. The web interface provides both textual and graphical representations of protein interactions, and allows exploring interaction networks in the context of the GO annotations of the interacting proteins. A web service allows direct computational access to retrieve interaction networks in XML format. IntAct currently contains approximately 2200 binary and complex interactions imported from the literature and curated in collaboration with the Swiss-Prot team, making intensive use of controlled vocabularies to ensure data consistency. All IntAct software, data and controlled vocabularies are available at http://www.ebi.ac.uk/intact.


Subjects
Databases, Protein , Protein Binding , Proteins/metabolism , Animals , Computational Biology , Humans , Information Storage and Retrieval , Internet , Software , User-Computer Interface , Vocabulary, Controlled
10.
C R Biol ; 328(10-11): 882-99, 2005.
Article in English | MEDLINE | ID: mdl-16286078

ABSTRACT

We all know that the dogma 'one gene, one protein' is obsolete. A functional protein and, likewise, a protein's ultimate function depend not only on the underlying genetic information but also on the ongoing conditions of the cellular system. Frequently the transcript, like the polypeptide, is processed in multiple ways, but only one or a few out of a multitude of possible variants are produced at a time. An overview on processes that can lead to sequence variety and structural diversity in eukaryotes is given. The UniProtKB/Swiss-Prot protein knowledgebase provides a wealth of information regarding protein variety, function and associated disorders. Examples for such annotation are shown and further ones are available at http://www.expasy.org/sprot/tutorial/examples_CRB.


Subjects
Knowledge Bases , Proteins/chemistry , Amino Acid Sequence , Molecular Sequence Data , Protein Folding