1.
Am J Hum Genet ; 102(1): 116-132, 2018 Jan 04.
Article in English | MEDLINE | ID: mdl-29290337

ABSTRACT

Whole-exome and targeted sequencing of 13 individuals from 10 unrelated families with overlapping clinical manifestations identified loss-of-function and missense variants in KIAA1109, allowing delineation of an autosomal-recessive multi-system syndrome, which we propose to name Alkuraya-Kucinskas syndrome (MIM 617822). Shared phenotypic features representing the cardinal characteristics of this syndrome combine brain atrophy with clubfoot and arthrogryposis. Affected individuals present with cerebral parenchymal underdevelopment, ranging from major cerebral parenchymal thinning with lissencephalic aspect to moderate parenchymal rarefaction, severe to mild ventriculomegaly, cerebellar hypoplasia with brainstem dysgenesis, and cardiac and ophthalmologic anomalies, such as microphthalmia and cataract. Severe loss-of-function cases were incompatible with life, whereas those individuals with milder missense variants presented with severe global developmental delay, syndactyly of 2nd and 3rd toes, and severe muscle hypotonia resulting in incapacity to stand without support. Consistent with a causative role for KIAA1109 loss-of-function/hypomorphic variants in this syndrome, knockdowns of the zebrafish orthologous gene resulted in embryos with hydrocephaly and abnormally curved notochords and overall body shape, whereas published knockouts of the fruit fly and mouse orthologous genes resulted in lethality or severe neurological defects reminiscent of the probands' features.


Subject(s)
Arthrogryposis/genetics , Brain/embryology , Mutation/genetics , Proteins/genetics , Adolescent , Animals , Brain/diagnostic imaging , Brain/pathology , Child , Female , Gene Knockdown Techniques , Humans , Infant , Infant, Newborn , Magnetic Resonance Imaging , Male , Pedigree , Zebrafish , Zebrafish Proteins/genetics
2.
Bioinformatics ; 33(21): 3454-3460, 2017 Nov 01.
Article in English | MEDLINE | ID: mdl-29036270

ABSTRACT

MOTIVATION: Biological knowledgebases, such as UniProtKB/Swiss-Prot, constitute an essential component of daily scientific research by offering distilled, summarized and computable knowledge extracted from the literature by expert curators. While knowledgebases play an increasingly important role in the scientific community, their ability to keep up with the growth of biomedical literature is under scrutiny. Using UniProtKB/Swiss-Prot as a case study, we address this concern via multiple literature triage approaches. RESULTS: With the assistance of the PubTator text-mining tool, we tagged more than 10 000 articles to assess the ratio of papers relevant for curation. We first show that curators read and evaluate many more papers than they curate, and that measuring the number of curated publications is insufficient to provide a complete picture as demonstrated by the fact that 8000-10 000 papers are curated in UniProt each year while curators evaluate 50 000-70 000 papers per year. We show that 90% of the papers in PubMed are out of the scope of UniProt, that a maximum of 2-3% of the papers indexed in PubMed each year are relevant for UniProt curation, and that, despite appearances, expert curation in UniProt is scalable. AVAILABILITY AND IMPLEMENTATION: UniProt is freely available at http://www.uniprot.org/. CONTACT: sylvain.poux@sib.swiss. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Data Curation , Databases, Protein , Data Curation/statistics & numerical data , Data Mining , Databases, Protein/statistics & numerical data , Humans , Knowledge Bases , PubMed/statistics & numerical data , Review Literature as Topic , Statistics as Topic
3.
Nucleic Acids Res ; 43(Database issue): D479-84, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25313161

ABSTRACT

The IntAct molecular interaction database has created a new, free, open-source, manually curated resource, the Complex Portal (www.ebi.ac.uk/intact/complex), through which protein complexes from major model organisms are being collated and made available for search, viewing and download. It has been built in close collaboration with other bioinformatics services and populated with data from ChEMBL, MatrixDB, PDBe, Reactome and UniProtKB. Each entry contains information about the participating molecules (including small molecules and nucleic acids), their stoichiometry, topology and structural assembly. Complexes are annotated with details about their function, properties and complex-specific Gene Ontology (GO) terms. Consistent nomenclature is used throughout the resource with systematic names, recommended names and a list of synonyms all provided. The use of the Evidence Code Ontology allows us to indicate for which entries direct experimental evidence is available or if the complex has been inferred based on homology or orthology. The data are searchable using standard identifiers, such as UniProt, ChEBI and GO IDs, protein, gene and complex names or synonyms. This reference resource will be maintained and grow to encompass an increasing number of organisms. Input from groups and individuals with specific areas of expertise is welcome.


Subject(s)
Databases, Protein , Proteins/chemistry , Animals , Binding Sites , Humans , Internet , Macromolecular Substances/chemistry , Mice , Protein Binding , Proteins/genetics , Proteins/metabolism
4.
Nucleic Acids Res ; 42(Database issue): D358-63, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24234451

ABSTRACT

IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).


Subject(s)
Databases, Protein , Protein Interaction Mapping , Internet , Software
5.
Nat Methods ; 9(4): 345-50, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22453911

ABSTRACT

The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.


Subject(s)
Databases, Protein , Protein Interaction Mapping , Proteins/metabolism , Periodicals as Topic , Protein Binding , Proteins/chemistry , Quality Control
6.
Nucleic Acids Res ; 40(Database issue): D565-70, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22123736

ABSTRACT

The GO annotation dataset provided by the UniProt Consortium (GOA: http://www.ebi.ac.uk/GOA) is a comprehensive set of evidence-based associations between terms from the Gene Ontology resource and UniProtKB proteins. Currently supplying over 100 million annotations to 11 million proteins in more than 360,000 taxa, this resource has increased 2-fold over the last 2 years and has benefited from a wealth of checks to improve annotation correctness and consistency as well as now supplying a greater information content enabled by GO Consortium annotation format developments. Detailed, manual GO annotations obtained from the curation of peer-reviewed papers are directly contributed by all UniProt curators and supplemented with manual and electronic annotations from 36 model organism and domain-focused scientific resources. The inclusion of high-quality, automatic annotation predictions ensures the UniProt GO annotation dataset supplies functional information to a wide range of proteins, including those from poorly characterized, non-model organism species. UniProt GO annotations are freely available in a range of formats accessible by both file downloads and web-based views. In addition, the introduction of a new, normalized file format in 2010 has made for easier handling of the complete UniProt-GOA data set.


Subject(s)
Databases, Protein , Molecular Sequence Annotation , Vocabulary, Controlled , Molecular Sequence Annotation/standards
7.
Nucleic Acids Res ; 40(Database issue): D841-6, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22121220

ABSTRACT

IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As of September 2011, IntAct contains approximately 275,000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.


Subject(s)
Databases, Protein , Protein Interaction Mapping , Computer Graphics , Genes , Internet , Molecular Sequence Annotation , Sequence Analysis, Protein , Software
8.
C R Biol ; 328(10-11): 882-99, 2005.
Article in English | MEDLINE | ID: mdl-16286078

ABSTRACT

We all know that the dogma 'one gene, one protein' is obsolete. A functional protein and, likewise, a protein's ultimate function depend not only on the underlying genetic information but also on the ongoing conditions of the cellular system. Frequently the transcript, like the polypeptide, is processed in multiple ways, but only one or a few out of a multitude of possible variants are produced at a time. An overview of processes that can lead to sequence variety and structural diversity in eukaryotes is given. The UniProtKB/Swiss-Prot protein knowledgebase provides a wealth of information regarding protein variety, function and associated disorders. Examples of such annotation are shown and further ones are available at http://www.expasy.org/sprot/tutorial/examples_CRB.


Subject(s)
Knowledge Bases , Proteins/chemistry , Amino Acid Sequence , Molecular Sequence Data , Protein Folding
9.
Nat Biotechnol ; 22(2): 177-83, 2004 Feb.
Article in English | MEDLINE | ID: mdl-14755292

ABSTRACT

A major goal of proteomics is the complete description of the protein interaction network underlying cell physiology. A large number of small-scale and, more recently, large-scale experiments have contributed to expanding our understanding of the nature of the interaction network. However, the necessary data integration across experiments is currently hampered by the fragmentation of publicly available protein interaction data, which exists in different formats in databases, on authors' websites or sometimes only in print publications. Here, we propose a community standard data model for the representation and exchange of protein interaction data. This data model has been jointly developed by members of the Proteomics Standards Initiative (PSI), a work group of the Human Proteome Organization (HUPO), and is supported by major protein interaction data providers, in particular the Biomolecular Interaction Network Database (BIND), Cellzome (Heidelberg, Germany), the Database of Interacting Proteins (DIP), Dana Farber Cancer Institute (Boston, MA, USA), the Human Protein Reference Database (HPRD), Hybrigenics (Paris, France), the European Bioinformatics Institute's (EMBL-EBI, Hinxton, UK) IntAct, the Molecular Interactions (MINT, Rome, Italy) database, the Protein-Protein Interaction Database (PPID, Edinburgh, UK) and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING, EMBL, Heidelberg, Germany).


Subject(s)
Database Management Systems/standards , Databases, Protein/standards , Information Storage and Retrieval/standards , Protein Interaction Mapping/standards , Proteins/classification , Proteomics/standards , User-Computer Interface , Guidelines as Topic , Information Storage and Retrieval/methods , Internationality , Natural Language Processing , Protein Binding , Protein Interaction Mapping/methods , Proteins/chemistry , Proteins/genetics , Proteins/metabolism , Proteomics/methods , Reference Standards , Software
10.
Nucleic Acids Res ; 32(Database issue): D452-5, 2004 Jan 01.
Article in English | MEDLINE | ID: mdl-14681455

ABSTRACT

IntAct provides an open source database and toolkit for the storage, presentation and analysis of protein interactions. The web interface provides both textual and graphical representations of protein interactions, and allows exploring interaction networks in the context of the GO annotations of the interacting proteins. A web service allows direct computational access to retrieve interaction networks in XML format. IntAct currently contains approximately 2200 binary and complex interactions imported from the literature and curated in collaboration with the Swiss-Prot team, making intensive use of controlled vocabularies to ensure data consistency. All IntAct software, data and controlled vocabularies are available at http://www.ebi.ac.uk/intact.


Subject(s)
Databases, Protein , Protein Binding , Proteins/metabolism , Animals , Computational Biology , Humans , Information Storage and Retrieval , Internet , Software , User-Computer Interface , Vocabulary, Controlled