Your browser doesn't support javascript.
loading
Protein Ontology (PRO): enhancing and scaling up the representation of protein entities.
Natale, Darren A; Arighi, Cecilia N; Blake, Judith A; Bona, Jonathan; Chen, Chuming; Chen, Sheng-Chih; Christie, Karen R; Cowart, Julie; D'Eustachio, Peter; Diehl, Alexander D; Drabkin, Harold J; Duncan, William D; Huang, Hongzhan; Ren, Jia; Ross, Karen; Ruttenberg, Alan; Shamovsky, Veronica; Smith, Barry; Wang, Qinghua; Zhang, Jian; El-Sayed, Abdelrahman; Wu, Cathy H.
Afiliação
  • Natale DA; Protein Information Resource, Georgetown University Medical Center, Washington, DC 20007, USA dan5@georgetown.edu.
  • Arighi CN; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Blake JA; The Jackson Laboratory, Bar Harbor, ME 04609, USA.
  • Bona J; Oral Diagnostic Sciences, University at Buffalo School of Dental Medicine, Buffalo, NY 14214, USA.
  • Chen C; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Chen SC; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Christie KR; The Jackson Laboratory, Bar Harbor, ME 04609, USA.
  • Cowart J; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • D'Eustachio P; Department of Biochemistry & Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA.
  • Diehl AD; Department of Neurology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, NY 14203, USA.
  • Drabkin HJ; New York State Center of Excellence in Bioinformatics and Life Sciences, University at Buffalo, Buffalo, NY 14203, USA.
  • Duncan WD; The Jackson Laboratory, Bar Harbor, ME 04609, USA.
  • Huang H; Roswell Park Cancer Institute, Buffalo, NY 14203, USA.
  • Ren J; New York State Center of Excellence in Bioinformatics and Life Sciences, University at Buffalo, Buffalo, NY 14203, USA.
  • Ross K; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Ruttenberg A; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Shamovsky V; Protein Information Resource, Georgetown University Medical Center, Washington, DC 20007, USA.
  • Smith B; Oral Diagnostic Sciences, University at Buffalo School of Dental Medicine, Buffalo, NY 14214, USA.
  • Wang Q; Department of Biochemistry & Molecular Pharmacology, NYU School of Medicine, New York, NY 10016, USA.
  • Zhang J; National Center for Ontological Research, University at Buffalo, Buffalo, NY 14214, USA.
  • El-Sayed A; Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE 19711, USA.
  • Wu CH; Protein Information Resource, Georgetown University Medical Center, Washington, DC 20007, USA.
Nucleic Acids Res ; 45(D1): D339-D346, 2017 01 04.
Article em En | MEDLINE | ID: mdl-27899649
ABSTRACT
The Protein Ontology (PRO; http//purl.obolibrary.org/obo/pr) formally defines and describes taxon-specific and taxon-neutral protein-related entities in three major areas proteins related by evolution; proteins produced from a given gene; and protein-containing complexes. PRO thus serves as a tool for referencing protein entities at any level of specificity. To enhance this ability, and to facilitate the comparison of such entities described in different resources, we developed a standardized representation of proteoforms using UniProtKB as a sequence reference and PSI-MOD as a post-translational modification reference. We illustrate its use in facilitating an alignment between PRO and Reactome protein entities. We also address issues of scalability, describing our first steps into the use of text mining to identify protein-related entities, the large-scale import of proteoform information from expert curated resources, and our ability to dynamically generate PRO terms. Web views for individual terms are now more informative about closely-related terms, including for example an interactive multiple sequence alignment. Finally, we describe recent improvement in semantic utility, with PRO now represented in OWL and as a SPARQL endpoint. These developments will further support the anticipated growth of PRO and facilitate discoverability of and allow aggregation of data relating to protein entities.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Animals / Humans Idioma: En Ano de publicação: 2017 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Animals / Humans Idioma: En Ano de publicação: 2017 Tipo de documento: Article