Your browser doesn't support javascript.
loading
Enhancing the interoperability of glycan data flow between ChEBI, PubChem and GlyGen.
Navelkar, Rahi; Owen, Gareth; Mutherkrishnan, Venkatesh; Thiessen, Paul; Cheng, Tiejun; Bolton, Evan; Edwards, Nathan; Tiemeyer, Michael; Campbell, Matthew P; Martin, Maria; Vora, Jeet; Kahsay, Robel; Mazumder, Raja.
Afiliación
  • Navelkar R; The Department of Biochemistry and Molecular Biology, George Washington University Medical Center, 2300 I St NW, Washington DC 20052, USA.
  • Owen G; Cheminformatics and Metabolism, European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), Cambridgeshire, CB10 1SD, Hinxton, UK.
  • Mutherkrishnan V; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
  • Thiessen P; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
  • Cheng T; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
  • Bolton E; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
  • Edwards N; Department of Biochemistry and Cellular Biology, Georgetown University, 3900 Reservoir RD NW, Washington, DC 20007, USA.
  • Tiemeyer M; Complex Carbohydrate Research Center, Department of Biochemistry and Molecular Biology, and Department of Chemistry, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, USA.
  • Campbell MP; Institute for Glycomics, Griffith University, Glycomics 1, G26/1 Parklands Dr, Southport QLD 4215, Australia.
  • Martin M; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK.
  • Vora J; The Department of Biochemistry and Molecular Biology, George Washington University Medical Center, 2300 I St NW, Washington DC 20052, USA.
  • Kahsay R; The Department of Biochemistry and Molecular Biology, George Washington University Medical Center, 2300 I St NW, Washington DC 20052, USA.
  • Mazumder R; The Department of Biochemistry and Molecular Biology, George Washington University Medical Center, 2300 I St NW, Washington DC 20052, USA.
Glycobiology ; 31(11): 1510-1519, 2021 12 18.
Article en En | MEDLINE | ID: mdl-34314492
ABSTRACT
Glycans play a vital role in health, disease, bioenergy, biomaterials and bio-therapeutics. As a result, there is keen interest to identify and increase glycan data in bioinformatics databases like ChEBI and PubChem, and connecting them to resources at the EMBL-EBI and NCBI to facilitate access to important annotations at a global level. GlyTouCan is a comprehensive archival database that contains glycans obtained primarily through batch upload from glycan repositories, glycoprotein databases and individual laboratories. In many instances, the glycan structures deposited in GlyTouCan may not be fully defined or have supporting experimental evidence and citations. Databases like ChEBI and PubChem were designed to accommodate complete atomistic structures with well-defined chemical linkages. As a result, they cannot easily accommodate the structural ambiguity inherent in glycan databases. Consequently, there is a need to improve the organization of glycan data coherently to enhance connectivity across the major NCBI, EMBL-EBI and glycoscience databases. This paper outlines a workflow developed in collaboration between GlyGen, ChEBI and PubChem to improve the visibility and connectivity of glycan data across these resources. GlyGen hosts a subset of glycans (~29,000) from the GlyTouCan database and has submitted valuable glycan annotations to the PubChem database and integrated over 10,500 (including ambiguously defined) glycans into the ChEBI database. The integrated glycans were prioritized based on links to PubChem and connectivity to glycoprotein data. The pipeline provides a blueprint for how glycan data can be harmonized between different resources. The current PubChem, ChEBI and GlyTouCan mappings can be downloaded from GlyGen (https//data.glygen.org).
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Polisacáridos / Programas Informáticos / Glicoproteínas / Bases de Datos de Compuestos Químicos Idioma: En Revista: Glycobiology Asunto de la revista: BIOQUIMICA Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Polisacáridos / Programas Informáticos / Glicoproteínas / Bases de Datos de Compuestos Químicos Idioma: En Revista: Glycobiology Asunto de la revista: BIOQUIMICA Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos