Your browser doesn't support javascript.
loading
Putting everything in its place: using the INSDC compliant Pathogen Data Object Model to better structure genomic data submitted for public health applications.
Timme, Ruth E; Karsch-Mizrachi, Ilene; Waheed, Zahra; Arita, Masanori; MacCannell, Duncan; Maguire, Finlay; Petit Iii, Robert; Page, Andrew J; Mendes, Catarina Inês; Nasar, Muhammad Ibtisam; Oluniyi, Paul; Tyler, Andrea D; Raphenya, Amogelang R; Guthrie, Jennifer L; Olawoye, Idowu; Rinck, Gabriele; O'Cathail, Colman; Lees, John; Cochrane, Guy; Cummins, Carla; Brister, J Rodney; Klimke, William; Feldgarden, Michael; Griffiths, Emma.
Affiliation
  • Timme RE; Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, MD, USA.
  • Karsch-Mizrachi I; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
  • Waheed Z; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Arita M; DNA Data Bank of Japan, National Institute of Genetics, Mishima, Japan.
  • MacCannell D; National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA.
  • Maguire F; Department of Community Health & Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Canada.
  • Petit Iii R; Faculty of Computer Science, Dalhousie University, Halifax, Canada.
  • Page AJ; Wyoming Public Health Laboratory, Wyoming, USA.
  • Mendes CI; Theiagen Genomics LLC, Highlands Ranch, CO, USA.
  • Nasar MI; Quadram Institute Bioscience, Norwich, Norfolk, UK.
  • Oluniyi P; Theiagen Genomics LLC, Highlands Ranch, CO, USA.
  • Tyler AD; Department of Biology, College of Science, United Arab Emirates University- Al Ain, Abu Dhabi, UAE.
  • Raphenya AR; Chan Zuckerberg Biohub Network, San Francisco, CA, USA.
  • Guthrie JL; Science Technology Cores and Services, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada.
  • Olawoye I; Department of Biochemistry and Biomedical Sciences and the Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, Ontario, Canada.
  • Rinck G; Schulich School of Medicine & Dentistry, University of Western Ontario, London, Ontario, Canada.
  • O'Cathail C; Schulich School of Medicine & Dentistry, University of Western Ontario, London, Ontario, Canada.
  • Lees J; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Cochrane G; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Cummins C; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Brister JR; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Klimke W; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.
  • Feldgarden M; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
  • Griffiths E; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
Microb Genom ; 9(12)2023 Dec.
Article in En | MEDLINE | ID: mdl-38085797
ABSTRACT
Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries need to access not only the whole genomes of multiple pathogens but also make connections using robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches.To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasites), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Public Health / Nucleotides Language: En Journal: Microb Genom Year: 2023 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Public Health / Nucleotides Language: En Journal: Microb Genom Year: 2023 Document type: Article Affiliation country: