Your browser doesn't support javascript.
loading
Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies.
Metz, Thomas O; Chang, Christine H; Gautam, Vasuk; Anjum, Afia; Tian, Siyang; Wang, Fei; Colby, Sean M; Nunez, Jamie R; Blumer, Madison R; Edison, Arthur S; Fiehn, Oliver; Jones, Dean P; Li, Shuzhao; Morgan, Edward T; Patti, Gary J; Ross, Dylan H; Shapiro, Madelyn R; Williams, Antony J; Wishart, David S.
Afiliación
  • Metz TO; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Chang CH; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Gautam V; Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada.
  • Anjum A; Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada.
  • Tian S; Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada.
  • Wang F; Department of Computing Science, University of Alberta, Edmonton, AB, Canada.
  • Colby SM; Alberta Machine Intelligence Institute, Edmonton, AB, Canada.
  • Nunez JR; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Blumer MR; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Edison AS; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Fiehn O; Department of Biochemistry & Molecular Biology, Complex Carbohydrate Research Center and Institute of Bioinformatics, University of Georgia, Athens, GA, USA.
  • Jones DP; West Coast Metabolomics Center, University of California Davis, Davis, CA, USA.
  • Li S; Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, Georgia, USA.
  • Morgan ET; The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
  • Patti GJ; Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Atlanta, Georgia, USA.
  • Ross DH; Center for Mass Spectrometry and Metabolic Tracing, Department of Chemistry, Department of Medicine, Washington University, Saint Louis, Missouri, USA.
  • Shapiro MR; Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Williams AJ; Artificial Intelligence & Data Analytics Division, Pacific Northwest National Laboratory, Richland, WA USA.
  • Wishart DS; U.S. Environmental Protection Agency, Office of Research & Development, Center for Computational Toxicology & Exposure (CCTE), Research Triangle Park, NC USA.
bioRxiv ; 2024 Jul 31.
Article en En | MEDLINE | ID: mdl-39131324
ABSTRACT
Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence - the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match to an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an in silico analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article