RESUMEN
When performing chromatography-mass spectrometry-based nontargeted metabolomics, or exposomics, one of the key steps in the analysis is to obtain MS1-based feature tables. Inapt parameter settings in feature detection will result in missing or wrong quantitative values and might ultimately lead to downstream incorrect biological interpretations. However, until recently, no strategies to assess the completeness and abundance accuracy of feature tables were available. Here, we show that mzRAPP enables the generation of benchmark peak lists by using an internal set of known molecules in the analyzed data set. Using the benchmark, the completeness and abundance accuracy of feature tables can be assessed in an automated pipeline. We demonstrate that our approach adds to other commonly applied quality assurance methods such as manual or automatized parameter optimization techniques or removal of false-positive signals. Moreover, we show that as few as 10 benchmark molecules can already allow for representative performance metrics to further improve quantitative biological understanding.
Asunto(s)
Metabolómica , Cromatografía Liquida/métodos , Espectrometría de Masas/métodos , Metabolómica/métodosRESUMEN
SUMMARY: Reliability assessment of automated pre-processing of liquid chromatography-high resolution mass spectrometry data presents a significant challenge. Here, we present a tool named mzRAPP, which generates and validates a benchmark from user-supplied information and later utilizes it for reliability assessment of data pre-processing. As a result, mzRAPP produces several performance metrics for different steps of the pre-processing workflow, supporting five of the most commonly used pre-processing tools. AVAILABILITY AND IMPLEMENTATION: mzRAPP is implemented in R and can be downloaded from GitHub under GNU GPL v.3.0 licence. Extensive documentation, background and examples are available at (https://github.com/YasinEl/mzRAPP). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
RESUMEN
Exposome-Explorer (http://exposome-explorer.iarc.fr) is a database of dietary and pollutant biomarkers measured in population studies. In its first release, Exposome-Explorer contained comprehensive information on 692 biomarkers of dietary and pollution exposures extracted from the analysis of 480 peer-reviewed publications. Today, Exposome-Explorer has been further expanded and contains a total of 908 biomarkers. Two additional types of information have been collected. First, 185 candidate dietary biomarkers having 403 associations with food intake (as measured by metabolomic studies) have been identified and added. Second, 1356 associations between dietary biomarkers and cancer risk in epidemiological studies, which were collected from 313 publications, have also been added to the database. Classifications for both foods and compounds have been revised, and new classifications for biospecimens, analytical methods and cancers have been implemented. Finally, the web interface has been redesigned to significantly improve the user experience.
Asunto(s)
Bases de Datos de Compuestos Químicos , Dieta , Biomarcadores Ambientales , Contaminantes Ambientales , Exposoma , Neoplasias/epidemiología , Recolección de Datos , Manejo de Datos , Humanos , Factores de RiesgoRESUMEN
Metabolomics encompasses the systematic identification and quantification of all metabolic products in the human body. This field could provide clinicians with novel sets of diagnostic biomarkers for disease states in addition to quantifying treatment response to medications at an individualized level. This literature review aims to highlight the technology underpinning metabolic profiling, identify potential applications of metabolomics in clinical practice, and discuss the translational challenges that the field faces. We searched PubMed, MEDLINE, and EMBASE for primary and secondary research articles regarding clinical applications of metabolomics. Metabolic profiling can be performed using mass spectrometry and nuclear magnetic resonance-based techniques using a variety of biological samples. This is carried out in vivo or in vitro following careful sample collection, preparation, and analysis. The potential clinical applications constitute disruptive innovations in their respective specialities, particularly oncology and metabolic medicine. Outstanding issues currently preventing widespread clinical use are scalability of data interpretation, standardization of sample handling practice, and e-infrastructure. Routine utilization of metabolomics at a patient and population level will constitute an integral part of future healthcare provision.
Asunto(s)
Metabolómica , Medicina de Precisión , Estetoscopios , HumanosRESUMEN
MOTIVATION: Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator. RESULTS: We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science. AVAILABILITY AND IMPLEMENTATION: The PhenoMeNal consortium maintains a web portal (https://portal.phenomenal-h2020.eu) providing a GUI for launching the Virtual Research Environment. The GitHub repository https://github.com/phnmnl/ hosts the source code of all projects. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Análisis de Datos , Metabolómica , Biología Computacional , Programas Informáticos , Flujo de TrabajoRESUMEN
Mass spectrometry (MS) is one of the primary techniques used for large-scale analysis of small molecules in metabolomics studies. To date, there has been little data format standardization in this field, as different software packages export results in different formats represented in XML or plain text, making data sharing, database deposition, and reanalysis highly challenging. Working within the consortia of the Metabolomics Standards Initiative, Proteomics Standards Initiative, and the Metabolomics Society, we have created mzTab-M to act as a common output format from analytical approaches using MS on small molecules. The format has been developed over several years, with input from a wide range of stakeholders. mzTab-M is a simple tab-separated text format, but importantly, the structure is highly standardized through the design of a detailed specification document, tightly coupled to validation software, and a mandatory controlled vocabulary of terms to populate it. The format is able to represent final quantification values from analyses, as well as the evidence trail in terms of features measured directly from MS (e.g., LC-MS, GC-MS, DIMS, etc.) and different types of approaches used to identify molecules. mzTab-M allows for ambiguity in the identification of molecules to be communicated clearly to readers of the files (both people and software). There are several implementations of the format available, and we anticipate widespread adoption in the field.
Asunto(s)
Metabolómica/métodos , Programas Informáticos , Bases de Datos Factuales , Espectrometría de MasasRESUMEN
The 2017 Dagstuhl Seminar on Computational Proteomics provided an opportunity for a broad discussion on the current state and future directions of the generation and use of peptide tandem mass spectrometry spectral libraries. Their use in proteomics is growing slowly, but there are multiple challenges in the field that must be addressed to further increase the adoption of spectral libraries and related techniques. The primary bottlenecks are the paucity of high quality and comprehensive libraries and the general difficulty of adopting spectral library searching into existing workflows. There are several existing spectral library formats, but none captures a satisfactory level of metadata; therefore, a logical next improvement is to design a more advanced, Proteomics Standards Initiative-approved spectral library format that can encode all of the desired metadata. The group discussed a series of metadata requirements organized into three designations of completeness or quality, tentatively dubbed bronze, silver, and gold. The metadata can be organized at four different levels of granularity: at the collection (library) level, at the individual entry (peptide ion) level, at the peak (fragment ion) level, and at the peak annotation level. Strategies for encoding mass modifications in a consistent manner and the requirement for encoding high-quality and commonly seen but as-yet-unidentified spectra were discussed. The group also discussed related topics, including strategies for comparing two spectra, techniques for generating representative spectra for a library, approaches for selection of optimal signature ions for targeted workflows, and issues surrounding the merging of two or more libraries into one. We present here a review of this field and the challenges that the community must address in order to accelerate the adoption of spectral libraries in routine analysis of proteomics datasets.
Asunto(s)
Bases de Datos de Proteínas/normas , Biblioteca de Péptidos , Proteómica/métodos , Animales , Humanos , Espectrometría de Masas en Tándem/métodos , Flujo de TrabajoRESUMEN
NMR is a widely used analytical technique with a growing number of repositories available. As a result, demands for a vendor-agnostic, open data format for long-term archiving of NMR data have emerged with the aim to ease and encourage sharing, comparison, and reuse of NMR data. Here we present nmrML, an open XML-based exchange and storage format for NMR spectral data. The nmrML format is intended to be fully compatible with existing NMR data for chemical, biochemical, and metabolomics experiments. nmrML can capture raw NMR data, spectral data acquisition parameters, and where available spectral metadata, such as chemical structures associated with spectral assignments. The nmrML format is compatible with pure-compound NMR data for reference spectral libraries as well as NMR data from complex biomixtures, i.e., metabolomics experiments. To facilitate format conversions, we provide nmrML converters for Bruker, JEOL and Agilent/Varian vendor formats. In addition, easy-to-use Web-based spectral viewing, processing, and spectral assignment tools that read and write nmrML have been developed. Software libraries and Web services for data validation are available for tool developers and end-users. The nmrML format has already been adopted for capturing and disseminating NMR data for small molecules by several open source data processing tools and metabolomics reference spectral libraries, e.g., serving as storage format for the MetaboLights data repository. The nmrML open access data standard has been endorsed by the Metabolomics Standards Initiative (MSI), and we here encourage user participation and feedback to increase usability and make it a successful standard.
Asunto(s)
Bases de Datos de Compuestos Químicos/normas , Espectroscopía de Resonancia Magnética/estadística & datos numéricos , Metabolómica/métodos , Programas InformáticosRESUMEN
SUMMARY: Submission to the MetaboLights repository for metabolomics data currently places the burden of reporting instrument and acquisition parameters in ISA-Tab format on users, who have to do it manually, a process that is time consuming and prone to user input error. Since the large majority of these parameters are embedded in instrument raw data files, an opportunity exists to capture this metadata more accurately. Here we report a set of Python packages that can automatically generate ISA-Tab metadata file stubs from raw XML metabolomics data files. The parsing packages are separated into mzML2ISA (encompassing mzML and imzML formats) and nmrML2ISA (nmrML format only). Overall, the use of mzML2ISA & nmrML2ISA reduces the time needed to capture metadata substantially (capturing 90% of metadata on assay and sample levels), is much less prone to user input errors, improves compliance with minimum information reporting guidelines and facilitates more finely grained data exploration and querying of datasets. AVAILABILITY AND IMPLEMENTATION: mzML2ISA & nmrML2ISA are available under version 3 of the GNU General Public Licence at https://github.com/ISA-tools. Documentation is available from http://2isa.readthedocs.io/en/latest/. CONTACT: reza.salek@ebi.ac.uk or isatools@googlegroups.com. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Almacenamiento y Recuperación de la Información , Metabolómica/métodos , Metadatos , Programas Informáticos , Minería de Datos/métodosRESUMEN
INTRODUCTION: Adoption of automatic profiling tools for 1H-NMR-based metabolomic studies still lags behind other approaches in the absence of the flexibility and interactivity necessary to adapt to the properties of study data sets of complex matrices. OBJECTIVES: To provide an open source tool that fully integrates these needs and enables the reproducibility of the profiling process. METHODS: rDolphin incorporates novel techniques to optimize exploratory analysis, metabolite identification, and validation of profiling output quality. RESULTS: The information and quality achieved in two public datasets of complex matrices are maximized. CONCLUSION: rDolphin is an open-source R package ( http://github.com/danielcanueto/rDolphin ) able to provide the best balance between accuracy, reproducibility and ease of use.
Asunto(s)
Metabolómica/métodos , Espectroscopía de Protones por Resonancia Magnética/métodos , Programas Informáticos , Conjuntos de Datos como Asunto , Humanos , Metaboloma , Metabolómica/normas , Espectroscopía de Protones por Resonancia Magnética/normas , Reproducibilidad de los ResultadosRESUMEN
The Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) has now been developing and promoting open community standards and software tools in the field of proteomics for 15 years. Under the guidance of the chair, cochairs, and other leadership positions, the PSI working groups are tasked with the development and maintenance of community standards via special workshops and ongoing work. Among the existing ratified standards, the PSI working groups continue to update PSI-MI XML, MITAB, mzML, mzIdentML, mzQuantML, mzTab, and the MIAPE (Minimum Information About a Proteomics Experiment) guidelines with the advance of new technologies and techniques. Furthermore, new standards are currently either in the final stages of completion (proBed and proBAM for proteogenomics results as well as PEFF) or in early stages of design (a spectral library standard format, a universal spectrum identifier, the qcML quality control format, and the Protein Expression Interface (PROXI) web services Application Programming Interface). In this work we review the current status of all of these aspects of the PSI, describe synergies with other efforts such as the ProteomeXchange Consortium, the Human Proteome Project, and the metabolomics community, and provide a look at future directions of the PSI.
Asunto(s)
Proteómica/normas , Programas Informáticos , Bases de Datos de Proteínas/normas , Bases de Datos de Proteínas/tendencias , Guías como Asunto , Humanos , Metabolómica , Proteómica/tendencias , Estándares de Referencia , Programas Informáticos/normas , Programas Informáticos/tendenciasRESUMEN
To have confidence in results acquired during biological mass spectrometry experiments, a systematic approach to quality control is of vital importance. Nonetheless, until now, only scattered initiatives have been undertaken to this end, and these individual efforts have often not been complementary. To address this issue, the Human Proteome Organization-Proteomics Standards Initiative has established a new working group on quality control at its meeting in the spring of 2016. The goal of this working group is to provide a unifying framework for quality control data. The initial focus will be on providing a community-driven standardized file format for quality control. For this purpose, the previously proposed qcML format will be adapted to support a variety of use cases for both proteomics and metabolomics applications, and it will be established as an official PSI format. An important consideration is to avoid enforcing restrictive requirements on quality control but instead provide the basic technical necessities required to support extensive quality control for any type of mass spectrometry-based workflow. We want to emphasize that this is an open community effort, and we seek participation from all scientists with an interest in this field.
Asunto(s)
Proteoma/análisis , Proteómica , Bases de Datos de Proteínas , Humanos , Espectrometría de Masas/normas , Proteoma/normas , Proteómica/normas , Control de CalidadRESUMEN
Metabolic traits are molecular phenotypes that can drive clinical phenotypes and may predict disease progression. Here, we report results from a metabolome- and genome-wide association study on (1)H-NMR urine metabolic profiles. The study was conducted within an untargeted approach, employing a novel method for compound identification. From our discovery cohort of 835 Caucasian individuals who participated in the CoLaus study, we identified 139 suggestively significant (P<5×10(-8)) and independent associations between single nucleotide polymorphisms (SNP) and metabolome features. Fifty-six of these associations replicated in the TasteSensomics cohort, comprising 601 individuals from São Paulo of vastly diverse ethnic background. They correspond to eleven gene-metabolite associations, six of which had been previously identified in the urine metabolome and three in the serum metabolome. Our key novel findings are the associations of two SNPs with NMR spectral signatures pointing to fucose (rs492602, Pâ=â6.9×10(-44)) and lysine (rs8101881, Pâ=â1.2×10(-33)), respectively. Fine-mapping of the first locus pinpointed the FUT2 gene, which encodes a fucosyltransferase enzyme and has previously been associated with Crohn's disease. This implicates fucose as a potential prognostic disease marker, for which there is already published evidence from a mouse model. The second SNP lies within the SLC7A9 gene, rare mutations of which have been linked to severe kidney damage. The replication of previous associations and our new discoveries demonstrate the potential of untargeted metabolomics GWAS to robustly identify molecular disease markers.
Asunto(s)
Metaboloma/genética , Metabolómica , Polimorfismo de Nucleótido Simple/genética , Orina , Sistemas de Transporte de Aminoácidos Básicos/genética , Animales , Enfermedad de Crohn/genética , Enfermedad de Crohn/metabolismo , Fucosiltransferasas/genética , Fucosiltransferasas/metabolismo , Estudio de Asociación del Genoma Completo , Humanos , Enfermedades Renales/genética , Enfermedades Renales/metabolismo , Espectroscopía de Resonancia Magnética , Masculino , Ratones , Galactósido 2-alfa-L-FucosiltransferasaRESUMEN
NMR-based metabolomics has shown considerable promise in disease diagnosis and biomarker discovery because it allows one to nondestructively identify and quantify large numbers of novel metabolite biomarkers in both biofluids and tissues. Precise metabolite quantification is a prerequisite to move any chemical biomarker or biomarker panel from the lab to the clinic. Among the biofluids commonly used for disease diagnosis and prognosis, urine has several advantages. It is abundant, sterile, and easily obtained, needs little sample preparation, and does not require invasive medical procedures for collection. Furthermore, urine captures and concentrates many "unwanted" or "undesirable" compounds throughout the body, providing a rich source of potentially useful disease biomarkers; however, incredible variation in urine chemical concentrations makes analysis of urine and identification of useful urinary biomarkers by NMR challenging. We discuss a number of the most significant issues regarding NMR-based urinary metabolomics with specific emphasis on metabolite quantification for disease biomarker applications and propose data collection and instrumental recommendations regarding NMR pulse sequences, acceptable acquisition parameter ranges, relaxation effects on quantitation, proper handling of instrumental differences, sample preparation, and biomarker assessment.
Asunto(s)
Biomarcadores/orina , Espectroscopía de Resonancia Magnética/métodos , Metabolómica/métodos , Urinálisis/métodos , Guías como Asunto/normas , Humanos , Espectroscopía de Resonancia Magnética/normas , Metabolómica/normas , Estándares de Referencia , Reproducibilidad de los ResultadosRESUMEN
The HUPO Proteomics Standards Initiative has developed several standardized data formats to facilitate data sharing in mass spectrometry (MS)-based proteomics. These allow researchers to report their complete results in a unified way. However, at present, there is no format to describe the final qualitative and quantitative results for proteomics and metabolomics experiments in a simple tabular format. Many downstream analysis use cases are only concerned with the final results of an experiment and require an easily accessible format, compatible with tools such as Microsoft Excel or R. We developed the mzTab file format for MS-based proteomics and metabolomics results to meet this need. mzTab is intended as a lightweight supplement to the existing standard XML-based file formats (mzML, mzIdentML, mzQuantML), providing a comprehensive summary, similar in concept to the supplemental material of a scientific publication. mzTab files can contain protein, peptide, and small molecule identifications together with experimental metadata and basic quantitative information. The format is not intended to store the complete experimental evidence but provides mechanisms to report results at different levels of detail. These range from a simple summary of the final results to a representation of the results including the experimental design. This format is ideally suited to make MS-based proteomics and metabolomics results available to a wider biological community outside the field of MS. Several software tools for proteomics and metabolomics have already adapted the format as an output format. The comprehensive mzTab specification document and extensive additional documentation can be found online.
Asunto(s)
Bases de Datos de Proteínas , Programas Informáticos , Acceso a la Información , Espectrometría de Masas , Metabolómica , Proteómica , Interfaz Usuario-ComputadorRESUMEN
MetaboLights (http://www.ebi.ac.uk/metabolights) is the first general-purpose, open-access repository for metabolomics studies, their raw experimental data and associated metadata, maintained by one of the major open-access data providers in molecular biology. Metabolomic profiling is an important tool for research into biological functioning and into the systemic perturbations caused by diseases, diet and the environment. The effectiveness of such methods depends on the availability of public open data across a broad range of experimental methods and conditions. The MetaboLights repository, powered by the open source ISA framework, is cross-species and cross-technique. It will cover metabolite structures and their reference spectra as well as their biological roles, locations, concentrations and raw data from metabolic experiments. Studies automatically receive a stable unique accession number that can be used as a publication reference (e.g. MTBLS1). At present, the repository includes 15 submitted studies, encompassing 93 protocols for 714 assays, and span over 8 different species including human, Caenorhabditis elegans, Mus musculus and Arabidopsis thaliana. Eight hundred twenty-seven of the metabolites identified in these studies have been mapped to ChEBI. These studies cover a variety of techniques, including NMR spectroscopy and mass spectrometry.
Asunto(s)
Bases de Datos de Compuestos Químicos , Metaboloma , Metabolómica , Animales , Humanos , Internet , Ratones , Interfaz Usuario-ComputadorRESUMEN
The Annual 2014 Spring Workshop of the Proteomics Standards Initiative (PSI) of the Human Proteome Organization (HUPO) was held this year jointly with the metabolomics COordination of Standards in MetabOlomicS (COSMOS) group. The range of existing MS standards (mzML, mzIdentML, mzQuantML, mzTab, TraML) was reviewed and updated in the light of new methodologies and advances in technologies. Adaptations to meet the needs of the metabolomics community were incorporated and a new data format for NMR, nmrML, was presented. The molecular interactions workgroup began work on a new version of the existing XML data interchange format. PSI-MI XML3.0 will enable the capture of more abstract data types such as protein complex topology derived from experimental data, allosteric binding, and dynamic interactions. Further information about the work of the HUPO-PSI can be found at http://www.psidev.info.
Asunto(s)
Proteoma/análisis , Proteómica/métodos , Compresión de Datos/métodos , Bases de Datos de Proteínas , Alemania , Humanos , Espectrometría de Masas/métodos , Metabolómica/educación , Metabolómica/métodos , Mapeo de Interacción de Proteínas/métodos , Proteoma/metabolismo , Proteómica/educación , Programas InformáticosRESUMEN
One of the main challenges in nuclear magnetic resonance (NMR) metabolomics is to obtain valuable metabolic information from large datasets of raw NMR spectra in a high throughput, automatic, and reproducible way. To date, established software packages used to match and quantify metabolites in NMR spectra remain mostly manually operated, leading to low resolution results and subject to inconsistencies not attributable to the NMR technique itself. Here, we introduce a new software package, called Dolphin, able to automatically quantify a set of target metabolites in multiple sample measurements using an approach based on 1D and 2D NMR techniques to overcome the inherent limitations of 1D (1)H-NMR spectra in metabolomics. Dolphin takes advantage of the 2D J-resolved NMR spectroscopy signal dispersion to avoid inconsistencies in signal position detection, enhancing the reliability and confidence in metabolite matching. Furthermore, in order to improve accuracy in quantification, Dolphin uses 2D NMR spectra to obtain additional information on all neighboring signals surrounding the target metabolite. We have compared the targeted profiling results of Dolphin, recorded from standard biological mixtures, with those of two well established approaches in NMR metabolomics. Overall, Dolphin produced more accurate results with the added advantage of being a fully automated and high throughput processing package.
Asunto(s)
Espectroscopía de Resonancia Magnética/métodos , Metabolómica/métodos , Programas Informáticos , Animales , Humanos , Hígado/química , Hígado/metabolismo , Metaboloma , Ratas , Reproducibilidad de los ResultadosRESUMEN
In this study, a comprehensive methodology combining machine learning and statistical analysis was employed to investigate alterations in the metabolite profiles, including lipids, of breast cancer tissues and their subtypes. By integrating biological and machine learning feature selection techniques, along with univariate and multivariate analyses, a notable lipid signature was identified in breast cancer tissues. The results revealed elevated levels of saturated and monounsaturated phospholipids in breast cancer tissues, consistent with external validation findings. Additionally, lipidomics analysis in both the original and validation datasets indicated lower levels of most triacylglycerols compared to non-cancerous tissues, suggesting potential alterations in lipid storage and metabolism within cancer cells. Analysis of cancer subtypes revealed that levels of PC 30:0 were relatively reduced in HER2(-) samples that were ER(+) and PR(+) compared to those that were ER(-) and PR(-). Conversely, HER2(+) tumors, which were ER(-) and PR(-), exhibited increased concentrations of PC 30:0. This increase could potentially be linked to the role of Stearoyl-CoA-Desaturase 1 in breast cancer. Comprehensive metabolomic analyses of breast cancer can offer crucial insights into cancer development, aiding in early detection and treatment evaluation of this devastating disease.
Asunto(s)
Neoplasias de la Mama , Lipidómica , Humanos , Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Femenino , Lipidómica/métodos , Metabolismo de los Lípidos , Aprendizaje Automático , Lípidos/análisis , Receptor ErbB-2/metabolismo , Estearoil-CoA Desaturasa/metabolismoRESUMEN
This work presents a proposed extension to the International Union of Pure and Applied Chemistry (IUPAC) International Chemical Identifier (InChI) standard that allows the representation of isotopically-resolved chemical entities at varying levels of ambiguity in isotope location. This extension includes an improved interpretation of the current isotopic layer within the InChI standard and a new isotopologue layer specification for representing chemical intensities with ambiguous isotope localization. Both improvements support the unique isotopically-resolved chemical identification of features detected and measured in analytical instrumentation, specifically nuclear magnetic resonance and mass spectrometry. SCIENTIFIC CONTRIBUTION: This new extension to the InChI standard would enable improved annotation of analytical datasets characterizing chemical entities, supporting the FAIR (Findable, Accessible, Interoperable, and Reusable) guiding principles of data stewardship for chemical datasets, ultimately promoting Open Science in chemistry.