Pesquisa | BVS Doenças Infecciosas e Parasitárias

Common data models to streamline metabolomics processing and annotation, and implementation in a Python pipeline.

Mitchell, Joshua M; Chi, Yuanye; Thapa, Maheshwor; Pang, Zhiqiang; Xia, Jianguo; Li, Shuzhao.

PLoS Comput Biol ; 20(6): e1011912, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38843301

RESUMO

To standardize metabolomics data analysis and facilitate future computational developments, it is essential to have a set of well-defined templates for common data structures. Here we describe a collection of data structures involved in metabolomics data processing and illustrate how they are utilized in a full-featured Python-centric pipeline. We demonstrate the performance of the pipeline, and the details in annotation and quality control using large-scale LC-MS metabolomics and lipidomics data and LC-MS/MS data. Multiple previously published datasets are also reanalyzed to showcase its utility in biological data analysis. This pipeline allows users to streamline data processing, quality control, annotation, and standardization in an efficient and transparent manner. This work fills a major gap in the Python ecosystem for computational metabolomics.

Assuntos

Metabolômica , Software , Metabolômica/métodos , Metabolômica/estatística & dados numéricos , Biologia Computacional/métodos , Lipidômica/métodos , Cromatografia Líquida/métodos , Espectrometria de Massas em Tandem/métodos , Linguagens de Programação , Humanos

Common data models to streamline metabolomics processing and annotation, and implementation in a Python pipeline.

Mitchell, Joshua M; Chi, Yuanye; Thapa, Maheshwor; Pang, Zhiqiang; Xia, Jianguo; Li, Shuzhao.

bioRxiv ; 2024 Feb 14.

Artigo em Inglês | MEDLINE | ID: mdl-38405981

RESUMO

To standardize metabolomics data analysis and facilitate future computational developments, it is essential is have a set of well-defined templates for common data structures. Here we describe a collection of data structures involved in metabolomics data processing and illustrate how they are utilized in a full-featured Python-centric pipeline. We demonstrate the performance of the pipeline, and the details in annotation and quality control using large-scale LC-MS metabolomics and lipidomics data and LC-MS/MS data. Multiple previously published datasets are also reanalyzed to showcase its utility in biological data analysis. This pipeline allows users to streamline data processing, quality control, annotation, and standardization in an efficient and transparent manner. This work fills a major gap in the Python ecosystem for computational metabolomics.

Trackable and scalable LC-MS metabolomics data processing using asari.

Li, Shuzhao; Siddiqa, Amnah; Thapa, Maheshwor; Chi, Yuanye; Zheng, Shujian.

Nat Commun ; 14(1): 4113, 2023 07 11.

Artigo em Inglês | MEDLINE | ID: mdl-37433854

RESUMO

Significant challenges remain in the computational processing of data from liquid chomratography-mass spectrometry (LC-MS)-based metabolomic experiments into metabolite features. In this study, we examine the issues of provenance and reproducibility using the current software tools. Inconsistency among the tools examined is attributed to the deficiencies of mass alignment and controls of feature quality. To address these issues, we develop the open-source software tool asari for LC-MS metabolomics data processing. Asari is designed with a set of specific algorithmic framework and data structures, and all steps are explicitly trackable. Asari compares favorably to other tools in feature detection and quantification. It offers substantial improvement in computational performance over current tools, and it is highly scalable.

Assuntos

Metabolômica , Espectrometria de Massas em Tandem , Cromatografia Líquida , Reprodutibilidade dos Testes

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA