Your browser doesn't support javascript.
loading
Adaptive Box-Cox Transformation: A Highly Flexible Feature-Specific Data Transformation to Improve Metabolomic Data Normality for Better Statistical Analysis.
Yu, Huaxu; Sang, Peijun; Huan, Tao.
Afiliação
  • Yu H; Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada.
  • Sang P; Department of Statistics and Actuarial Science, Faculty of Mathematics, University of Waterloo, Waterloo, M3-200 University Avenue West, Waterloo, Ontario N2L 3G1, Canada.
  • Huan T; Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada.
Anal Chem ; 94(23): 8267-8276, 2022 06 14.
Article em En | MEDLINE | ID: mdl-35657711
ABSTRACT
Metabolomic data normality is vital for many statistical analyses to identify significantly different metabolic features. However, despite the thousands of metabolomic publications every year, the study of metabolomic data distribution is rare. Using large-scale metabolomic data sets, we performed a comprehensive study of metabolomic data distributions. We showcased that metabolic features have diverse data distribution types, and the majority of them cannot be normalized correctly using conventional data transformation algorithms, including log and square root transformations. To understand the various non-normal data distributions, we proposed fitting metabolomic data into nine beta distributions, each representing a unique data distribution. The results of three large-scale data sets consistently show that two low normality types are very common. Next, we created the adaptive Box-Cox (ABC) transformation, a novel feature-specific data transformation approach for improving data normality. By tuning a power parameter based on a normality test result, ABC transformation was made to work for various data distribution types, and it showed great performance in normalizing skewed metabolomic data. Tested on a series of simulated data in Monte Carlo simulations, ABC transformation outperformed conventional data transformation approaches for both positively and negatively skewed data distributions. ABC transformation was further demonstrated in a real metabolomic study composed of three pairwise comparisons. Additional 84, 44, and 57 significant metabolites were newly confirmed after ABC transformation, corresponding to respective increases of 70.6, 13.4, and 22.9% in significant metabolites compared to the conventional metabolomic workflow. Some of these newly discovered metabolites showed promising biological meanings. ABC transformation was implemented in the R package ABCstats and is freely available on GitHub (https//github.com/HuanLab/ABCstats).
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa / Algoritmos Tipo de estudo: Health_economic_evaluation / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa / Algoritmos Tipo de estudo: Health_economic_evaluation / Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article