Your browser doesn't support javascript.
loading
HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values.
Voß, Hannah; Schlumbohm, Simon; Barwikowski, Philip; Wurlitzer, Marcus; Dottermusch, Matthias; Neumann, Philipp; Schlüter, Hartmut; Neumann, Julia E; Krisp, Christoph.
Afiliação
  • Voß H; Section Mass Spectrometry and Proteomics, Institute of Clinical Chemistry and Laboratory Medicine, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.
  • Schlumbohm S; High Performance Computing, Helmut Schmidt University, Hamburg, Germany.
  • Barwikowski P; Research Group Molecular Pathology in Neurooncology, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
  • Wurlitzer M; Section Mass Spectrometry and Proteomics, Institute of Clinical Chemistry and Laboratory Medicine, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.
  • Dottermusch M; Business division for Information Technology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
  • Neumann P; Research Group Molecular Pathology in Neurooncology, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
  • Schlüter H; Institute of Neuropathology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany.
  • Neumann JE; High Performance Computing, Helmut Schmidt University, Hamburg, Germany.
  • Krisp C; Section Mass Spectrometry and Proteomics, Institute of Clinical Chemistry and Laboratory Medicine, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.
Nat Commun ; 13(1): 3523, 2022 06 20.
Article em En | MEDLINE | ID: mdl-35725563
ABSTRACT
Dataset integration is common practice to overcome limitations in statistically underpowered omics datasets. Proteome datasets display high technical variability and frequent missing values. Sophisticated strategies for batch effect reduction are lacking or rely on error-prone data imputation. Here we introduce HarmonizR, a data harmonization tool with appropriate missing value handling. The method exploits the structure of available data and matrix dissection for minimal data loss, without data imputation. This strategy implements two common batch effect reduction methods-ComBat and limma (removeBatchEffect()). The HarmonizR strategy, evaluated on four exemplarily analyzed datasets with up to 23 batches, demonstrated successful data harmonization for different tissue preservation techniques, LC-MS/MS instrumentation setups, and quantification approaches. Compared to data imputation methods, HarmonizR was more efficient and performed superior regarding the detection of significant proteins. HarmonizR is an efficient tool for missing data tolerant experimental variance reduction and is easily adjustable for individual dataset properties and user preferences.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Proteômica / Espectrometria de Massas em Tandem Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Proteômica / Espectrometria de Massas em Tandem Idioma: En Ano de publicação: 2022 Tipo de documento: Article