Your browser doesn't support javascript.
loading
Mitigating site effects in covariance for machine learning in neuroimaging data.
Chen, Andrew A; Beer, Joanne C; Tustison, Nicholas J; Cook, Philip A; Shinohara, Russell T; Shou, Haochang.
Afiliação
  • Chen AA; Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
  • Beer JC; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
  • Tustison NJ; Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
  • Cook PA; Department of Radiology and Medical Imaging, University of Virginia, Charlottesville, Virginia, USA.
  • Shinohara RT; Department of Radiology, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
  • Shou H; Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.
Hum Brain Mapp ; 43(4): 1179-1195, 2022 03.
Article em En | MEDLINE | ID: mdl-34904312
ABSTRACT
To acquire larger samples for answering complex questions in neuroscience, researchers have increasingly turned to multi-site neuroimaging studies. However, these studies are hindered by differences in images acquired across multiple sites. These effects have been shown to bias comparison between sites, mask biologically meaningful associations, and even introduce spurious associations. To address this, the field has focused on harmonizing data by removing site-related effects in the mean and variance of measurements. Contemporaneously with the increase in popularity of multi-center imaging, the use of machine learning (ML) in neuroimaging has also become commonplace. These approaches have been shown to provide improved sensitivity, specificity, and power due to their modeling the joint relationship across measurements in the brain. In this work, we demonstrate that methods for removing site effects in mean and variance may not be sufficient for ML. This stems from the fact that such methods fail to address how correlations between measurements can vary across sites. Data from the Alzheimer's Disease Neuroimaging Initiative is used to show that considerable differences in covariance exist across sites and that popular harmonization techniques do not address this issue. We then propose a novel harmonization method called Correcting Covariance Batch Effects (CovBat) that removes site effects in mean, variance, and covariance. We apply CovBat and show that within-site correlation matrices are successfully harmonized. Furthermore, we find that ML methods are unable to distinguish scanner manufacturer after our proposed harmonization is applied, and that the CovBat-harmonized data retain accurate prediction of disease group.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento de Imagem Assistida por Computador / Córtex Cerebral / Estudos Multicêntricos como Assunto / Neuroimagem Tipo de estudo: Clinical_trials / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento de Imagem Assistida por Computador / Córtex Cerebral / Estudos Multicêntricos como Assunto / Neuroimagem Tipo de estudo: Clinical_trials / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article