RESUMO
BACKGROUND: With the development of noninvasive imaging technology, collecting different imaging measurements of the same brain has become more and more easy. These multimodal imaging data carry complementary information of the same brain, with both specific and shared information being intertwined. Within these multimodal data, it is essential to discriminate the specific information from the shared information since it is of benefit to comprehensively characterize brain diseases. While most existing methods are unqualified, in this paper, we propose a parameter decomposition based sparse multi-view canonical correlation analysis (PDSMCCA) method. PDSMCCA could identify both modality-shared and -specific information of multimodal data, leading to an in-depth understanding of complex pathology of brain disease. RESULTS: Compared with the SMCCA method, our method obtains higher correlation coefficients and better canonical weights on both synthetic data and real neuroimaging data. This indicates that, coupled with modality-shared and -specific feature selection, PDSMCCA improves the multi-view association identification and shows meaningful feature selection capability with desirable interpretation. CONCLUSIONS: The novel PDSMCCA confirms that the parameter decomposition is a suitable strategy to identify both modality-shared and -specific imaging features. The multimodal association and the diverse information of multimodal imaging data enable us to better understand the brain disease such as Alzheimer's disease.
Assuntos
Doença de Alzheimer , Análise de Correlação Canônica , Algoritmos , Doença de Alzheimer/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Humanos , Imageamento por Ressonância Magnética/métodos , Neuroimagem/métodosRESUMO
Image genetics mainly explores the pathogenesis of Alzheimer's disease (AD) by studying the relationship between genetic data (such as SNP, gene expression data, and DNA methylation) and imaging data (such as structural MRI (sMRI), fMRI, and PET). Most of the existing research on brain imaging genomics uses two-way or three-way bi-multivariate methods to explore the correlation analysis between genes and brain imaging. However, many of these methods are still affected by the gradient domination or cannot take into account the effect of feature redundancy on the results, so that the typical correlation coefficient and program running speed are not significantly improved. In order to solve the above problems, this paper proposes a multi-constrained uncertainty-aware adaptive sparse multi-view canonical correlation analysis method (MC-unAdaSMCCA) to explore associations among SNPs, gene expression data, and sMRI; that is, based on traditional unAdaSMCCA, orthogonal constraints are imposed on the weights of the three data features through linear programming, which can reduce the redundancy of feature weights to improve the correlation between the data and reduce the complexity of the algorithm to significantly speed up the running speed of the program. Three adaptive sparse multi-view canonical correlation analysis methods are used as benchmarks to evaluate the difference between real neuroimaging data and synthetic data. Compared with the other three methods, our proposed method has obtained better or comparable typical correlation coefficients and typical weights. Moreover, the following experimental results show that the MC-unAdaSMCCA method cannot only identify biomarkers related to AD and mild cognitive impairment (MCI), but also has a strong ability to resist noise and process high-dimensional data. Therefore, our proposed method provides a reliable approach to multi-modal imaging genetic researches.