RESUMO
Imaging findings inconsistent with those expected at specific chronological age ranges may serve as early indicators of neurological disorders and increased mortality risk. Estimation of chronological age, and deviations from expected results, from structural magnetic resonance imaging (MRI) data has become an important proxy task for developing biomarkers that are sensitive to such deviations. Complementary to structural analysis, diffusion tensor imaging (DTI) has proven effective in identifying age-related microstructural changes within the brain white matter, thereby presenting itself as a promising additional modality for brain age prediction. Although early studies have sought to harness DTI's advantages for age estimation, there is no evidence that the success of this prediction is owed to the unique microstructural and diffusivity features that DTI provides, rather than the macrostructural features that are also available in DTI data. Therefore, we seek to develop white-matter-specific age estimation to capture deviations from normal white matter aging. Specifically, we deliberately disregard the macrostructural information when predicting age from DTI scalar images, using two distinct methods. The first method relies on extracting only microstructural features from regions of interest (ROIs). The second applies 3D residual neural networks (ResNets) to learn features directly from the images, which are non-linearly registered and warped to a template to minimize macrostructural variations. When tested on unseen data, the first method yields mean absolute error (MAE) of 6.11 ± 0.19 years for cognitively normal participants and MAE of 6.62 ± 0.30 years for cognitively impaired participants, while the second method achieves MAE of 4.69 ± 0.23 years for cognitively normal participants and MAE of 4.96 ± 0.28 years for cognitively impaired participants. We find that the ResNet model captures subtler, non-macrostructural features for brain age prediction.
RESUMO
Diffusion MRI (dMRI) streamline tractography, the gold-standard for in vivo estimation of white matter (WM) pathways in the brain, has long been considered as a product of WM microstructure. However, recent advances in tractography demonstrated that convolutional recurrent neural networks (CoRNN) trained with a teacher-student framework have the ability to learn to propagate streamlines directly from T1 and anatomical context. Training for this network has previously relied on high resolution dMRI. In this paper, we generalize the training mechanism to traditional clinical resolution data, which allows generalizability across sensitive and susceptible study populations. We train CoRNN on a small subset of the Baltimore Longitudinal Study of Aging (BLSA), which better resembles clinical scans. We define a metric, termed the epsilon ball seeding method, to compare T1 tractography and traditional diffusion tractography at the streamline level. We show that under this metric T1 tractography generated by CoRNN reproduces diffusion tractography with approximately three millimeters of error.
RESUMO
Subject head motion during the acquisition of diffusion-weighted imaging (DWI) of the brain induces artifacts and affects image quality. Information about the frequency and extent of motion could reveal which aspects of motion correction are most necessary. Therefore, we investigate the extent of translation and rotation among participants, and how the motion changes during the scan acquisition. We analyze 5,380 DWI scans from 1,034 participants. We measure the rotations and translations in the sagittal, coronal and transverse planes needed to align the volumes to the first and previous volumes, as well as the displacement. The different types of motion are compared with each other and compared over time. The largest rotation (per minute) is around the right - left axis (median 0.378 °/min, range 0.000 - 11.466°) and the largest translation (per minute) is along the anterior - posterior axis (median 1.867 mm/min, range 0.000 - 10.944 mm). We additionally observe that spikes in movement occur at the beginning of the scan, particularly in anterior - posterior translation. The results show that all scans are affected by subtle head motion, which may impact subsequent image analysis.
RESUMO
INTRODUCTION: The effects of sex, race, and Apolipoprotein E (APOE) - Alzheimer's disease (AD) risk factors - on white matter integrity are not well characterized. METHODS: Diffusion MRI data from nine well-established longitudinal cohorts of aging were free-water (FW)-corrected and harmonized. This dataset included 4,702 participants (age=73.06 ± 9.75) with 9,671 imaging sessions over time. FW and FW-corrected fractional anisotropy (FAFWcorr) were used to assess differences in white matter microstructure by sex, race, and APOE-ε4 carrier status. RESULTS: Sex differences in FAFWcorr in association and projection tracts, racial differences in FAFWcorr in projection tracts, and APOE-ε4 differences in FW limbic and occipital transcallosal tracts were most pronounced. DISCUSSION: There are prominent differences in white matter microstructure by sex, race, and APOE-ε4 carrier status. This work adds to our understanding of disparities in AD. Additional work to understand the etiology of these differences is warranted.
RESUMO
Purpose: Diffusion tensor imaging (DTI) is a magnetic resonance imaging technique that provides unique information about white matter microstructure in the brain but is susceptible to confounding effects introduced by scanner or acquisition differences. ComBat is a leading approach for addressing these site biases. However, despite its frequent use for harmonization, ComBat's robustness toward site dissimilarities and overall cohort size have not yet been evaluated in terms of DTI. Approach: As a baseline, we match N=358 participants from two sites to create a "silver standard" that simulates a cohort for multi-site harmonization. Across sites, we harmonize mean fractional anisotropy and mean diffusivity, calculated using participant DTI data, for the regions of interest defined by the JHU EVE-Type III atlas. We bootstrap 10 iterations at 19 levels of total sample size, 10 levels of sample size imbalance between sites, and 6 levels of mean age difference between sites to quantify (i) ßAGE, the linear regression coefficient of the relationship between FA and age; (ii) γ/f*, the ComBat-estimated site-shift; and (iii) δ/f*, the ComBat-estimated site-scaling. We characterize the reliability of ComBat by evaluating the root mean squared error in these three metrics and examine if there is a correlation between the reliability of ComBat and a violation of assumptions. Results: ComBat remains well behaved for ßAGE when N>162 and when the mean age difference is less than 4 years. The assumptions of the ComBat model regarding the normality of residual distributions are not violated as the model becomes unstable. Conclusion: Prior to harmonization of DTI data with ComBat, the input cohort should be examined for size and covariate distributions of each site. Direct assessment of residual distributions is less informative on stability than bootstrap analysis. We caution use ComBat of in situations that do not conform to the above thresholds.
RESUMO
T1-weighted (T1w) MRI has low frequency intensity artifacts due to magnetic field inhomogeneities. Removal of these biases in T1w MRI images is a critical preprocessing step to ensure spatially consistent image interpretation. N4ITK bias field correction, the current state-of-the-art, is implemented in such a way that makes it difficult to port between different pipelines and workflows, thus making it hard to reimplement and reproduce results across local, cloud, and edge platforms. Moreover, N4ITK is opaque to optimization before and after its application, meaning that methodological development must work around the inhomogeneity correction step. Given the importance of bias fields correction in structural preprocessing and flexible implementation, we pursue a deep learning approximation / reinterpretation of the N4ITK bias fields correction to create a method which is portable, flexible, and fully differentiable. In this paper, we trained a deep learning network "DeepN4" on eight independent cohorts from 72 different scanners and age ranges with N4ITK-corrected T1w MRI and bias field for supervision in log space. We found that we can closely approximate N4ITK bias fields correction with naïve networks. We evaluate the peak signal to noise ratio (PSNR) in test dataset against the N4ITK corrected images. The median PSNR of corrected images between N4ITK and DeepN4 was 47.96 dB. In addition, we assess the DeepN4 model on eight additional external datasets and show the generalizability of the approach. This study establishes that incompatible N4ITK preprocessing steps can be closely approximated by naïve deep neural networks, facilitating more flexibility. All code and models are released at https://github.com/MASILab/DeepN4 .
Assuntos
Processamento de Imagem Assistida por Computador , Imageamento por Ressonância Magnética , Processamento de Imagem Assistida por Computador/métodos , Imageamento por Ressonância Magnética/métodos , Algoritmos , Redes Neurais de Computação , ViésRESUMO
Data harmonization is necessary for removing confounding effects in multi-site diffusion image analysis. One such harmonization method, LinearRISH, scales rotationally invariant spherical harmonic (RISH) features from one site ("target") to the second ("reference") to reduce confounding scanner effects. However, reference and target site designations are not arbitrary and resultant diffusion metrics (fractional anisotropy, mean diffusivity) are biased by this choice. In this work we propose MidRISH: rather than scaling reference RISH features to target RISH features, we project both sites to a mid-space. We validate MidRISH with the following experiments: harmonizing scanner differences from 37 matched patients free of cognitive impairment, and harmonizing acquisition and study differences on 117 matched patients free of cognitive impairment. We find that MidRISH reduces bias of reference selection while preserving harmonization efficacy of LinearRISH. Users should be cautious when performing LinearRISH harmonization. To select a reference site is to choose diffusion metric effect-size. Our proposed method eliminates the bias-inducing site selection step.
Assuntos
Algoritmos , Humanos , Feminino , Masculino , Processamento de Imagem Assistida por Computador/métodos , Reprodutibilidade dos Testes , Encéfalo/diagnóstico por imagem , Imagem de Difusão por Ressonância Magnética/métodos , Anisotropia , Idoso , Pessoa de Meia-Idade , Imagem de Tensor de Difusão/métodos , Disfunção Cognitiva/diagnóstico por imagem , Interpretação de Imagem Assistida por Computador/métodosRESUMO
Imaging findings inconsistent with those expected at specific chronological age ranges may serve as early indicators of neurological disorders and increased mortality risk. Estimation of chronological age, and deviations from expected results, from structural magnetic resonance imaging (MRI) data has become an important proxy task for developing biomarkers that are sensitive to such deviations. Complementary to structural analysis, diffusion tensor imaging (DTI) has proven effective in identifying age-related microstructural changes within the brain white matter, thereby presenting itself as a promising additional modality for brain age prediction. Although early studies have sought to harness DTI's advantages for age estimation, there is no evidence that the success of this prediction is owed to the unique microstructural and diffusivity features that DTI provides, rather than the macrostructural features that are also available in DTI data. Therefore, we seek to develop white-matter-specific age estimation to capture deviations from normal white matter aging. Specifically, we deliberately disregard the macrostructural information when predicting age from DTI scalar images, using two distinct methods. The first method relies on extracting only microstructural features from regions of interest (ROIs). The second applies 3D residual neural networks (ResNets) to learn features directly from the images, which are non-linearly registered and warped to a template to minimize macrostructural variations. When tested on unseen data, the first method yields mean absolute error (MAE) of 6.11 ± 0.19 years for cognitively normal participants and MAE of 6.62 ± 0.30 years for cognitively impaired participants, while the second method achieves MAE of 4.69 ± 0.23 years for cognitively normal participants and MAE of 4.96 ± 0.28 years for cognitively impaired participants. We find that the ResNet model captures subtler, non-macrostructural features for brain age prediction.
RESUMO
T1-weighted (T1w) MRI has low frequency intensity artifacts due to magnetic field inhomogeneities. Removal of these biases in T1w MRI images is a critical preprocessing step to ensure spatially consistent image interpretation. N4ITK bias field correction, the current state-of-the-art, is implemented in such a way that makes it difficult to port between different pipelines and workflows, thus making it hard to reimplement and reproduce results across local, cloud, and edge platforms. Moreover, N4ITK is opaque to optimization before and after its application, meaning that methodological development must work around the inhomogeneity correction step. Given the importance of bias fields correction in structural preprocessing and flexible implementation, we pursue a deep learning approximation / reinterpretation of the N4ITK bias fields correction to create a method which is portable, flexible, and fully differentiable. In this paper, we trained a deep learning network "DeepN4" on eight independent cohorts from 72 different scanners and age ranges with N4ITK-corrected T1w MRI and bias field for supervision in log space. We found that we can closely approximate N4ITK bias fields correction with naïve networks. We evaluate the peak signal to noise ratio (PSNR) in test dataset against the N4ITK corrected images. The median PSNR of corrected images between N4ITK and DeepN4 was 47.96 dB. In addition, we assess the DeepN4 model on eight additional external datasets and show the generalizability of the approach. This study establishes that incompatible N4ITK preprocessing steps can be closely approximated by naïve deep neural networks, facilitating more flexibility. All code and models are released at https://github.com/MASILab/DeepN4.
RESUMO
Introduction: It is unclear how rates of white matter microstructural decline differ between normal aging and abnormal aging. Methods: Diffusion MRI data from several well-established longitudinal cohorts of aging (Alzheimer's Disease Neuroimaging Initiative [ADNI], Baltimore Longitudinal Study of Aging [BLSA], Vanderbilt Memory & Aging Project [VMAP]) were free-water corrected and harmonized. This dataset included 1723 participants (age at baseline: 72.8 ± 8.87 years, 49.5% male) and 4605 imaging sessions (follow-up time: 2.97 ± 2.09 years, follow-up range: 1-13 years, mean number of visits: 4.42 ± 1.98). Differences in white matter microstructural decline in normal and abnormal agers was assessed. Results: While we found a global decline in white matter in normal/abnormal aging, we found that several white matter tracts (e.g., cingulum bundle) were vulnerable to abnormal aging. Conclusions: There is a prevalent role of white matter microstructural decline in aging, and future large-scale studies in this area may further refine our understanding of the underlying neurodegenerative processes. HIGHLIGHTS: Longitudinal data were free-water corrected and harmonized.Global effects of white matter decline were seen in normal and abnormal aging.The free-water metric was most vulnerable to abnormal aging.Cingulum free-water was the most vulnerable to abnormal aging.
RESUMO
Recently, increasing evidence suggests that fMRI signals in white matter (WM), conventionally ignored as nuisance, are robustly detectable using appropriate processing methods and are related to neural activity, while changes in WM with aging and degeneration are also well documented. These findings suggest variations in patterns of BOLD signals in WM should be investigated. However, existing fMRI analysis tools, which were designed for processing gray matter signals, are not well suited for large-scale processing of WM signals in fMRI data. We developed an automatic pipeline for high-performance preprocessing of fMRI images with emphasis on quantifying changes in BOLD signals in WM in an aging population. At the image processing level, the pipeline integrated existing software modules with fine parameter tunings and modifications to better extract weaker WM signals. The preprocessing results primarily included whole-brain time-courses, functional connectivity, maps and tissue masks in a common space. At the job execution level, this pipeline exploited a local XNAT to store datasets and results, while using DAX tool to automatic distribute batch jobs that run on high-performance computing clusters. Through the pipeline, 5,034 fMRI/T1 scans were preprocessed. The intraclass correlation coefficient (ICC) of test-retest experiment based on the preprocessed data is 0.52 - 0.86 (N=1000), indicating a high reliability of our pipeline, comparable to previously reported ICC in gray matter experiments. This preprocessing pipeline highly facilitates our future analyses on WM functional alterations in aging and may be of benefit to a larger community interested in WM fMRI studies.
RESUMO
Objective: Data harmonization is necessary for removing confounding effects in multi-site diffusion image analysis. One such harmonization method, LinearRISH, scales rotationally invariant spherical harmonic (RISH) features from one site ("target") to the second ("reference") to reduce confounding scanner effects. However, reference and target site designations are not arbitrary and resultant diffusion metrics (fractional anisotropy, mean diffusivity) are biased by this choice. In this work we propose MidRISH: rather than scaling reference RISH features to target RISH features, we project both sites to a mid-space. Methods: We validate MidRISH with the following experiments: harmonizing scanner differences from 37 matched patients free of cognitive impairment, and harmonizing acquisition and study differences on 117 matched patients free of cognitive impairment. Conclusion: MidRISH reduces bias of reference selection while preserving harmonization efficacy of LinearRISH. Significance: Users should be cautious when performing LinearRISH harmonization. To select a reference site is to choose diffusion metric effect-size. Our proposed method eliminates the bias-inducing site selection step.
RESUMO
Batch size is a key hyperparameter in training deep learning models. Conventional wisdom suggests larger batches produce improved model performance. Here we present evidence to the contrary, particularly when using autoencoders to derive meaningful latent spaces from data with spatially global similarities and local differences, such as electronic health records (EHR) and medical imaging. We investigate batch size effects in both EHR data from the Baltimore Longitudinal Study of Aging and medical imaging data from the multimodal brain tumor segmentation (BraTS) challenge. We train fully connected and convolutional autoencoders to compress the EHR and imaging input spaces, respectively, into 32-dimensional latent spaces via reconstruction losses for various batch sizes between 1 and 100. Under the same hyperparameter configurations, smaller batches improve loss performance for both datasets. Additionally, latent spaces derived by autoencoders with smaller batches capture more biologically meaningful information. Qualitatively, we visualize 2-dimensional projections of the latent spaces and find that with smaller batches the EHR network better separates the sex of the individuals, and the imaging network better captures the right-left laterality of tumors. Quantitatively, the analogous sex classification and laterality regressions using the latent spaces demonstrate statistically significant improvements in performance at smaller batch sizes. Finally, we find improved individual variation locally in visualizations of representative data reconstructions at lower batch sizes. Taken together, these results suggest that smaller batch sizes should be considered when designing autoencoders to extract meaningful latent spaces among EHR and medical imaging data driven by global similarities and local variation.
RESUMO
The effects of normal aging on functional connectivity (FC) within various brain networks of gray matter (GM) have been well-documented. However, the age effects on the networks of FC between white matter (WM) and GM, namely WM-GM FC, remains unclear. Evaluating crucial properties, such as global efficiency (GE), for a WM-GM FC network poses a challenge due to the absence of closed triangle paths which are essential for assessing network properties in traditional graph models. In this study, we propose a bipartite graph model to characterize the WM-GM FC network and quantify these challenging network properties. Leveraging this model, we assessed the WM-GM FC network properties at multiple scales across 1,462 cognitively normal subjects aged 22-96 years from three repositories (ADNI, BLSA and OASIS-3) and investigated the age effects on these properties throughout adulthood and during late adulthood (age ≥70 years). Our findings reveal that (1) heterogeneous alterations occurred in region-specific WM-GM FC over the adulthood and decline predominated during late adulthood; (2) the FC density of WM bundles engaged in memory, executive function and processing speed declined with age over adulthood, particularly in later years; and (3) the GE of attention, default, somatomotor, frontoparietal and limbic networks reduced with age over adulthood, and GE of visual network declined during late adulthood. These findings provide unpresented insights into multi-scale alterations in networks of WM-GM functional synchronizations during normal aging. Furthermore, our bipartite graph model offers an extendable framework for quantifying WM-engaged networks, which may contribute to a wide range of neuroscience research.
Assuntos
Substância Cinzenta , Substância Branca , Humanos , Adulto , Substância Cinzenta/diagnóstico por imagem , Imageamento por Ressonância Magnética , Envelhecimento , Encéfalo , Substância Branca/diagnóstico por imagemRESUMO
INTRODUCTION: It is unclear how rates of white matter microstructural decline differ between normal aging and abnormal aging. METHODS: Diffusion MRI data from several well-established longitudinal cohorts of aging [Alzheimer's Neuroimaging Initiative (ADNI), Baltimore Longitudinal Study of Aging (BLSA), Vanderbilt Memory & Aging Project (VMAP)] was free-water corrected and harmonized. This dataset included 1,723 participants (age at baseline: 72.8±8.87 years, 49.5% male) and 4,605 imaging sessions (follow-up time: 2.97±2.09 years, follow-up range: 1-13 years, mean number of visits: 4.42±1.98). Differences in white matter microstructural decline in normal and abnormal agers was assessed. RESULTS: While we found global decline in white matter in normal/abnormal aging, we found that several white matter tracts (e.g., cingulum bundle) were vulnerable to abnormal aging. CONCLUSIONS: There is a prevalent role of white matter microstructural decline in aging, and future large-scale studies in this area may further refine our understanding of the underlying neurodegenerative processes. HIGHLIGHTS: Longitudinal data was free-water corrected and harmonizedGlobal effects of white matter decline were seen in normal and abnormal agingThe free-water metric was most vulnerable to abnormal agingCingulum free-water was the most vulnerable to abnormal aging.
RESUMO
Introduction: White matter microstructure may be abnormal along the Alzheimer's disease (AD) continuum. Methods: Diffusion magnetic resonance imaging (dMRI) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI, n = 627), Baltimore Longitudinal Study of Aging (BLSA, n = 684), and Vanderbilt Memory & Aging Project (VMAP, n = 296) cohorts were free-water (FW) corrected and conventional, and FW-corrected microstructural metrics were quantified within 48 white matter tracts. Microstructural values were subsequently harmonized using the Longitudinal ComBat technique and inputted as independent variables to predict diagnosis (cognitively unimpaired [CU], mild cognitive impairment [MCI], AD). Models were adjusted for age, sex, race/ethnicity, education, apolipoprotein E (APOE) ε4 carrier status, and APOE ε2 carrier status. Results: Conventional dMRI metrics were associated globally with diagnostic status; following FW correction, the FW metric itself exhibited global associations with diagnostic status, but intracellular metric associations were diminished. Discussion: White matter microstructure is altered along the AD continuum. FW correction may provide further understanding of the white matter neurodegenerative process in AD. Highlights: Longitudinal ComBat successfully harmonized large-scale diffusion magnetic resonance imaging (dMRI) metrics.Conventional dMRI metrics were globally sensitive to diagnostic status.Free-water (FW) correction mitigated intracellular associations with diagnostic status.The FW metric itself was globally sensitive to diagnostic status. Multivariate conventional and FW-corrected models may provide complementary information.
RESUMO
OBJECTIVE: We examined medical records to determine health conditions associated with dementia at varied intervals prior to dementia diagnosis in participants from the Baltimore Longitudinal Study of Aging (BLSA). METHODS: Data were available for 347 Alzheimer's disease (AD), 76 vascular dementia (VaD), and 811 control participants without dementia. Logistic regressions were performed associating International Classification of Diseases, 9th Revision (ICD-9) health codes with dementia status across all time points, at 5 and 1 year(s) prior to dementia diagnosis, and at the year of diagnosis, controlling for age, sex, and follow-up length of the medical record. RESULTS: In AD, the earliest and most consistent associations across all time points included depression, erectile dysfunction, gait abnormalities, hearing loss, and nervous and musculoskeletal symptoms. Cardiomegaly, urinary incontinence, non-epithelial skin cancer, and pneumonia were not significant until 1 year before dementia diagnosis. In VaD, the earliest and most consistent associations across all time points included abnormal electrocardiogram (EKG), cardiac dysrhythmias, cerebrovascular disease, non-epithelial skin cancer, depression, and hearing loss. Atrial fibrillation, occlusion of cerebral arteries, essential tremor, and abnormal reflexes were not significant until 1 year before dementia diagnosis. INTERPRETATION: These findings suggest that some health conditions are associated with future dementia beginning at least 5 years before dementia diagnosis and are consistently seen over time, while others only reach significance closer to the date of diagnosis. These results also show that there are both shared and distinctive health conditions associated with AD and VaD. These results reinforce the need for medical intervention and treatment to lessen the impact of health comorbidities in the aging population. ANN NEUROL 2023;93:805-818.
Assuntos
Doença de Alzheimer , Transtornos Cerebrovasculares , Demência Vascular , Masculino , Humanos , Idoso , Doença de Alzheimer/complicações , Doença de Alzheimer/epidemiologia , Doença de Alzheimer/diagnóstico , Demência Vascular/complicações , Demência Vascular/epidemiologia , Estudos Longitudinais , Transtornos Cerebrovasculares/epidemiologia , ComorbidadeRESUMO
Characterizing relationships between gray matter (GM) and white matter (WM) in early Alzheimer's disease (AD) would improve understanding of how and when AD impacts the brain. However, modeling these relationships across brain regions and longitudinally remains a challenge. Thus, we propose extending joint independent component analysis (jICA) into spatiotemporal modeling of regional cortical thickness and WM bundle volumes leveraging multimodal MRI. We jointly characterize these GM and WM features in a normal aging (n=316) and an age- and sex-matched preclinical AD cohort (n=81) at each of two imaging sessions spaced three years apart, training on the normal aging population in cross-validation and interrogating the preclinical AD cohort. We find this joint model identifies reproducible, longitudinal changes in GM and WM between the two imaging sessions and that these changes are associated with preclinical AD and are plausible considering the literature. We compare this joint model to two focused models: (1) GM features at the first session and WM at the second and (2) vice versa. The joint model identifies components that correlate poorly with those from the focused models, suggesting the different models resolve different patterns. We find the strength of association with preclinical AD is improved in the GM to WM model, which supports the hypothesis that medial temporal and frontal thinning precedes volume loss in the uncinate fasciculus and inferior anterior-posterior association fibers. These results suggest that jICA effectively generates spatiotemporal hypotheses about GM and WM in preclinical AD, especially when specific intermodality relationships are considered a priori.
RESUMO
Diffusion weighted MRI (DW-MRI) harmonization is necessary for multi-site or multi-acquisition studies. Current statistical methods address the need to harmonize from one site to another, but do not simultaneously consider the use of multiple datasets which are comprised of multiple sites, acquisitions protocols, and age demographics. This work explores deep learning methods which can generalize across these variations through semi-supervised and unsupervised learning while also learning to estimate multi-shell data from single-shell data using the Multi-shell Diffusion MRI Harmonization Challenge (MUSHAC) and Baltimore Longitudinal Study on Aging (BLSA) datasets. We compare disentanglement harmonization models, which seek to encode anatomy and acquisition in separate latent spaces, and a CycleGAN harmonization model, which uses generative adversarial networks (GAN) to perform style transfer between sites, to the baseline preprocessing and to SHORE interpolation. We find that the disentanglement models achieve superior performance in harmonizing all data while at the same transforming the input data to a single target space across several diffusion metrics (fractional anisotropy, mean diffusivity, mean kurtosis, primary eigenvector).