Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 47
Filtrar
1.
ArXiv ; 2024 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-39253635

RESUMO

$\textbf{Purpose:}$ To develop a new method for free-breathing 3D extracellular volume (ECV) mapping of the whole heart at 3T. $\textbf{Methods:}$ A free-breathing 3D cardiac ECV mapping method was developed at 3T. T1 mapping was performed before and after contrast agent injection using a free-breathing ECG-gated inversion-recovery sequence with spoiled gradient echo readout. A linear tangent space alignment (LTSA) model-based method was used to reconstruct high-frame-rate dynamic images from (k,t)-space data sparsely sampled along a random stack-of-stars trajectory. Joint T1 and transmit B1 estimation was performed voxel-by-voxel for pre- and post-contrast T1 mapping. To account for the time-varying T1 after contrast agent injection, a linearly time-varying T1 model was introduced for post-contrast T1 mapping. ECV maps were generated by aligning pre- and post-contrast T1 maps through affine transformation. $\textbf{Results:}$ The feasibility of the proposed method was demonstrated using in vivo studies with six healthy volunteers at 3T. We obtained 3D ECV maps at a spatial resolution of 1.9$\times$1.9$\times$4.5 $mm^{3}$ and a FOV of 308$\times$308$\times$144 $mm^{3}$, with a scan time of 10.1$\pm$1.4 and 10.6$\pm$1.6 min before and after contrast agent injection, respectively. The ECV maps and the pre- and post-contrast T1 maps obtained by the proposed method were in good agreement with the 2D MOLLI method both qualitatively and quantitatively. $\textbf{Conclusion:}$ The proposed method allows for free-breathing 3D ECV mapping of the whole heart within a practically feasible imaging time. The estimated ECV values from the proposed method were comparable to those from the existing method. $\textbf{Keywords:}$ cardiac extracellular volume (ECV) mapping, cardiac T1 mapping, linear tangent space alignment (LTSA), manifold learning.

2.
JASA Express Lett ; 4(9)2024 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-39240196

RESUMO

The human tongue exhibits an orchestrated arrangement of internal muscles, working in sequential order to execute tongue movements. Understanding the muscle coordination patterns involved in tongue protrusive motion is crucial for advancing knowledge of tongue structure and function. To achieve this, this work focuses on five muscles known to contribute to protrusive motion. Tagged and diffusion MRI data are collected for analysis of muscle fiber geometry and motion patterns. Lagrangian strain measurements are derived, and Granger causal analysis is carried out to assess predictive information among the muscles. Experimental results suggest sequential muscle coordination of protrusive motion among distinct muscle groups.


Assuntos
Imagem de Difusão por Ressonância Magnética , Língua , Humanos , Imagem de Difusão por Ressonância Magnética/métodos , Língua/fisiologia , Língua/diagnóstico por imagem , Movimento/fisiologia , Músculo Esquelético/fisiologia , Músculo Esquelético/diagnóstico por imagem , Adulto
3.
Med Phys ; 2024 Aug 23.
Artigo em Inglês | MEDLINE | ID: mdl-39177300

RESUMO

A National Institutes of Health (NIH) and U.S. Department of Energy (DOE) Office of Science virtual workshop on shared general topics was held in July of 2021 and reported on in this publication in January of 2023. Following the inaugural 2021 joint meeting representatives from the DOE Office of Science and NIH met to discuss organizing a second joint workshop that would concentrate on radiation detection to bring together teams from both agencies and their grantee populations to stimulate collaboration and efficiency. To meet this scientific mission within the NIH and DOE radiation detection space, the organizers assembled workshop sessions covering the state-of-the-art in cameras, detectors, and sensors for radiation external and internal (diagnostic and therapeutic) to human, data acquisition and electronics, image reconstruction and processing, and the application of artificial intelligence. NIH and DOE are committed to continuing the process of convening a joint workshop every 12-24 months. This Special Report recaps the findings of this second workshop. Beyond showing only the innovations and areas of success, important gaps in our knowledge were defined and presented. We summarize by defining four areas of greatest opportunity and need that emerged from the unique, dynamic dialogue the in-person workshop provided the attendees.

4.
ArXiv ; 2024 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-39130200

RESUMO

Delineating lesions and anatomical structure is important for image-guided interventions. Point-supervised medical image segmentation (PSS) has great potential to alleviate costly expert delineation labeling. However, due to the lack of precise size and boundary guidance, the effectiveness of PSS often falls short of expectations. Although recent vision foundational models, such as the medical segment anything model (MedSAM), have made significant advancements in bounding-box-prompted segmentation, it is not straightforward to utilize point annotation, and is prone to semantic ambiguity. In this preliminary study, we introduce an iterative framework to facilitate semantic-aware point-supervised MedSAM. Specifically, the semantic box-prompt generator (SBPG) module has the capacity to convert the point input into potential pseudo bounding box suggestions, which are explicitly refined by the prototype-based semantic similarity. This is then succeeded by a prompt-guided spatial refinement (PGSR) module that harnesses the exceptional generalizability of MedSAM to infer the segmentation mask, which also updates the box proposal seed in SBPG. Performance can be progressively improved with adequate iterations. We conducted an evaluation on BraTS2018 for the segmentation of whole brain tumors and demonstrated its superior performance compared to traditional PSS methods and on par with box-supervised methods.

5.
Alzheimers Dement ; 2024 Jul 22.
Artigo em Inglês | MEDLINE | ID: mdl-39039896

RESUMO

INTRODUCTION: Understanding early neuropathological changes and their associations with cognition may aid dementia prevention. This study investigated associations of cerebral amyloid and tau positron emission tomography (PET) retention with cognition in a predominately middle-aged community-based cohort and examined factors that may modify these relationships. METHODS: 11C-Pittsburgh compound B amyloid and 18F-flortaucipir tau PET imaging were performed. Associations of amyloid and tau PET with cognition were evaluated using linear regression. Interactions with age, apolipoprotein E (APOE) ε4 status, and education were examined. RESULTS: Amyloid and tau PET were not associated with cognition in the overall sample (N = 423; mean: 57 ± 10 years; 50% female). However, younger age (< 55 years) and APOE ε4 were significant effect modifiers, worsening cognition in the presence of higher amyloid and tau. DISCUSSION: Higher levels of Aß and tau may have a pernicious effect on cognition among APOE ε4 carriers and younger adults, suggesting a potential role for targeted early interventions. HIGHLIGHTS: Risk and resilience factors influenced cognitive vulnerability due to Aß and tau. Higher fusiform tau associated with poorer visuospatial skills in younger adults. APOE ε4 interacted with Aß and tau to worsen cognition across multiple domains.

6.
Artigo em Inglês | MEDLINE | ID: mdl-38009135

RESUMO

Investigating the relationship between internal tissue point motion of the tongue and oropharyngeal muscle deformation measured from tagged MRI and intelligible speech can aid in advancing speech motor control theories and developing novel treatment methods for speech related-disorders. However, elucidating the relationship between these two sources of information is challenging, due in part to the disparity in data structure between spatiotemporal motion fields (i.e., 4D motion fields) and one-dimensional audio waveforms. In this work, we present an efficient encoder-decoder translation network for exploring the predictive information inherent in 4D motion fields via 2D spectrograms as a surrogate of the audio data. Specifically, our encoder is based on 3D convolutional spatial modeling and transformer-based temporal modeling. The extracted features are processed by an asymmetric 2D convolution decoder to generate spectrograms that correspond to 4D motion fields. Furthermore, we incorporate a generative adversarial training approach into our framework to further improve synthesis quality on our generated spectrograms. We experiment on 63 paired motion field sequences and speech waveforms, demonstrating that our framework enables the generation of clear audio waveforms from a sequence of motion fields. Thus, our framework has the potential to improve our understanding of the relationship between these two modalities and inform the development of treatments for speech disorders.

7.
bioRxiv ; 2023 Oct 18.
Artigo em Inglês | MEDLINE | ID: mdl-37732236

RESUMO

Gabapentin, a selective ligand for the α2δ subunit of voltage-dependent calcium channels, is an anticonvulsant medication used in the treatment of neuropathic pain, epilepsy and other neurological conditions. We recently described two radiofluorinated derivatives of gabapentin (trans-4-[18F]fluorogabapentin, [18F]tGBP4F, and cis-4-[18F]fluorogabapentin, [18F]cGBP4F) and showed that these compounds accumulate in the injured nerves in a rodent model of neuropathic pain. Given the use of gabapentin in brain diseases, here we investigate whether these radiofluorinated derivatives of gabapentin can be used for imaging α2δ receptors in the brain. Specifically, we developed automated radiosynthesis methods for [18F]tGBP4F and [18F]cGBP4F and conducted dynamic PET imaging in adult rhesus macaques with and without preadministration of pharmacological doses of gabapentin. Both radiotracers showed very high metabolic stability, negligible plasma protein binding and slow accumulation in the brain. [18F]tGBP4F, the isomer with higher binding affinity, showed low brain uptake and could not be displaced whereas [18F]cGBP4F showed moderate brain uptake and could be partially displaced. Kinetic modeling of brain regional time-activity curves using a metabolite-corrected arterial input function shows that a 1-tissue compartment model accurately fits the data. Graphical analysis using Logan or multilinear analysis 1 produced similar results as compartmental modeling indicating robust quantification. This study advances our understanding of how gabapentinoids work and provides an important advancement towards imaging α2δ receptors in the brain.

8.
bioRxiv ; 2023 Mar 28.
Artigo em Inglês | MEDLINE | ID: mdl-37034655

RESUMO

Purpose: 4-Aminopyridine (4AP) is a medication for the symptomatic treatment of multiple sclerosis. Several 4AP-based PET tracers have been developed for imaging demyelination. In preclinical studies, [ 11 C]3MeO4AP has shown promise due to its high brain permeability, high metabolic stability, high plasma availability, and high in vivo binding affinity. To prepare for the translation to human studies, we developed a cGMP-compliant automated radiosynthesis protocol and evaluated the whole-body biodistribution and radiation dosimetry of [ 11 C]3MeO4AP in non-human primates (NHPs). Methods: Automated radiosynthesis was carried out using a GE TRACERlab FX-C Pro synthesis module. One male and one female adult rhesus macaques were used in the study. A high-resolution CT from cranial vertex to knee was acquired. PET data were collected using a dynamic acquisition protocol with 4 bed positions and 13 passes over a total scan time of ∼150 minutes. Based on the CT and PET images, volumes of interest (VOIs) were manually drawn for selected organs. Non-decay corrected time-activity curves (TACs) were extracted for each VOI. Radiation dosimetry and effective dose were calculated from the integrated TACs using OLINDA software. Results: Fully automated radiosynthesis of [ 11 C]3MeO4AP was achieved with 7.3 ± 1.2 % (n = 4) of non-decay corrected radiochemical yield within 38 min of synthesis and purification time. [ 11 C]3MeO4AP distributed quickly throughout the body and into the brain. The organs with highest dose were the kidneys. The average effective dose of [ 11 C]3MeO4AP was 4.27 ± 0.57 µSv/MBq. No significant changes in vital signs were observed during the scan. Conclusion: The cGMP compliant automated radiosynthesis of [ 11 C]3MeO4AP was developed. The whole-body biodistribution and radiation dosimetry of [ 11 C]3MeO4AP was successfully evaluated in NHPs. [ 11 C]3MeO4AP shows lower average effective dose than [ 18 F]3F4AP and similar average effective dose as other carbon-11 tracers.

11.
Magn Reson Med ; 89(4): 1297-1313, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36404676

RESUMO

PURPOSE: To develop a manifold learning-based method that leverages the intrinsic low-dimensional structure of MR Spectroscopic Imaging (MRSI) signals for joint spectral quantification. METHODS: A linear tangent space alignment (LTSA) model was proposed to represent MRSI signals. In the proposed model, the signals of each metabolite were represented using a subspace model and the local coordinates of the subspaces were aligned to the global coordinates of the underlying low-dimensional manifold via linear transform. With the basis functions of the subspaces predetermined via quantum mechanics simulations, the global coordinates and the matrices for the local-to-global coordinate alignment were estimated by fitting the proposed LTSA model to noisy MRSI data with a spatial smoothness constraint on the global coordinates and a sparsity constraint on the matrices. RESULTS: The performance of the proposed method was validated using numerical simulation data and in vivo proton-MRSI experimental data acquired on healthy volunteers at 3T. The results of the proposed method were compared with the QUEST method and the subspace-based method. In all the compared cases, the proposed method achieved superior performance over the QUEST and the subspace-based methods both qualitatively in terms of noise and artifacts in the estimated metabolite concentration maps, and quantitatively in terms of spectral quantification accuracy measured by normalized root mean square errors. CONCLUSION: Joint spectral quantification using linear tangent space alignment-based manifold learning improves the accuracy of MRSI spectral quantification.


Assuntos
Algoritmos , Imageamento por Ressonância Magnética , Humanos , Espectroscopia de Ressonância Magnética/métodos , Imageamento por Ressonância Magnética/métodos , Espectroscopia de Prótons por Ressonância Magnética/métodos , Simulação por Computador , Encéfalo/diagnóstico por imagem , Encéfalo/metabolismo
12.
Med Phys ; 50(3): 1539-1548, 2023 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-36331429

RESUMO

BACKGROUND: In medical imaging, images are usually treated as deterministic, while their uncertainties are largely underexplored. PURPOSE: This work aims at using deep learning to efficiently estimate posterior distributions of imaging parameters, which in turn can be used to derive the most probable parameters as well as their uncertainties. METHODS: Our deep learning-based approaches are based on a variational Bayesian inference framework, which is implemented using two different deep neural networks based on conditional variational auto-encoder (CVAE), CVAE-dual-encoder, and CVAE-dual-decoder. The conventional CVAE framework, that is, CVAE-vanilla, can be regarded as a simplified case of these two neural networks. We applied these approaches to a simulation study of dynamic brain PET imaging using a reference region-based kinetic model. RESULTS: In the simulation study, we estimated posterior distributions of PET kinetic parameters given a measurement of the time-activity curve. Our proposed CVAE-dual-encoder and CVAE-dual-decoder yield results that are in good agreement with the asymptotically unbiased posterior distributions sampled by Markov Chain Monte Carlo (MCMC). The CVAE-vanilla can also be used for estimating posterior distributions, although it has an inferior performance to both CVAE-dual-encoder and CVAE-dual-decoder. CONCLUSIONS: We have evaluated the performance of our deep learning approaches for estimating posterior distributions in dynamic brain PET. Our deep learning approaches yield posterior distributions, which are in good agreement with unbiased distributions estimated by MCMC. All these neural networks have different characteristics and can be chosen by the user for specific applications. The proposed methods are general and can be adapted to other problems.


Assuntos
Aprendizado Profundo , Teorema de Bayes , Tomografia por Emissão de Pósitrons/métodos , Simulação por Computador , Redes Neurais de Computação
13.
J Cereb Blood Flow Metab ; 43(2): 296-308, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36172629

RESUMO

Metabotropic glutamate receptor 2 (mGluR2) has been extensively studied for the treatment of various neurological and psychiatric disorders. Understanding of the mGluR2 function is pivotal in supporting the drug discovery targeting mGluR2. Herein, the positive allosteric modulation of mGluR2 was investigated via the in vivo positron emission tomography (PET) imaging using 2-((4-(2-[11C]methoxy-4-(trifluoromethyl)phenyl)piperidin-1-yl)methyl)-1-methyl-1H-imidazo[4,5-b]pyridine ([11C]mG2P001). Distinct from the orthosteric compounds, pretreatment with the unlabeled mG2P001, a potent mGluR2 positive allosteric modulator (PAM), resulted in a significant increase instead of decrease of the [11C]mG2P001 accumulation in rat brain detected by PET imaging. Subsequent in vitro studies with [3H]mG2P001 revealed the cooperative binding mechanism of mG2P001 with glutamate and its pharmacological effect that contributed to the enhanced binding of [3H]mG2P001 in transfected CHO cells expressing mGluR2. The in vivo PET imaging and quantitative analysis of [11C]mG2P001 in non-human primates (NHPs) further validated the characteristics of [11C]mG2P001 as an imaging ligand for mGluR2. Self-blocking studies in primates enhanced accumulation of [11C]mG2P001. Altogether, these studies show that [11C]mG2P001 is a sensitive biomarker for mGluR2 expression and the binding is affected by the tissue glutamate concentration.


Assuntos
Receptores de Glutamato Metabotrópico , Ratos , Cricetinae , Animais , Ratos Sprague-Dawley , Cricetulus , Tomografia por Emissão de Pósitrons
14.
IEEE Trans Biomed Eng ; 70(4): 1252-1263, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36227815

RESUMO

Deep learning (DL)-based automatic sleep staging approaches have attracted much attention recently due in part to their outstanding accuracy. At the testing stage, however, the performance of these approaches is likely to be degraded, when applied in different testing environments, because of the problem of domain shift. This is because while a pre-trained model is typically trained on noise-free electroencephalogram (EEG) signals acquired from accurate medical equipment, deployment is carried out on consumer-level devices with undesirable noise. To alleviate this challenge, in this work, we propose an efficient training approach that is robust against unseen arbitrary noise. In particular, we propose to generate the worst-case input perturbations by means of adversarial transformation in an auxiliary model, to learn a wide range of input perturbations and thereby to improve reliability. Our approach is based on two separate training models: (i) an auxiliary model to generate adversarial noise and (ii) a target network to incorporate the noise signal to enhance robustness. Furthermore, we exploit novel class-wise robustness during the training of the target network to represent different robustness patterns of each sleep stage. Our experimental results demonstrated that our approach improved sleep staging performance on healthy controls, in the presence of moderate to severe noise levels, compared with competing methods. Our approach was able to effectively train and deploy a DL model to handle different types of noise, including adversarial, Gaussian, and shot noise.


Assuntos
Eletroencefalografia , Fases do Sono , Reprodutibilidade dos Testes , Distribuição Normal
15.
IEEE Trans Med Imaging ; 42(1): 158-169, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36121938

RESUMO

The spatial resolution and temporal frame-rate of dynamic magnetic resonance imaging (MRI) can be improved by reconstructing images from sparsely sampled k -space data with mathematical modeling of the underlying spatiotemporal signals. These models include sparsity models, linear subspace models, and non-linear manifold models. This work presents a novel linear tangent space alignment (LTSA) model-based framework that exploits the intrinsic low-dimensional manifold structure of dynamic images for accelerated dynamic MRI. The performance of the proposed method was evaluated and compared to state-of-the-art methods using numerical simulation studies as well as 2D and 3D in vivo cardiac imaging experiments. The proposed method achieved the best performance in image reconstruction among all the compared methods. The proposed method could prove useful for accelerating many MRI applications, including dynamic MRI, multi-parametric MRI, and MR spectroscopic imaging.


Assuntos
Algoritmos , Imageamento por Ressonância Magnética , Imageamento por Ressonância Magnética/métodos , Processamento de Imagem Assistida por Computador/métodos , Simulação por Computador , Modelos Teóricos
16.
J Cereb Blood Flow Metab ; 43(4): 581-594, 2023 04.
Artigo em Inglês | MEDLINE | ID: mdl-36420769

RESUMO

[18F]MK-6240 meningeal/extracerebral off-target binding may impact tau quantification. We examined the kinetics and longitudinal changes of extracerebral and reference regions. [18F]MK-6240 PET was performed in 24 cognitively-normal and eight cognitively-impaired subjects, with arterial samples in 13 subjects. Follow-up scans at 6.1 ± 0.5 (n = 25) and 13.3 ± 0.9 (n = 16) months were acquired. Extracerebral and reference region (cerebellar gray matter (CerGM)-based, cerebral white matter (WM), pons) uptake were evaluated using standardized uptake values (SUV90-110), spectral analysis, and distribution volume. Longitudinal changes in SUV90-110 were examined. The impact of reference region on target region outcomes, partial volume correction (PVC) and regional erosion were evaluated. Eroded WM and pons showed lower variability, lower extracerebral contamination, and lower longitudinal changes than CerGM-based regions. CerGM-based regions resulted larger cross-sectional effect sizes for group differentiation. Extracerebral signal was high in 50% of subjects and exhibited irreversible kinetics and nonsignificant longitudinal changes over one-year but was highly variable at subject-level. PVC resulted in higher variability in reference region uptake and longitudinal changes. Our results suggest that eroded CerGM may be preferred for cross-sectional, whilst eroded WM or pons may be preferred for longitudinal [18F]MK-6240 studies. For CerGM, erosion was necessary (preferred over PVC) to address the heterogenous nature of extracerebral signal.


Assuntos
Disfunção Cognitiva , Humanos , Estudos Transversais , Cinética , Tomografia por Emissão de Pósitrons/métodos , Estudos de Casos e Controles
17.
Sci Rep ; 12(1): 18118, 2022 10 27.
Artigo em Inglês | MEDLINE | ID: mdl-36302815

RESUMO

Thus far, there have been no reported specific rules for systematically determining the appropriate augmented sample size to optimize model performance when conducting data augmentation. In this paper, we report on the feasibility of synthetic data augmentation using generative adversarial networks (GAN) by proposing an automation pipeline to find the optimal multiple of data augmentation to achieve the best deep learning-based diagnostic performance in a limited dataset. We used Waters' view radiographs for patients diagnosed with chronic sinusitis to demonstrate the method developed herein. We demonstrate that our approach produces significantly better diagnostic performance parameters than models trained using conventional data augmentation. The deep learning method proposed in this study could be implemented to assist radiologists in improving their diagnosis. Researchers and industry workers could overcome the lack of training data by employing our proposed automation pipeline approach in GAN-based synthetic data augmentation. This is anticipated to provide new means to overcome the shortage of graphic data for algorithm training.


Assuntos
Aprendizado Profundo , Humanos , Algoritmos , Radiografia , Automação
18.
Artigo em Inglês | MEDLINE | ID: mdl-36303579

RESUMO

Lesions or organ boundaries visible through medical imaging data are often ambiguous, thus resulting in significant variations in multi-reader delineations, i.e., the source of aleatoric uncertainty. In particular, quantifying the inter-observer variability of manual annotations with Magnetic Resonance (MR) Imaging data plays a crucial role in establishing a reference standard for various diagnosis and treatment tasks. Most segmentation methods, however, simply model a mapping from an image to its single segmentation map and do not take the disagreement of annotators into consideration. In order to account for inter-observer variability, without sacrificing accuracy, we propose a novel variational inference framework to model the distribution of plausible segmentation maps, given a specific MR image, which explicitly represents the multi-reader variability. Specifically, we resort to a latent vector to encode the multi-reader variability and counteract the inherent information loss in the imaging data. Then, we apply a variational autoencoder network and optimize its evidence lower bound (ELBO) to efficiently approximate the distribution of the segmentation map, given an MR image. Experimental results, carried out with the QUBIQ brain growth MRI segmentation datasets with seven annotators, demonstrate the effectiveness of our approach.

19.
Artigo em Inglês | MEDLINE | ID: mdl-35990931

RESUMO

Unsupervised domain adaptation (UDA) between two significantly disparate domains to learn high-level semantic alignment is a crucial yet challenging task. To this end, in this work, we propose exploiting low-level edge information to facilitate the adaptation as a precursor task, which has a small cross-domain gap, compared with semantic segmentation. The precise contour then provides spatial information to guide the semantic adaptation. More specifically, we propose a multi-task framework to learn a contouring adaptation network along with a semantic segmentation adaptation network, which takes both magnetic resonance imaging (MRI) slice and its initial edge map as input. These two networks are jointly trained with source domain labels, and the feature and edge map level adversarial learning is carried out for cross-domain alignment. In addition, self-entropy minimization is incorporated to further enhance segmentation performance. We evaluated our framework on the BraTS2018 database for cross-modality segmentation of brain tumors, showing the validity and superiority of our approach, compared with competing methods.

20.
Artigo em Inglês | MEDLINE | ID: mdl-35895653

RESUMO

Unsupervised domain adaptation (UDA) has been successfully applied to transfer knowledge from a labeled source domain to target domains without their labels. Recently introduced transferable prototypical networks (TPNs) further address class-wise conditional alignment. In TPN, while the closeness of class centers between source and target domains is explicitly enforced in a latent space, the underlying fine-grained subtype structure and the cross-domain within-class compactness have not been fully investigated. To counter this, we propose a new approach to adaptively perform a fine-grained subtype-aware alignment to improve the performance in the target domain without the subtype label in both domains. The insight of our approach is that the unlabeled subtypes in a class have the local proximity within a subtype while exhibiting disparate characteristics because of different conditional and label shifts. Specifically, we propose to simultaneously enforce subtype-wise compactness and class-wise separation, by utilizing intermediate pseudo-labels. In addition, we systematically investigate various scenarios with and without prior knowledge of subtype numbers and propose to exploit the underlying subtype structure. Furthermore, a dynamic queue framework is developed to evolve the subtype cluster centroids steadily using an alternative processing scheme. Experimental results, carried out with multiview congenital heart disease data and VisDA and DomainNet, show the effectiveness and validity of our subtype-aware UDA, compared with state-of-the-art UDA methods.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA