Search | Nursing VHL Search Portal

1.

Integrated lithium niobate microwave photonic processing engine.

Feng, Hanke; Ge, Tong; Guo, Xiaoqing; Wang, Benshan; Zhang, Yiwen; Chen, Zhaoxi; Zhu, Sha; Zhang, Ke; Sun, Wenzhao; Huang, Chaoran; Yuan, Yixuan; Wang, Cheng.

Nature ; 627(8002): 80-87, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38418888

ABSTRACT

Integrated microwave photonics (MWP) is an intriguing technology for the generation, transmission and manipulation of microwave signals in chip-scale optical systems1,2. In particular, ultrafast processing of analogue signals in the optical domain with high fidelity and low latency could enable a variety of applications such as MWP filters3-5, microwave signal processing6-9 and image recognition10,11. An ideal integrated MWP processing platform should have both an efficient and high-speed electro-optic modulation block to faithfully perform microwave-optic conversion at low power and also a low-loss functional photonic network to implement various signal-processing tasks. Moreover, large-scale, low-cost manufacturability is required to monolithically integrate the two building blocks on the same chip. Here we demonstrate such an integrated MWP processing engine based on a 4 inch wafer-scale thin-film lithium niobate platform. It can perform multipurpose tasks with processing bandwidths of up to 67 GHz at complementary metal-oxide-semiconductor (CMOS)-compatible voltages. We achieve ultrafast analogue computation, namely temporal integration and differentiation, at sampling rates of up to 256 giga samples per second, and deploy these functions to showcase three proof-of-concept applications: solving ordinary differential equations, generating ultra-wideband signals and detecting edges in images. We further leverage the image edge detector to realize a photonic-assisted image segmentation model that can effectively outline the boundaries of melanoma lesion in medical diagnostic images. Our ultrafast lithium niobate MWP engine could provide compact, low-latency and cost-effective solutions for future wireless communications, high-resolution radar and photonic artificial intelligence.

Subject(s)

Microwaves , Niobium , Optics and Photonics , Oxides , Photons , Artificial Intelligence , Diagnostic Imaging/instrumentation , Diagnostic Imaging/methods , Melanoma/diagnostic imaging , Melanoma/pathology , Optics and Photonics/instrumentation , Optics and Photonics/methods , Radar , Wireless Technology , Humans

2.

Quorum sensing signal synthases enhance Vibrio parahaemolyticus swarming motility.

Liu, Fuwen; Wang, Fei; Yuan, Yixuan; Li, Xiaoran; Zhong, Xiaojun; Yang, Menghua.

Mol Microbiol ; 120(2): 241-257, 2023 08.

Article in English | MEDLINE | ID: mdl-37330634

ABSTRACT

Vibrio parahaemolyticus is a significant food-borne pathogen that is found in diverse aquatic habitats. Quorum sensing (QS), a signaling system for cell-cell communication, plays an important role in V. parahaemolyticus persistence. We characterized the function of three V. parahaemolyticus QS signal synthases, CqsAvp , LuxMvp , and LuxSvp , and show that they are essential to activate QS and regulate swarming. We found that CqsAvp , LuxMvp , and LuxSvp activate a QS bioluminescence reporter through OpaR. However, V. parahaemolyticus exhibits swarming defects in the absence of CqsAvp , LuxMvp , and LuxSvp , but not OpaR. The swarming defect of this synthase mutant (termed Δ3AI) was recovered by overexpressing either LuxOvp D47A , a mimic of dephosphorylated LuxOvp mutant, or the scrABC operon. CqsAvp , LuxMvp , and LuxSvp inhibit lateral flagellar (laf) gene expression by inhibiting the phosphorylation of LuxOvp and the expression of scrABC. Phosphorylated LuxOvp enhances laf gene expression in a mechanism that involves modulating c-di-GMP levels. However, enhancing swarming requires phosphorylated and dephosphorylated LuxOvp which is regulated by the QS signals that are synthesized by CqsAvp , LuxMvp , and LuxSvp . The data presented here suggest an important strategy of swarming regulation by the integration of QS and c-di-GMP signaling pathways in V. parahaemolyticus.

Subject(s)

Quorum Sensing , Vibrio parahaemolyticus , Quorum Sensing/genetics , Vibrio parahaemolyticus/physiology , Gene Expression Regulation, Bacterial , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Signal Transduction

3.

DICCCOL: dense individualized and common connectivity-based cortical landmarks.

Zhu, Dajiang; Li, Kaiming; Guo, Lei; Jiang, Xi; Zhang, Tuo; Zhang, Degang; Chen, Hanbo; Deng, Fan; Faraco, Carlos; Jin, Changfeng; Wee, Chong-Yaw; Yuan, Yixuan; Lv, Peili; Yin, Yan; Hu, Xiaolei; Duan, Lian; Hu, Xintao; Han, Junwei; Wang, Lihong; Shen, Dinggang; Miller, L Stephen; Li, Lingjiang; Liu, Tianming.

Cereb Cortex ; 23(4): 786-800, 2013 Apr.

Article in English | MEDLINE | ID: mdl-22490548

ABSTRACT

Is there a common structural and functional cortical architecture that can be quantitatively encoded and precisely reproduced across individuals and populations? This question is still largely unanswered due to the vast complexity, variability, and nonlinearity of the cerebral cortex. Here, we hypothesize that the common cortical architecture can be effectively represented by group-wise consistent structural fiber connections and take a novel data-driven approach to explore the cortical architecture. We report a dense and consistent map of 358 cortical landmarks, named Dense Individualized and Common Connectivity-based Cortical Landmarks (DICCCOLs). Each DICCCOL is defined by group-wise consistent white-matter fiber connection patterns derived from diffusion tensor imaging (DTI) data. Our results have shown that these 358 landmarks are remarkably reproducible over more than one hundred human brains and possess accurate intrinsically established structural and functional cross-subject correspondences validated by large-scale functional magnetic resonance imaging data. In particular, these 358 cortical landmarks can be accurately and efficiently predicted in a new single brain with DTI data. Thus, this set of 358 DICCCOL landmarks comprehensively encodes the common structural and functional cortical architectures, providing opportunities for many applications in brain science including mapping human brain connectomes, as demonstrated in this work.

Subject(s)

Brain Mapping , Cerebral Cortex/physiology , Nerve Fibers, Myelinated/physiology , Neural Pathways/physiology , Adolescent , Adult , Age Factors , Aged , Algorithms , Attention/physiology , Cerebral Cortex/anatomy & histology , Cerebral Cortex/blood supply , Diffusion Magnetic Resonance Imaging , Emotions/physiology , Female , Humans , Image Processing, Computer-Assisted , Magnetic Resonance Imaging , Male , Semantics

4.

MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model.

Wang, Pengyu; Zhang, Huaqi; Yuan, Yixuan.

IEEE Trans Med Imaging ; PP2024 Jun 24.

Article in English | MEDLINE | ID: mdl-38913527

ABSTRACT

Multi-modal prompt learning is a high-performance and cost-effective learning paradigm, which learns text as well as image prompts to tune pre-trained vision-language (V-L) models like CLIP for adapting multiple downstream tasks. However, recent methods typically treat text and image prompts as independent components without considering the dependency between prompts. Moreover, extending multi-modal prompt learning into the medical field poses challenges due to a significant gap between general- and medical-domain data. To this end, we propose a Multi-modal Collaborative Prompt Learning (MCPL) pipeline to tune a frozen V-L model for aligning medical text-image representations, thereby achieving medical downstream tasks. We first construct the anatomy-pathology (AP) prompt for multi-modal prompting jointly with text and image prompts. The AP prompt introduces instance-level anatomy and pathology information, thereby making a V-L model better comprehend medical reports and images. Next, we propose graph-guided prompt collaboration module (GPCM), which explicitly establishes multi-way couplings between the AP, text, and image prompts, enabling collaborative multi-modal prompt producing and updating for more effective prompting. Finally, we develop a novel prompt configuration scheme, which attaches the AP prompt to the query and key, and the text/image prompt to the value in self-attention layers for improving the interpretability of multi-modal prompts. Extensive experiments on numerous medical classification and object detection datasets show that the proposed pipeline achieves excellent effectiveness and generalization. Compared with state-of-the-art prompt learning methods, MCPL provides a more reliable multi-modal prompt paradigm for reducing tuning costs of V-L models on medical downstream tasks. Our code: https://github.com/CUHK-AIM-Group/MCPL.

5.

FedOSS: Federated Open Set Recognition via Inter-Client Discrepancy and Collaboration.

Zhu, Meilu; Liao, Jing; Liu, Jun; Yuan, Yixuan.

IEEE Trans Med Imaging ; 43(1): 190-202, 2024 Jan.

Article in English | MEDLINE | ID: mdl-37428659

ABSTRACT

Open set recognition (OSR) aims to accurately classify known diseases and recognize unseen diseases as the unknown class in medical scenarios. However, in existing OSR approaches, gathering data from distributed sites to construct large-scale centralized training datasets usually leads to high privacy and security risk, which could be alleviated elegantly via the popular cross-site training paradigm, federated learning (FL). To this end, we represent the first effort to formulate federated open set recognition (FedOSR), and meanwhile propose a novel Federated Open Set Synthesis (FedOSS) framework to address the core challenge of FedOSR: the unavailability of unknown samples for all anticipated clients during the training phase. The proposed FedOSS framework mainly leverages two modules, i.e., Discrete Unknown Sample Synthesis (DUSS) and Federated Open Space Sampling (FOSS), to generate virtual unknown samples for learning decision boundaries between known and unknown classes. Specifically, DUSS exploits inter-client knowledge inconsistency to recognize known samples near decision boundaries and then pushes them beyond decision boundaries to synthesize discrete virtual unknown samples. FOSS unites these generated unknown samples from different clients to estimate the class-conditional distributions of open data space near decision boundaries and further samples open data, thereby improving the diversity of virtual unknown samples. Additionally, we conduct comprehensive ablation experiments to verify the effectiveness of DUSS and FOSS. FedOSS shows superior performance on public medical datasets in comparison with state-of-the-art approaches. The source code is available at https://github.com/CityU-AIM-Group/FedOSS.

Subject(s)

Machine Learning , Software , Humans , Disease

6.

Causal Disentanglement Domain Generalization for time-series signal fault diagnosis.

Jia, Linshan; Chow, Tommy W S; Yuan, Yixuan.

Neural Netw ; 172: 106099, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38237445

ABSTRACT

Domain generalization-based fault diagnosis (DGFD) presents significant prospects for recognizing faults without the accessibility of the target domain. Previous DGFD methods have achieved significant progress; however, there are some limitations. First, most DGFG methods statistically model the dependence between time-series data and labels, and they are superficial descriptions to the actual data-generating process. Second, most of the existing DGFD methods are only verified on vibrational time-series datasets, which is insufficient to show the potential of domain generalization in the fault diagnosis area. In response to the above issues, this paper first proposes a DGFD method named Causal Disentanglement Domain Generalization (CDDG), which can reestablish the data-generating process by disentangling time-series data into the causal factors (fault-related representation) and no-casual factors (domain-related representation) with a structural causal model. Specifically, in CDDG, causal aggregation loss is designed to separate the unobservable causal and non-causal factors. Meanwhile, the reconstruction loss is proposed to ensure the information completeness of the disentangled factors. We also introduce a redundancy reduction loss to learn efficient features. The proposed CDDG is verified on five cross-machine vibrational fault diagnosis cases and three cross-environment acoustical anomaly detection cases by comparing it with eight state-of-the-art (SOTA) DGFD methods. We publicize the open-source time-series DGFD Benchmark containing CDDG and the eight SOTA methods. The code repository will be available at https://github.com/ShaneSpace/DGFDBenchmark.

Subject(s)

Generalization, Psychological , Learning , Acoustics , Benchmarking , Causality

7.

Disentangle Then Calibrate With Gradient Guidance: A Unified Framework for Common and Rare Disease Diagnosis.

Chen, Yuanyuan; Guo, Xiaoqing; Xia, Yong; Yuan, Yixuan.

IEEE Trans Med Imaging ; 43(5): 1816-1827, 2024 May.

Article in English | MEDLINE | ID: mdl-38165794

ABSTRACT

The computer-aided diagnosis (CAD) for rare diseases using medical imaging poses a significant challenge due to the requirement of large volumes of labeled training data, which is particularly difficult to collect for rare diseases. Although Few-shot learning (FSL) methods have been developed for this task, these methods focus solely on rare disease diagnosis, failing to preserve the performance in common disease diagnosis. To address this issue, we propose the Disentangle then Calibrate with Gradient Guidance (DCGG) framework under the setting of generalized few-shot learning, i.e., using one model to diagnose both common and rare diseases. The DCGG framework consists of a network backbone, a gradient-guided network disentanglement (GND) module, and a gradient-induced feature calibration (GFC) module. The GND module disentangles the network into a disease-shared component and a disease-specific component based on gradient guidance, and devises independent optimization strategies for both components, respectively, when learning from rare diseases. The GFC module transfers only the disease-shared channels of common-disease features to rare diseases, and incorporates the optimal transport theory to identify the best transport scheme based on the semantic relationship among different diseases. Based on the best transport scheme, the GFC module calibrates the distribution of rare-disease features at the disease-shared channels, deriving more informative rare-disease features for better diagnosis. The proposed DCGG framework has been evaluated on three public medical image classification datasets. Our results suggest that the DCGG framework achieves state-of-the-art performance in diagnosing both common and rare diseases.

Subject(s)

Algorithms , Image Interpretation, Computer-Assisted , Rare Diseases , Humans , Rare Diseases/diagnostic imaging , Image Interpretation, Computer-Assisted/methods , Databases, Factual , Magnetic Resonance Imaging/methods , Machine Learning

8.

Comprehensive learning and adaptive teaching: Distilling multi-modal knowledge for pathological glioma grading.

Xing, Xiaohan; Zhu, Meilu; Chen, Zhen; Yuan, Yixuan.

Med Image Anal ; 91: 102990, 2024 Jan.

Article in English | MEDLINE | ID: mdl-37864912

ABSTRACT

The fusion of multi-modal data, e.g., pathology slides and genomic profiles, can provide complementary information and benefit glioma grading. However, genomic profiles are difficult to obtain due to the high costs and technical challenges, thus limiting the clinical applications of multi-modal diagnosis. In this work, we investigate the realistic problem where paired pathology-genomic data are available during training, while only pathology slides are accessible for inference. To solve this problem, a comprehensive learning and adaptive teaching framework is proposed to improve the performance of pathological grading models by transferring the privileged knowledge from the multi-modal teacher to the pathology student. For comprehensive learning of the multi-modal teacher, we propose a novel Saliency-Aware Masking (SA-Mask) strategy to explore richer disease-related features from both modalities by masking the most salient features. For adaptive teaching of the pathology student, we first devise a Local Topology Preserving and Discrepancy Eliminating Contrastive Distillation (TDC-Distill) module to align the feature distributions of the teacher and student models. Furthermore, considering the multi-modal teacher may include incorrect information, we propose a Gradient-guided Knowledge Refinement (GK-Refine) module that builds a knowledge bank and adaptively absorbs the reliable knowledge according to their agreement in the gradient space. Experiments on the TCGA GBM-LGG dataset show that our proposed distillation framework improves the pathological glioma grading and outperforms other KD methods. Notably, with the sole pathology slides, our method achieves comparable performance with existing multi-modal methods. The code is available at https://github.com/CUHK-AIM-Group/MultiModal-learning.

Subject(s)

Glioma , Learning , Humans

9.

STAR-RL: Spatial-temporal Hierarchical Reinforcement Learning for Interpretable Pathology Image Super-Resolution.

Chen, Wenting; Liu, Jie; Chow, Tommy W S; Yuan, Yixuan.

IEEE Trans Med Imaging ; PP2024 Jun 27.

Article in English | MEDLINE | ID: mdl-38935476

ABSTRACT

Pathology image are essential for accurately interpreting lesion cells in cytopathology screening, but acquiring high-resolution digital slides requires specialized equipment and long scanning times. Though super-resolution (SR) techniques can alleviate this problem, existing deep learning models recover pathology image in a black-box manner, which can lead to untruthful biological details and misdiagnosis. Additionally, current methods allocate the same computational resources to recover each pixel of pathology image, leading to the sub-optimal recovery issue due to the large variation of pathology image. In this paper, we propose the first hierarchical reinforcement learning framework named Spatial-Temporal hierARchical Reinforcement Learning (STAR-RL), mainly for addressing the aforementioned issues in pathology image super-resolution problem. We reformulate the SR problem as a Markov decision process of interpretable operations and adopt the hierarchical recovery mechanism in patch level, to avoid sub-optimal recovery. Specifically, the higher-level spatial manager is proposed to pick out the most corrupted patch for the lower-level patch worker. Moreover, the higher-level temporal manager is advanced to evaluate the selected patch and determine whether the optimization should be stopped earlier, thereby avoiding the over-processed problem. Under the guidance of spatial-temporal managers, the lower-level patch worker processes the selected patch with pixel-wise interpretable actions at each time step. Experimental results on medical images degraded by different kernels show the effectiveness of STAR-RL. Furthermore, STAR-RL validates the promotion in tumor diagnosis with a large margin and shows generalizability under various degradation. The source code is to be released.

10.

MGIML: Cancer Grading With Incomplete Radiology-Pathology Data via Memory Learning and Gradient Homogenization.

Wang, Pengyu; Zhang, Huaqi; Zhu, Meilu; Jiang, Xi; Qin, Jing; Yuan, Yixuan.

IEEE Trans Med Imaging ; 43(6): 2113-2124, 2024 Jun.

Article in English | MEDLINE | ID: mdl-38231819

ABSTRACT

Taking advantage of multi-modal radiology-pathology data with complementary clinical information for cancer grading is helpful for doctors to improve diagnosis efficiency and accuracy. However, radiology and pathology data have distinct acquisition difficulties and costs, which leads to incomplete-modality data being common in applications. In this work, we propose a Memory- and Gradient-guided Incomplete Modal-modal Learning (MGIML) framework for cancer grading with incomplete radiology-pathology data. Firstly, to remedy missing-modality information, we propose a Memory-driven Hetero-modality Complement (MH-Complete) scheme, which constructs modal-specific memory banks constrained by a coarse-grained memory boosting (CMB) loss to record generic radiology and pathology feature patterns, and develops a cross-modal memory reading strategy enhanced by a fine-grained memory consistency (FMC) loss to take missing-modality information from well-stored memories. Secondly, as gradient conflicts exist between missing-modality situations, we propose a Rotation-driven Gradient Homogenization (RG-Homogenize) scheme, which estimates instance-specific rotation matrices to smoothly change the feature-level gradient directions, and computes confidence-guided homogenization weights to dynamically balance gradient magnitudes. By simultaneously mitigating gradient direction and magnitude conflicts, this scheme well avoids the negative transfer and optimization imbalance problems. Extensive experiments on CPTAC-UCEC and CPTAC-PDA datasets show that the proposed MGIML framework performs favorably against state-of-the-art multi-modal methods on missing-modality situations.

Subject(s)

Algorithms , Neoplasm Grading , Humans , Neoplasm Grading/methods , Image Interpretation, Computer-Assisted/methods , Machine Learning , Neoplasms/diagnostic imaging

11.

MHD-Net: Memory-Aware Hetero-Modal Distillation Network for Thymic Epithelial Tumor Typing With Missing Pathology Modality.

Zhang, Huaqi; Liu, Jie; Liu, Weifan; Chen, Huang; Yu, Zekuan; Yuan, Yixuan; Wang, Pengyu; Qin, Jing.

IEEE J Biomed Health Inform ; 28(5): 3003-3014, 2024 May.

Article in English | MEDLINE | ID: mdl-38470599

ABSTRACT

Fusing multi-modal radiology and pathology data with complementary information can improve the accuracy of tumor typing. However, collecting pathology data is difficult since it is high-cost and sometimes only obtainable after the surgery, which limits the application of multi-modal methods in diagnosis. To address this problem, we propose comprehensively learning multi-modal radiology-pathology data in training, and only using uni-modal radiology data in testing. Concretely, a Memory-aware Hetero-modal Distillation Network (MHD-Net) is proposed, which can distill well-learned multi-modal knowledge with the assistance of memory from the teacher to the student. In the teacher, to tackle the challenge in hetero-modal feature fusion, we propose a novel spatial-differentiated hetero-modal fusion module (SHFM) that models spatial-specific tumor information correlations across modalities. As only radiology data is accessible to the student, we store pathology features in the proposed contrast-boosted typing memory module (CTMM) that achieves type-wise memory updating and stage-wise contrastive memory boosting to ensure the effectiveness and generalization of memory items. In the student, to improve the cross-modal distillation, we propose a multi-stage memory-aware distillation (MMD) scheme that reads memory-aware pathology features from CTMM to remedy missing modal-specific information. Furthermore, we construct a Radiology-Pathology Thymic Epithelial Tumor (RPTET) dataset containing paired CT and WSI images with annotations. Experiments on the RPTET and CPTAC-LUAD datasets demonstrate that MHD-Net significantly improves tumor typing and outperforms existing multi-modal methods on missing modality situations.

Subject(s)

Neoplasms, Glandular and Epithelial , Thymus Neoplasms , Humans , Thymus Neoplasms/diagnostic imaging , Neoplasms, Glandular and Epithelial/diagnostic imaging , Image Interpretation, Computer-Assisted/methods , Tomography, X-Ray Computed/methods , Algorithms , Neural Networks, Computer , Deep Learning , Multimodal Imaging/methods

12.

Mask-aware transformer with structure invariant loss for CT translation.

Chen, Wenting; Zhao, Wei; Chen, Zhen; Liu, Tianming; Liu, Li; Liu, Jun; Yuan, Yixuan.

Med Image Anal ; 96: 103205, 2024 Aug.

Article in English | MEDLINE | ID: mdl-38788328

ABSTRACT

Multi-phase enhanced computed tomography (MPECT) translation from plain CT can help doctors to detect the liver lesion and prevent patients from the allergy during MPECT examination. Existing CT translation methods directly learn an end-to-end mapping from plain CT to MPECT, ignoring the crucial clinical domain knowledge. As clinicians subtract the plain CT from MPECT images as subtraction image to highlight the contrast-enhanced regions and further to facilitate liver disease diagnosis in the clinical diagnosis, we aim to exploit this domain knowledge for automatic CT translation. To this end, we propose a Mask-Aware Transformer (MAFormer) with structure invariant loss for CT translation, which presents the first effort to exploit this domain knowledge for CT translation. Specifically, the proposed MAFormer introduces a mask estimator to predict the subtraction image from the plain CT image. To integrate the subtraction image into the network, the MAFormer devises a Mask-Aware Transformer based Normalization (MATNorm) as normalization layer to highlight the contrast-enhanced regions and capture the long-range dependencies among these regions. Moreover, aiming to preserve the biological structure of CT slices, a structure invariant loss is designed to extract the structural information and minimize the structural similarity between the plain and synthetic CT images to ensure the structure invariant. Extensive experiments have proven the effectiveness of the proposed method and its superiority to the state-of-the-art CT translation methods. Source code is to be released.

Subject(s)

Tomography, X-Ray Computed , Humans , Tomography, X-Ray Computed/methods , Algorithms , Subtraction Technique , Radiographic Image Interpretation, Computer-Assisted/methods

13.

Bioactive Polyurethane-Poly(ethylene Glycol) Diacrylate Hydrogels for Applications in Tissue Engineering.

Yuan, Yixuan; Tyson, Caleb; Szyniec, Annika; Agro, Samuel; Tavakol, Tara N; Harmon, Alexander; Lampkins, DessaRae; Pearson, Lauran; Dumas, Jerald E; Taite, Lakeshia J.

Gels ; 10(2)2024 Jan 29.

Article in English | MEDLINE | ID: mdl-38391438

ABSTRACT

Polyurethanes (PUs) are a highly adaptable class of biomaterials that are among some of the most researched materials for various biomedical applications. However, engineered tissue scaffolds composed of PU have not found their way into clinical application, mainly due to the difficulty of balancing the control of material properties with the desired cellular response. A simple method for the synthesis of tunable bioactive poly(ethylene glycol) diacrylate (PEGDA) hydrogels containing photocurable PU is described. These hydrogels may be modified with PEGylated peptides or proteins to impart variable biological functions, and the mechanical properties of the hydrogels can be tuned based on the ratios of PU and PEGDA. Studies with human cells revealed that PU-PEG blended hydrogels support cell adhesion and viability when cell adhesion peptides are crosslinked within the hydrogel matrix. These hydrogels represent a unique and highly tailorable system for synthesizing PU-based synthetic extracellular matrices for tissue engineering applications.

14.

Universal and extensible language-vision models for organ segmentation and tumor detection from abdominal computed tomography.

Liu, Jie; Zhang, Yixiao; Wang, Kang; Yavuz, Mehmet Can; Chen, Xiaoxi; Yuan, Yixuan; Li, Haoliang; Yang, Yang; Yuille, Alan; Tang, Yucheng; Zhou, Zongwei.

Med Image Anal ; 97: 103226, 2024 Jun 04.

Article in English | MEDLINE | ID: mdl-38852215

ABSTRACT

The advancement of artificial intelligence (AI) for organ segmentation and tumor detection is propelled by the growing availability of computed tomography (CT) datasets with detailed, per-voxel annotations. However, these AI models often struggle with flexibility for partially annotated datasets and extensibility for new classes due to limitations in the one-hot encoding, architectural design, and learning scheme. To overcome these limitations, we propose a universal, extensible framework enabling a single model, termed Universal Model, to deal with multiple public datasets and adapt to new classes (e.g., organs/tumors). Firstly, we introduce a novel language-driven parameter generator that leverages language embeddings from large language models, enriching semantic encoding compared with one-hot encoding. Secondly, the conventional output layers are replaced with lightweight, class-specific heads, allowing Universal Model to simultaneously segment 25 organs and six types of tumors and ease the addition of new classes. We train our Universal Model on 3410 CT volumes assembled from 14 publicly available datasets and then test it on 6173 CT volumes from four external datasets. Universal Model achieves first place on six CT tasks in the Medical Segmentation Decathlon (MSD) public leaderboard and leading performance on the Beyond The Cranial Vault (BTCV) dataset. In summary, Universal Model exhibits remarkable computational efficiency (6× faster than other dataset-specific models), demonstrates strong generalization across different hospitals, transfers well to numerous downstream tasks, and more importantly, facilitates the extensibility to new classes while alleviating the catastrophic forgetting of previously learned classes. Codes, models, and datasets are available at https://github.com/ljwztc/CLIP-Driven-Universal-Model.

15.

Dual domain distribution disruption with semantics preservation: Unsupervised domain adaptation for medical image segmentation.

Zheng, Boyun; Zhang, Ranran; Diao, Songhui; Zhu, Jingke; Yuan, Yixuan; Cai, Jing; Shao, Liang; Li, Shuo; Qin, Wenjian.

Med Image Anal ; 97: 103275, 2024 Jul 14.

Article in English | MEDLINE | ID: mdl-39032395

ABSTRACT

Recent unsupervised domain adaptation (UDA) methods in medical image segmentation commonly utilize Generative Adversarial Networks (GANs) for domain translation. However, the translated images often exhibit a distribution deviation from the ideal due to the inherent instability of GANs, leading to challenges such as visual inconsistency and incorrect style, consequently causing the segmentation model to fall into the fixed wrong pattern. To address this problem, we propose a novel UDA framework known as Dual Domain Distribution Disruption with Semantics Preservation (DDSP). Departing from the idea of generating images conforming to the target domain distribution in GAN-based UDA methods, we make the model domain-agnostic and focus on anatomical structural information by leveraging semantic information as constraints to guide the model to adapt to images with disrupted distributions in both source and target domains. Furthermore, we introduce the inter-channel similarity feature alignment based on the domain-invariant structural prior information, which facilitates the shared pixel-wise classifier to achieve robust performance on target domain features by aligning the source and target domain features across channels. Without any exaggeration, our method significantly outperforms existing state-of-the-art UDA methods on three public datasets (i.e., the heart dataset, the brain dataset, and the prostate dataset). The code is available at https://github.com/MIXAILAB/DDSPSeg.

16.

Axonal fiber terminations concentrate on gyri.

Nie, Jingxin; Guo, Lei; Li, Kaiming; Wang, Yonghua; Chen, Guojun; Li, Longchuan; Chen, Hanbo; Deng, Fan; Jiang, Xi; Zhang, Tuo; Huang, Ling; Faraco, Carlos; Zhang, Degang; Guo, Cong; Yap, Pew-Thian; Hu, Xintao; Li, Gang; Lv, Jinglei; Yuan, Yixuan; Zhu, Dajiang; Han, Junwei; Sabatinelli, Dean; Zhao, Qun; Miller, L Stephen; Xu, Bingqian; Shen, Ping; Platt, Simon; Shen, Dinggang; Hu, Xiaoping; Liu, Tianming.

Cereb Cortex ; 22(12): 2831-9, 2012 Dec.

Article in English | MEDLINE | ID: mdl-22190432

ABSTRACT

Convoluted cortical folding and neuronal wiring are 2 prominent attributes of the mammalian brain. However, the macroscale intrinsic relationship between these 2 general cross-species attributes, as well as the underlying principles that sculpt the architecture of the cerebral cortex, remains unclear. Here, we show that the axonal fibers connected to gyri are significantly denser than those connected to sulci. In human, chimpanzee, and macaque brains, a dominant fraction of axonal fibers were found to be connected to the gyri. This finding has been replicated in a range of mammalian brains via diffusion tensor imaging and high-angular resolution diffusion imaging. These results may have shed some lights on fundamental mechanisms for development and organization of the cerebral cortex, suggesting that axonal pushing is a mechanism of cortical folding.

Subject(s)

Axons/ultrastructure , Cerebral Cortex/ultrastructure , Macaca/anatomy & histology , Neural Pathways/ultrastructure , Pan troglodytes/anatomy & histology , Animals , Female , Humans , Male , Species Specificity , Young Adult

17.

Decoupled Unbiased Teacher for Source-Free Domain Adaptive Medical Object Detection.

Liu, Xinyu; Li, Wuyang; Yuan, Yixuan.

IEEE Trans Neural Netw Learn Syst ; PP2023 May 24.

Article in English | MEDLINE | ID: mdl-37224362

ABSTRACT

Source-free domain adaptation (SFDA) aims to adapt a lightweight pretrained source model to unlabeled new domains without the original labeled source data. Due to the privacy of patients and storage consumption concerns, SFDA is a more practical setting for building a generalized model in medical object detection. Existing methods usually apply the vanilla pseudo-labeling technique, while neglecting the bias issues in SFDA, leading to limited adaptation performance. To this end, we systematically analyze the biases in SFDA medical object detection by constructing a structural causal model (SCM) and propose an unbiased SFDA framework dubbed decoupled unbiased teacher (DUT). Based on the SCM, we derive that the confounding effect causes biases in the SFDA medical object detection task at the sample level, feature level, and prediction level. To prevent the model from emphasizing easy object patterns in the biased dataset, a dual invariance assessment (DIA) strategy is devised to generate counterfactual synthetics. The synthetics are based on unbiased invariant samples in both discrimination and semantic perspectives. To alleviate overfitting to domain-specific features in SFDA, we design a cross-domain feature intervention (CFI) module to explicitly deconfound the domain-specific prior with feature intervention and obtain unbiased features. Besides, we establish a correspondence supervision prioritization (CSP) strategy for addressing the prediction bias caused by coarse pseudo-labels by sample prioritizing and robust box supervision. Through extensive experiments on multiple SFDA medical object detection scenarios, DUT yields superior performance over previous state-of-the-art unsupervised domain adaptation (UDA) and SFDA counterparts, demonstrating the significance of addressing the bias issues in this challenging task. The code is available at https://github.com/CUHK-AIM-Group/Decoupled-Unbiased-Teacher.

18.

Hierarchical Bias Mitigation for Semi-Supervised Medical Image Classification.

Yang, Qiushi; Chen, Zhen; Yuan, Yixuan.

IEEE Trans Med Imaging ; 42(8): 2200-2210, 2023 08.

Article in English | MEDLINE | ID: mdl-37027665

ABSTRACT

Semi-supervised learning (SSL) has demonstrated remarkable advances on medical image classification, by harvesting beneficial knowledge from abundant unlabeled samples. The pseudo labeling dominates current SSL approaches, however, it suffers from intrinsic biases within the process. In this paper, we retrospect the pseudo labeling and identify three hierarchical biases: perception bias, selection bias and confirmation bias, at feature extraction, pseudo label selection and momentum optimization stages, respectively. In this regard, we propose a HierArchical BIas miTigation (HABIT) framework to amend these biases, which consists of three customized modules including Mutual Reconciliation Network (MRNet), Recalibrated Feature Compensation (RFC) and Consistency-aware Momentum Heredity (CMH). Firstly, in the feature extraction, MRNet is devised to jointly utilize convolution and permutator-based paths with a mutual information transfer module to exchanges features and reconcile spatial perception bias for better representations. To address pseudo label selection bias, RFC adaptively recalibrates the strong and weak augmented distributions to be a rational discrepancy and augments features for minority categories to achieve the balanced training. Finally, in the momentum optimization stage, in order to reduce the confirmation bias, CMH models the consistency among different sample augmentations into network updating process to improve the dependability of the model. Extensive experiments on three semi-supervised medical image classification datasets demonstrate that HABIT mitigates three biases and achieves state-of-the-art performance. Our codes are available at https://github.com/CityU-AIM-Group/HABIT.

Subject(s)

Supervised Machine Learning , Bias , Motion

19.

SIGMA++: Improved Semantic-Complete Graph Matching for Domain Adaptive Object Detection.

Li, Wuyang; Liu, Xinyu; Yuan, Yixuan.

IEEE Trans Pattern Anal Mach Intell ; 45(7): 9022-9040, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37018585

ABSTRACT

Domain Adaptive Object Detection (DAOD) generalizes the object detector from an annotated domain to a label-free novel one. Recent works estimate prototypes (class centers) and minimize the corresponding distances to adapt the cross-domain class conditional distribution. However, this prototype-based paradigm 1) fails to capture the class variance with agnostic structural dependencies, and 2) ignores the domain-mismatched classes with a sub-optimal adaptation. To address these two challenges, we propose an improved SemantIc-complete Graph MAtching framework, dubbed SIGMA++, for DAOD, completing mismatched semantics and reformulating adaptation with hypergraph matching. Specifically, we propose a Hypergraphical Semantic Completion (HSC) module to generate hallucination graph nodes in mismatched classes. HSC builds a cross-image hypergraph to model class conditional distribution with high-order dependencies and learns a graph-guided memory bank to generate missing semantics. After representing the source and target batch with hypergraphs, we reformulate domain adaptation with a hypergraph matching problem, i.e., discovering well-matched nodes with homogeneous semantics to reduce the domain gap, which is solved with a Bipartite Hypergraph Matching (BHM) module. Graph nodes are used to estimate semantic-aware affinity, while edges serve as high-order structural constraints in a structure-aware matching loss, achieving fine-grained adaptation with hypergraph matching. The applicability of various object detectors verifies the generalization of SIGMA++, and extensive experiments on nine benchmarks show its state-of-the-art performance on both AP 50 and adaptation gains.

20.

FedDM: Federated Weakly Supervised Segmentation via Annotation Calibration and Gradient De-Conflicting.

Zhu, Meilu; Chen, Zhen; Yuan, Yixuan.

IEEE Trans Med Imaging ; 42(6): 1632-1643, 2023 06.

Article in English | MEDLINE | ID: mdl-37018639

ABSTRACT

Weakly supervised segmentation (WSS) aims to exploit weak forms of annotations to achieve the segmentation training, thereby reducing the burden on annotation. However, existing methods rely on large-scale centralized datasets, which are difficult to construct due to privacy concerns on medical data. Federated learning (FL) provides a cross-site training paradigm and shows great potential to address this problem. In this work, we represent the first effort to formulate federated weakly supervised segmentation (FedWSS) and propose a novel Federated Drift Mitigation (FedDM) framework to learn segmentation models across multiple sites without sharing their raw data. FedDM is devoted to solving two main challenges (i.e., local drift on client-side optimization and global drift on server-side aggregation) caused by weak supervision signals in FL setting via Collaborative Annotation Calibration (CAC) and Hierarchical Gradient De-conflicting (HGD). To mitigate the local drift, CAC customizes a distal peer and a proximal peer for each client via a Monte Carlo sampling strategy, and then employs inter-client knowledge agreement and disagreement to recognize clean labels and correct noisy labels, respectively. Moreover, in order to alleviate the global drift, HGD online builds a client hierarchy under the guidance of history gradient of the global model in each communication round. Through de-conflicting clients under the same parent nodes from bottom layers to top layers, HGD achieves robust gradient aggregation at the server side. Furthermore, we theoretically analyze FedDM and conduct extensive experiments on public datasets. The experimental results demonstrate the superior performance of our method compared with state-of-the-art approaches. The source code is available at https://github.com/CityU-AIM-Group/FedDM.

Subject(s)

Software , Supervised Machine Learning , Humans , Calibration , Monte Carlo Method

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL