Results 1 - 6 of 6

1.
Comput Biol Med ; 171: 108238, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38422961

ABSTRACT

Due to the inherent flexibility of prompting, foundation models have emerged as the predominant force in natural language processing and computer vision. The recent introduction of the Segment Anything Model (SAM) signifies a noteworthy expansion of the prompt-driven paradigm into image segmentation, introducing a plethora of previously unexplored capabilities. However, the viability of its application to medical image segmentation remains uncertain, given the substantial distinctions between natural and medical images. In this work, we provide a comprehensive overview of recent endeavors aimed at extending the efficacy of SAM to medical image segmentation tasks, encompassing both empirical benchmarking and methodological adaptations. Additionally, we explore potential directions for future research on SAM's role in medical image segmentation. While direct application of SAM to medical image segmentation has not yet yielded satisfactory performance on multi-modal and multi-target medical datasets, the numerous insights gleaned from these efforts serve as valuable guidance for shaping the trajectory of foundation models in medical image analysis. To support ongoing research endeavors, we maintain an active repository containing an up-to-date paper list and a succinct summary of open-source projects at https://github.com/YichiZhang98/SAM4MIS.
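As a loose illustration of the prompt-driven paradigm this survey describes, the sketch below segments a region around a point prompt with a simple region-growing rule. It is a toy stand-in, not the actual SAM architecture or API; the function name and tolerance parameter are invented for this example.

```python
import numpy as np

def segment_from_point(image, point, tol=0.1):
    """Toy prompt-driven segmentation: grow a connected region around
    a point prompt, keeping 4-connected pixels whose intensity is
    within `tol` of the seed pixel (illustrative only, not SAM)."""
    h, w = image.shape
    seed_val = image[point]
    mask = np.zeros((h, w), dtype=bool)
    stack = [point]
    while stack:
        y, x = stack.pop()
        if not (0 <= y < h and 0 <= x < w) or mask[y, x]:
            continue
        if abs(image[y, x] - seed_val) > tol:
            continue
        mask[y, x] = True
        stack.extend([(y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)])
    return mask

# A synthetic image with a bright 4x4 square on a dark background.
img = np.zeros((8, 8))
img[2:6, 2:6] = 1.0
mask = segment_from_point(img, (3, 3))  # point prompt inside the square
```

The point prompt plays the same role as SAM's point inputs: the same model (here, rule) yields different masks for different prompts.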


Subjects
Benchmarking , Natural Language Processing , S-Adenosylmethionine , Uncertainty , Image Processing, Computer-Assisted
2.
Neural Netw ; 178: 106405, 2024 May 22.
Article in English | MEDLINE | ID: mdl-38815471

ABSTRACT

Automated detection of cervical abnormal cells from Thin-prep cytologic test (TCT) images is crucial for efficient cervical abnormality screening using computer-aided diagnosis systems. However, construction of the detection model is hindered by the preparation of training images, which typically suffers from class imbalance and incomplete annotations. Additionally, existing methods often overlook the visual feature correlations among cells, which are crucial in cervical lesion cell detection, as pathologists commonly rely on surrounding cells for identification. In this paper, we propose a distillation framework that uses a patch-level pre-training network to guide the training of an image-level detection network, and that can be applied to various detectors without changing their architectures during inference. The main contribution is threefold: (1) We propose the Balanced Pre-training Model (BPM) as the patch-level cervical cell classifier, which employs an image synthesis model to construct a class-balanced patch dataset for pre-training. (2) We design the Score Correction Loss (SCL) to enable the detection network to distill knowledge from the BPM, thereby mitigating the impact of incomplete annotations. (3) We design the Patch Correlation Consistency (PCC) strategy to exploit the correlation information of extracted cells, consistent with the behavior of cytopathologists. Experiments on public and private datasets demonstrate the superior performance of the proposed distillation method, as well as its adaptability to various detection architectures.
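The abstract does not specify the Score Correction Loss formula; the sketch below shows one plausible form of the general idea, softening hard background targets for unannotated regions with the teacher's patch-level scores so that likely missed lesions are not pushed toward background. All names and the exact blending rule are assumptions, not the paper's implementation.

```python
import numpy as np

def score_corrected_targets(gt_labels, teacher_scores, unlabeled_mask):
    """Hypothetical score-correction scheme: boxes without annotations
    get the teacher patch-classifier's confidence as a soft target
    instead of a hard background label of 0.

    gt_labels:      (N,) hard targets in {0, 1} from (incomplete) annotations
    teacher_scores: (N,) teacher probabilities of being a lesion
    unlabeled_mask: (N,) True where the box lacks an annotation
    """
    targets = gt_labels.astype(float).copy()
    targets[unlabeled_mask] = teacher_scores[unlabeled_mask]
    return targets

gt = np.array([1, 0, 0, 0])
teacher = np.array([0.9, 0.8, 0.1, 0.2])
unlabeled = np.array([False, True, False, False])  # box 1 was never annotated
soft = score_corrected_targets(gt, teacher, unlabeled)
# soft: [1.0, 0.8, 0.0, 0.0] -- the unannotated box keeps the teacher score
```

The corrected targets would then feed an ordinary classification loss in the detector, which is how a distillation term can be added without changing the detector architecture.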

3.
Med Image Anal ; 94: 103158, 2024 May.
Article in English | MEDLINE | ID: mdl-38569379

ABSTRACT

Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution but reduced through-plane resolution. Super-resolution techniques can enhance the through-plane resolution of MR images to facilitate downstream visualization and computer-aided diagnosis. However, most existing works train the super-resolution network at a fixed scaling factor, which does not suit clinical scenarios where inter-slice spacing varies across MR scans. Inspired by recent progress in implicit neural representation, we propose a Spatial Attention-based Implicit Neural Representation (SA-INR) network for arbitrary reduction of MR inter-slice spacing. The SA-INR represents an MR image as a continuous implicit function of 3D coordinates, so it can reconstruct the MR image at arbitrary inter-slice spacing by continuously sampling coordinates in 3D space. In particular, a local-aware spatial attention operation is introduced to model nearby voxels and their affinity more accurately within a larger receptive field. Meanwhile, to improve computational efficiency, a gradient-guided gating mask is proposed to apply the local-aware spatial attention to selected areas only. We evaluate our method on the public HCP-1200 dataset and a clinical knee MR dataset to demonstrate its superiority over existing methods.
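The core idea (an image as a continuous function of 3D coordinates, sampled at any inter-slice spacing) can be sketched with a tiny coordinate MLP. The weights below are random for illustration; the paper's trained SA-INR, its spatial attention, and its gating mask are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

# A tiny coordinate MLP f(x, y, z) -> intensity. Weights are random in
# this sketch; in practice they are learned so f reproduces the volume.
W1, b1 = rng.normal(size=(3, 32)), np.zeros(32)
W2, b2 = rng.normal(size=(32, 1)), np.zeros(1)

def f(coords):
    """Evaluate the implicit representation at (N, 3) coordinates."""
    h = np.tanh(coords @ W1 + b1)
    return (h @ W2 + b2).squeeze(-1)

def sample_volume(spacing_z, shape_xy=(4, 4), z_range=(0.0, 1.0)):
    """Reconstruct slices at ANY inter-slice spacing by sampling the
    continuous function on a new z-grid -- no fixed scaling factor."""
    zs = np.arange(z_range[0], z_range[1] + 1e-9, spacing_z)
    xs = np.linspace(0, 1, shape_xy[0])
    ys = np.linspace(0, 1, shape_xy[1])
    grid = np.stack(np.meshgrid(xs, ys, zs, indexing="ij"), axis=-1)
    return f(grid.reshape(-1, 3)).reshape(*shape_xy, len(zs))

vol_coarse = sample_volume(spacing_z=0.5)  # 3 slices
vol_fine = sample_volume(spacing_z=0.1)    # 11 slices, same network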


Subjects
Diagnosis, Computer-Assisted , Magnetic Resonance Imaging , Humans , Magnetic Resonance Imaging/methods , Neural Networks, Computer , Knee Joint , Phantoms, Imaging , Image Processing, Computer-Assisted/methods
4.
Comput Med Imaging Graph ; 112: 102325, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38228021

ABSTRACT

Automatic brain segmentation of magnetic resonance images (MRIs) from severe traumatic brain injury (sTBI) patients is critical for brain abnormality assessment and brain network analysis. Constructing an sTBI brain segmentation model requires manually annotated MR scans of sTBI patients, which is challenging because it is impractical to obtain sufficient annotations for sTBI images with large deformations and lesion erosion. Data augmentation can alleviate the issue of limited training samples, but conventional strategies such as spatial and intensity transformations cannot synthesize the deformations and lesions of traumatic brains, which limits the performance of the subsequent segmentation task. To address these issues, we propose a novel medical image inpainting model, sTBI-GAN, to synthesize labeled sTBI MR scans by adversarial inpainting. The main strength of sTBI-GAN is that it generates sTBI images and the corresponding labels simultaneously, which previous inpainting methods for medical images have not achieved. We first generate the inpainted image under the guidance of edge information in a coarse-to-fine manner, and the synthesized MR image is then used as the prior for label inpainting. Furthermore, we introduce a registration-based template augmentation pipeline to increase the diversity of the synthesized image pairs and enhance the capacity of data augmentation. Experimental results show that sTBI-GAN can synthesize high-quality labeled sTBI images, which greatly improves 2D and 3D traumatic brain segmentation performance compared with the alternatives. Code is available at .
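A minimal sketch of the paired image/label handling that such a pipeline relies on: any spatial transform applied to a template image must be applied identically to its label map, so image and annotation stay consistent. The integer shift below is a hypothetical stand-in for the registration-based warps the paper uses.

```python
import numpy as np

def paired_shift(image, label, dy, dx):
    """Apply the same integer translation to an image and its label
    map, as paired data augmentation requires (a simple stand-in for
    deformable, registration-based template augmentation)."""
    out_img = np.zeros_like(image)
    out_lbl = np.zeros_like(label)
    h, w = image.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_y, src_x = ys - dy, xs - dx
    valid = (src_y >= 0) & (src_y < h) & (src_x >= 0) & (src_x < w)
    out_img[valid] = image[src_y[valid], src_x[valid]]
    out_lbl[valid] = label[src_y[valid], src_x[valid]]
    return out_img, out_lbl

img = np.zeros((5, 5))
img[1, 1] = 1.0
lbl = np.zeros((5, 5), dtype=int)
lbl[1, 1] = 2  # a labeled structure at the same voxel
img2, lbl2 = paired_shift(img, lbl, dy=2, dx=1)
# the voxel and its label move together: img2[3, 2] == 1.0, lbl2[3, 2] == 2
```

Replacing the shift with a sampled deformation field gives one synthetic image/label pair per sampled transform, which is what increases the diversity of training pairs.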


Subjects
Brain Diseases , Brain Injuries, Traumatic , Humans , Learning , Brain Injuries, Traumatic/diagnostic imaging , Brain/diagnostic imaging , Image Processing, Computer-Assisted
5.
Med Image Anal ; 84: 102708, 2023 Feb.
Article in English | MEDLINE | ID: mdl-36516554

ABSTRACT

Lung nodule detection in chest X-ray (CXR) images is common in early screening for lung cancer. Deep-learning-based computer-assisted diagnosis (CAD) systems can support radiologists in nodule screening on CXR images. However, training such robust and accurate CAD systems requires large-scale, diverse medical data with high-quality annotations. To alleviate the limited availability of such datasets, lung nodule synthesis methods have been proposed for data augmentation. Nevertheless, previous methods cannot generate realistic nodules with the shape and size attributes desired by the detector. To address this issue, we introduce a novel lung nodule synthesis framework that decomposes nodule attributes into three main aspects: shape, size, and texture. A GAN-based Shape Generator first models nodule shapes by generating diverse shape masks. A Size Modulation step then enables quantitative control over the diameters of the generated nodule shapes at pixel-level granularity. A coarse-to-fine gated convolutional Texture Generator finally synthesizes visually plausible nodule textures conditioned on the modulated shape masks. Moreover, we propose to synthesize nodule CXR images by controlling the disentangled nodule attributes for data augmentation, in order to better compensate for nodules that are easily missed in the detection task. Our experiments demonstrate the enhanced image quality, diversity, and controllability of the proposed framework. We also validate the effectiveness of our data augmentation strategy in greatly improving nodule detection performance.
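The Size Modulation step controls nodule diameter in pixels; a hypothetical sketch of such pixel-level size control, rescaling a binary shape mask with nearest-neighbour sampling until its bounding-box diameter matches a target, is shown below. The function and its details are assumptions, not the paper's implementation.

```python
import numpy as np

def modulate_size(mask, target_diameter):
    """Sketch of pixel-level size modulation: nearest-neighbour rescale
    of a binary shape mask so that its bounding-box diameter equals
    `target_diameter` pixels (illustrative stand-in for the paper's
    Size Modulation step)."""
    ys, xs = np.nonzero(mask)
    cur = max(ys.max() - ys.min(), xs.max() - xs.min()) + 1
    scale = target_diameter / cur
    h, w = mask.shape
    new_h, new_w = max(1, round(h * scale)), max(1, round(w * scale))
    yy = np.minimum((np.arange(new_h) / scale).astype(int), h - 1)
    xx = np.minimum((np.arange(new_w) / scale).astype(int), w - 1)
    return mask[np.ix_(yy, xx)]

# A disk-shaped mask of diameter 11 px, modulated to diameter 16 px.
disk = np.zeros((20, 20), dtype=bool)
gy, gx = np.ogrid[:20, :20]
disk[(gy - 10) ** 2 + (gx - 10) ** 2 <= 25] = True
bigger = modulate_size(disk, target_diameter=16)
```

The modulated mask would then condition the texture generator, so diameter becomes an explicitly controllable attribute of every synthesized nodule.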


Subjects
Lung Neoplasms , Solitary Pulmonary Nodule , Humans , Tomography, X-Ray Computed/methods , X-Rays , Radiographic Image Interpretation, Computer-Assisted/methods , Lung Neoplasms/diagnostic imaging , Radiography , Solitary Pulmonary Nodule/diagnostic imaging , Lung
6.
Comput Med Imaging Graph ; 80: 101690, 2020 Mar.
Article in English | MEDLINE | ID: mdl-31968286

ABSTRACT

Fetal echocardiography (FE) is a widely used medical examination for early diagnosis of congenital heart disease (CHD). The apical four-chamber (A4C) view is an important view among early FE images, and accurate segmentation of crucial anatomical structures in the A4C view is a useful step toward early diagnosis and timely treatment of CHDs. However, this is a challenging task due to several unfavorable factors: (a) artifacts and speckle noise produced by ultrasound imaging; (b) category confusion caused by the similarity of anatomical structures and variations in scanning angle; and (c) missing boundaries. In this paper, we propose an end-to-end DW-Net for accurate segmentation of seven important anatomical structures in the A4C view. The network comprises two components: (1) a Dilated Convolutional Chain (DCC) that reduces the "gridding issue", aggregates multi-scale contextual information, and accurately localizes cardiac chambers; and (2) a W-Net that yields more precise boundaries and refined segmentation results. Extensive experiments on a dataset of 895 A4C views demonstrate that DW-Net achieves good segmentation results, with a Dice Similarity Coefficient (DSC) of 0.827, a Pixel Accuracy (PA) of 0.933, and an AUC of 0.990, substantially outperforming several well-known segmentation methods. Our work was highly valued by experienced clinicians. The accurate, automatic segmentation of the A4C view by DW-Net can support further extraction of useful clinical indicators in early FE and improve the prenatal diagnostic accuracy and efficiency of CHDs.
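The receptive-field growth that a chain of dilated convolutions provides, which the DCC exploits, can be illustrated in 1D. The dilation rates (1, 2, 4) are hypothetical; the abstract does not give DW-Net's actual configuration.

```python
import numpy as np

def dilated_conv1d(x, kernel, dilation):
    """Valid-mode 1-D dilated convolution for a length-3 kernel."""
    span = dilation * (len(kernel) - 1)
    out = np.zeros(len(x) - span)
    for i in range(len(out)):
        taps = x[i : i + span + 1 : dilation]  # every `dilation`-th sample
        out[i] = taps @ kernel
    return out

# Chain of kernel-3 dilated convs with rates 1, 2, 4. Each layer adds
# 2 * dilation to the receptive field: 1 + 2*(1 + 2 + 4) = 15 samples,
# far wider than three undilated layers (7) at the same cost.
k = np.ones(3) / 3.0
x = np.zeros(31)
x[15] = 1.0  # unit impulse: its response reveals the receptive field
y = x
for d in (1, 2, 4):
    y = dilated_conv1d(y, k, d)
receptive_field = 1 + 2 * (1 + 2 + 4)
```

Mixing consecutive rates like this is also what mitigates the "gridding issue": a stack using a single large rate would only ever see a sparse lattice of inputs.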


Subjects
Echocardiography/methods , Fetal Heart/diagnostic imaging , Neural Networks, Computer , Ultrasonography, Prenatal/methods , Artifacts , Female , Fetal Heart/anatomy & histology , Heart Defects, Congenital/diagnostic imaging , Humans , Pregnancy