Pesquisa | Biblioteca Virtual em Saúde Fiocruz

1.

Learning U-Net Based Multi-Scale Features in Encoding-Decoding for MR Image Brain Tissue Segmentation.

Long, Jiao-Song; Ma, Guang-Zhi; Song, En-Min; Jin, Ren-Chao.

Sensors (Basel) ; 21(9)2021 May 07.

Artigo em Inglês | MEDLINE | ID: mdl-34067101

RESUMO

Accurate brain tissue segmentation of MRI is vital to diagnosis aiding, treatment planning, and neurologic condition monitoring. As an excellent convolutional neural network (CNN), U-Net is widely used in MR image segmentation as it usually generates high-precision features. However, the performance of U-Net is considerably restricted due to the variable shapes of the segmented targets in MRI and the information loss of down-sampling and up-sampling operations. Therefore, we propose a novel network by introducing spatial and channel dimensions-based multi-scale feature information extractors into its encoding-decoding framework, which is helpful in extracting rich multi-scale features while highlighting the details of higher-level features in the encoding part, and recovering the corresponding localization to a higher resolution layer in the decoding part. Concretely, we propose two information extractors, multi-branch pooling, called MP, in the encoding part, and multi-branch dense prediction, called MDP, in the decoding part, to extract multi-scale features. Additionally, we designed a new multi-branch output structure with MDP in the decoding part to form more accurate edge-preserving predicting maps by integrating the dense adjacent prediction features at different scales. Finally, the proposed method is tested on datasets MRbrainS13, IBSR18, and ISeg2017. We find that the proposed network performs higher accuracy in segmenting MRI brain tissues and it is better than the leading method of 2018 at the segmentation of GM and CSF. Therefore, it can be a useful tool for diagnostic applications, such as brain MRI segmentation and diagnosing.

Assuntos

Processamento de Imagem Assistida por Computador , Imageamento por Ressonância Magnética , Encéfalo/diagnóstico por imagem , Redes Neurais de Computação

2.

Multi-model Ensemble Learning Architecture Based on 3D CNN for Lung Nodule Malignancy Suspiciousness Classification.

Liu, Hong; Cao, Haichao; Song, Enmin; Ma, Guangzhi; Xu, Xiangyang; Jin, Renchao; Liu, Chuhua; Hung, Chih-Cheng.

J Digit Imaging ; 33(5): 1242-1256, 2020 10.

Artigo em Inglês | MEDLINE | ID: mdl-32607905

RESUMO

Classification of benign and malignant in lung nodules using chest CT images is a key step in the diagnosis of early-stage lung cancer, as well as an effective way to improve the patients' survival rate. However, due to the diversity of lung nodules and the visual similarity of lung nodules to their surrounding tissues, it is difficult to construct a robust classification model with conventional deep learning-based diagnostic methods. To address this problem, we propose a multi-model ensemble learning architecture based on 3D convolutional neural network (MMEL-3DCNN). This approach incorporates three key ideas: (1) Constructed multi-model network architecture can be well adapted to the heterogeneity of lung nodules. (2) The input that concatenated of the intensity image corresponding to the nodule mask, the original image, and the enhanced image corresponding to which can help training model to extract advanced feature with more discriminative capacity. (3) Select the corresponding model to different nodule size dynamically for prediction, which can improve the generalization ability of the model effectively. In addition, ensemble learning is applied in this paper to further improve the robustness of the nodule classification model. The proposed method has been experimentally verified on the public dataset, LIDC-IDRI. The experimental results show that the proposed MMEL-3DCNN architecture can obtain satisfactory classification results.

Assuntos

Neoplasias Pulmonares , Humanos , Pulmão , Neoplasias Pulmonares/diagnóstico por imagem , Aprendizado de Máquina , Interpretação de Imagem Radiográfica Assistida por Computador , Nódulo Pulmonar Solitário/diagnóstico por imagem , Tomografia Computadorizada por Raios X

3.

Bone Marrow Cells Detection: A Technique for the Microscopic Image Analysis.

Liu, Hong; Cao, Haichao; Song, Enmin.

J Med Syst ; 43(4): 82, 2019 Feb 23.

Artigo em Inglês | MEDLINE | ID: mdl-30798374

RESUMO

In the detection of myeloproliferative, the number of cells in each type of bone marrow cells (BMC) is an important parameter for the evaluation. In this study, we propose a new counting method, which consists of three modules including localization, segmentation and classification. The localization of BMC is achieved from a color transformation enhanced BMC sample image and stepwise averaging method. In the nucleus segmentation, both stepwise averaging method and Otsu's method are applied to obtain a weighted threshold for segmenting the patch into nucleus and non-nucleus. In the cytoplasm segmentation, a color weakening transformation, an improved region growing method and the K-Means algorithm are employed. The connected cells with BMC will be separated by the marker-controlled watershed algorithm. The features will be extracted for the classification after the segmentation. In this study, the BMC are classified using the support vector machine into five classes; namely, neutrophilic split granulocyte, neutrophilic stab granulocyte, metarubricyte, mature lymphocytes and the outlier (all other cells not listed). Experimental results show that the proposed method achieves superior segmentation and classification performance with an average segmentation accuracy of 91.76% and an average recall rate of 87.49%. The comparison shows that the proposed segmentation and classification methods outperform the existing methods.

Assuntos

Células da Medula Óssea/classificação , Processamento de Imagem Assistida por Computador/métodos , Microscopia/métodos , Reconhecimento Automatizado de Padrão/métodos , Máquina de Vetores de Suporte , Algoritmos , Núcleo Celular , Cor , Humanos

4.

Automatic Labeling of MR Brain Images Through the Hashing Retrieval Based Atlas Forest.

Liu, Hong; Xu, Lijun; Song, Enmin; Jin, Renchao; Hung, Chih-Cheng.

J Med Syst ; 43(8): 241, 2019 Jun 21.

Artigo em Inglês | MEDLINE | ID: mdl-31227923

RESUMO

The multi-atlas method is one of the efficient and common automatic labeling method, which uses the prior information provided by expert-labeled images to guide the labeling of the target. However, most multi-atlas-based methods depend on the registration that may not give the correct information during the label propagation. To address the issue, we designed a new automatic labeling method through the hashing retrieval based atlas forest. The proposed method propagates labels without registration to reduce the errors, and constructs a target-oriented learning model to integrate information among the atlases. This method innovates a coarse classification strategy to preprocess the dataset, which retains the integrity of dataset and reduces computing time. Furthermore, the method considers each voxel in the atlas as a sample and encodes these samples with hashing for the fast sample retrieval. In the stage of labeling, the method selects suitable samples through hashing learning and trains atlas forests by integrating the information from the dataset. Then, the trained model is used to predict the labels of the target. Experimental results on two datasets illustrated that the proposed method is promising in the automatic labeling of MR brain images.

Assuntos

Processamento de Imagem Assistida por Computador , Neuroimagem , Reconhecimento Automatizado de Padrão/métodos , Algoritmos , Humanos , Aprendizado de Máquina

5.

Reliability-Aware Cooperative Node Sleeping and Clustering in Duty-Cycled Sensors Networks.

Song, Jeungeun; Miao, Yiming; Song, Enmin; Hossain, M Shamim; Alhamid, Mohammed F.

Sensors (Basel) ; 18(1)2018 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-29300329

RESUMO

Duty-cycled sensor networks provide a new perspective for improvement of energy efficiency and reliability assurance of multi-hop cooperative sensor networks. In this paper, we consider the energy-efficient cooperative node sleeping and clustering problems in cooperative sensor networks where clusters of relay nodes jointly transmit sensory data to the next hop. Our key idea for guaranteeing reliability is to exploit the on-demand number of cooperative nodes, facilitating the prediction of personalized end-to-end (ETE) reliability. Namely, a novel reliability-aware cooperative routing (RCR) scheme is proposed to select k-cooperative nodes at every hop (RCR-selection). After selecting k cooperative nodes at every hop, all of the non-cooperative nodes will go into sleep status. In order to solve the cooperative node clustering problem, we propose the RCR-based optimal relay assignment and cooperative data delivery (RCR-delivery) scheme to provide a low-communication-overhead data transmission and an optimal duty cycle for a given number of cooperative nodes when the network is dynamic, which enables part of cooperative nodes to switch into idle status for further energy saving. Through the extensive OPNET-based simulations, we show that the proposed scheme significantly outperforms the existing geographic routing schemes and beaconless geographic routings in wireless sensor networks with a highly dynamic wireless channel and controls energy consumption, while ETE reliability is effectively guaranteed.

6.

An Augmented Reality Endoscope System for Ureter Position Detection.

Yu, Feng; Song, Enmin; Liu, Hong; Li, Yunlong; Zhu, Jun; Hung, Chih-Cheng.

J Med Syst ; 42(8): 138, 2018 Jun 25.

Artigo em Inglês | MEDLINE | ID: mdl-29938379

RESUMO

Iatrogenic injury of ureter in the clinical operation may cause the serious complication and kidney damage. To avoid such a medical accident, it is necessary to provide the ureter position information to the doctor. For the detection of ureter position, an ureter position detection and display system with the augmented ris proposed to detect the ureter that is covered by human tissue. There are two key issues which should be considered in this new system. One is how to detect the covered ureter that cannot be captured by the electronic endoscope and the other is how to display the ureter position that provides stable and high-quality images. Simultaneously, any delayed processing of the system should disturb the surgery. The aided hardware detection method and target detection algorithms are proposed in this system. To mark the ureter position, a surface-lighting plastic optical fiber (POF) with the encoded light-emitting diode (LED) light is used to indicate the ureter position. The monochrome channel filtering algorithm (MCFA) is proposed to locate the ureter region more precisely. The ureter position is extracted using the proposed automatic region growing algorithm (ARGA) that utilizes the statistical information of the monochrome channel for the selection of growing seed point. In addition, according to the pulse signal of encoded light, the recognition of bright and dark frames based on the aided hardware (BDAH) is proposed to expedite the processing speed. Experimental results demonstrate that the proposed endoscope system can identify 92.04% ureter region in average.

Assuntos

Algoritmos , Ureter , Realidade Virtual , Endoscópios , Endoscopia/métodos , Humanos , Doença Iatrogênica/prevenção & controle

7.

A Novel Endoscope System for Position Detection and Depth Estimation of the Ureter.

Song, Enmin; Yu, Feng; Liu, Hong; Cheng, Ning; Li, Yunlong; Jin, Lianghai; Hung, Chih-Cheng.

J Med Syst ; 40(12): 266, 2016 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-27730392

RESUMO

Iatrogenic injury of ureter occurs occasionally in the clinical laparoscopic surgery. The ureter injury may cause the serious complications and kidney damage. To avoid such an injury, it is necessary to detect the ureter position in real-time. Currently, the endoscope cannot perform this type of function in detecting the ureter position in real-time. In order to have the real-time display of ureter position during the surgical operation, we propose a novel endoscope system which consists of a modified endoscope light and a new lumiontron tube with the LED light. The endoscope light is modified to detect the position of ureter by using our proposed dim target detection algorithm (DTDA). To make this new system functioning, two algorithmic approaches are proposed for the display of ureter position. The horizontal position of ureter is detected by the center line extraction method and the depth of ureter is estimated by the depth estimation method. Experimental results demonstrate that the proposed endoscope system can extract the position and depth information of ureter and exhibit superior performance in terms of accuracy and stabilization.

Assuntos

Algoritmos , Endoscópios , Imageamento Tridimensional/métodos , Ureter/anatomia & histologia , Desenho de Equipamento , Humanos

8.

Automatic segmentation of pulmonary fissures in computed tomography images using 3D surface features.

Yu, Mali; Liu, Hong; Gong, Jianping; Jin, Renchao; Han, Ping; Song, Enmin.

J Digit Imaging ; 27(1): 58-67, 2014 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-23982119

RESUMO

Pulmonary interlobar fissures are important anatomic structures in human lungs and are useful in locating and classifying lung abnormalities. Automatic segmentation of fissures is a difficult task because of their low contrast and large variability. We developed a fully automatic training-free approach for fissure segmentation based on the local bending degree (LBD) and the maximum bending index (MBI). The LBD is determined by the angle between the eigenvectors of two Hessian matrices for a pair of adjacent voxels. It is used to construct a constraint to extract the candidate surfaces in three-dimensional (3D) space. The MBI is a measure to discriminate cylindrical surfaces from planar surfaces in 3D space. Our approach for segmenting fissures consists of five steps, including lung segmentation, plane-like structure enhancement, surface extraction with LBD, initial fissure identification with MBI, and fissure extension based on local plane fitting. When applying our approach to 15 chest computed tomography (CT) scans, the mean values of the positive predictive value, the sensitivity, the root-mean square (RMS) distance, and the maximal RMS are 91 %, 88 %, 1.01 ± 0.99 mm, and 11.56 mm, respectively, which suggests that our algorithm can efficiently segment fissures in chest CT scans.

Assuntos

Imageamento Tridimensional/métodos , Pulmão/diagnóstico por imagem , Reconhecimento Automatizado de Padrão/métodos , Intensificação de Imagem Radiográfica/métodos , Interpretação de Imagem Radiográfica Assistida por Computador/métodos , Tomografia Computadorizada por Raios X/métodos , Adulto , Idoso , China , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Adulto Jovem

9.

Combining external-latent attention for medical image segmentation.

Song, Enmin; Zhan, Bangcheng; Liu, Hong.

Neural Netw ; 170: 468-477, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38039684

RESUMO

The attention mechanism comes as a new entry point for improving the performance of medical image segmentation. How to reasonably assign weights is a key element of the attention mechanism, and the current popular schemes include the global squeezing and the non-local information interactions using self-attention (SA) operation. However, these approaches over-focus on external features and lack the exploitation of latent features. The global squeezing approach crudely represents the richness of contextual information by the global mean or maximum value, while non-local information interactions focus on the similarity of external features between different regions. Both ignore the fact that the contextual information is presented more in terms of the latent features like the frequency change within the data. To tackle above problems and make proper use of attention mechanisms in medical image segmentation, we propose an external-latent attention collaborative guided image segmentation network, named TransGuider. This network consists of three key components: 1) a latent attention module that uses an improved entropy quantification method to accurately explore and locate the distribution of latent contextual information. 2) an external self-attention module using sparse representation, which can preserve external global contextual information while reducing computational overhead by selecting representative feature description map for SA operation. 3) a multi-attention collaborative module to guide the network to continuously focus on the region of interest, refining the segmentation mask. Our experimental results on several benchmark medical image segmentation datasets show that TransGuider outperforms the state-of-the-art methods, and extensive ablation experiments demonstrate the effectiveness of the proposed components. Our code will be available at https://github.com/chasingone/TransGuider.

Assuntos

Benchmarking , Processamento de Imagem Assistida por Computador , Entropia

10.

An advanced nomogram model using deep learning radiomics and clinical data for predicting occult lymph node metastasis in lung adenocarcinoma.

Ye, Guanchao; Zhang, Chi; Zhuang, Yuzhou; Liu, Hong; Song, Enmin; Li, Kuo; Liao, Yongde.

Transl Oncol ; 44: 101922, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38554572

RESUMO

PURPOSE: To evaluate the effectiveness of deep learning radiomics nomogram in distinguishing the occult lymph node metastasis (OLNM) status in clinical stage IA lung adenocarcinoma. METHODS: A cohort of 473 cases of lung adenocarcinomas from two hospitals was included, with 404 cases allocated to the training cohort and 69 cases to the testing cohort. Clinical characteristics and semantic features were collected, and radiomics features were extracted from the computed tomography (CT) images. Additionally, deep transfer learning (DTL) features were generated using RseNet50. Predictive models were developed using the logistic regression (LR) machine learning algorithm. Moreover, gene analysis was conducted on RNA sequencing data from 14 patients to explore the underlying biological basis of deep learning radiomics scores. RESULT: The training and testing cohorts achieved AUC values of 0.826 and 0.775 for the clinical model, 0.865 and 0.801 for the radiomics model, 0.927 and 0.885 for the DTL-radiomics model, and 0.928 and 0.898 for the nomogram model. The nomogram model demonstrated superiority over the clinical model. The decision curve analysis (DCA) revealed a net benefit in predicting OLNM for all models. The investigation into the biological basis of deep learning radiomics scores identified an association between high scores and pathways related to tumor proliferation and immune cell infiltration in the microenvironment. CONCLUSIONS: The nomogram model, incorporating clinical-semantic features, radiomics, and DTL features, exhibited promising performance in predicting OLNM. It has the potential to provide valuable information for non-invasive lymph node staging and individualized therapeutic approaches.

11.

Development and Validation of a Deep Learning Radiomics Model to Predict High-Risk Pathologic Pulmonary Nodules Using Preoperative Computed Tomography.

Ye, Guanchao; Wu, Guangyao; Li, Kuo; Zhang, Chi; Zhuang, Yuzhou; Liu, Hong; Song, Enmin; Qi, Yu; Li, Yiying; Yang, Fan; Liao, Yongde.

Acad Radiol ; 31(4): 1686-1697, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-37802672

RESUMO

RATIONALE AND OBJECTIVES: To accurately identify the high-risk pathological factors of pulmonary nodules, our study constructed a model combined with clinical features, radiomics features, and deep transfer learning features to predict high-risk pathological pulmonary nodules. MATERIALS AND METHODS: The study cohort consisted of 469 cases of lung adenocarcinoma patients, divided into a training cohort (n = 400) and an external validation cohort (n = 69). We obtained computed tomography (CT) semantic features and clinical characteristics, as well as extracted radiomics and deep transfer learning (DTL) features from the CT images. Selected features were used for constructing prediction models using the logistic regression (LR) algorithm. The performance of the models was evaluated through metrics including the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, calibration curve, and decision curve analysis. RESULTS: The clinical model achieved an AUC of 0.774 (95% CI: 0.728-0.821) in the training cohort and 0.762 (95% confidence interval [CI]: 0.650-0.873) in the external validation cohort. The radiomics model demonstrated an AUC of 0.847 (95% CI: 0.810-0.884) in the training cohort and 0.800 (95% CI: 0.693-0.907) in the external validation cohort. The radiomics-DTL (Rad-DTL) model showed an AUC of 0.871 (95% CI: 0.838-0.905) in the training cohort and 0.806 (95% CI: 0.698-0.914) in the external validation cohort. The proposed combined model yielded AUC values of 0.872 and 0.814 in the training and external validation cohorts, respectively. The combined model demonstrated superiority over both the clinical model and the Rad-DTL model. There were no statistically significant differences observed in the comparison between the combined model incorporating clinical features and the Rad-DTL model. Decision curve analysis (DCA) indicated that the models provided a net benefit in predicting high-risk pathologic pulmonary nodules. CONCLUSION: Rad-DTL signature is a potential biomarker for predicting high-risk pathologic pulmonary nodules using preoperative CT, determining the appropriate surgical strategy, and guiding the extent of resection.

Assuntos

Adenocarcinoma de Pulmão , Aprendizado Profundo , Neoplasias Pulmonares , Nódulos Pulmonares Múltiplos , Humanos , Radiômica , Nódulos Pulmonares Múltiplos/diagnóstico por imagem , Tomografia Computadorizada por Raios X , Neoplasias Pulmonares/diagnóstico por imagem , Estudos Retrospectivos

12.

CT-based quantification of intratumoral heterogeneity for predicting pathologic complete response to neoadjuvant immunochemotherapy in non-small cell lung cancer.

Ye, Guanchao; Wu, Guangyao; Zhang, Chunyang; Wang, Mingliang; Liu, Hong; Song, Enmin; Zhuang, Yuzhou; Li, Kuo; Qi, Yu; Liao, Yongde.

Front Immunol ; 15: 1414954, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38933281

RESUMO

Objectives: To investigate the prediction of pathologic complete response (pCR) in patients with non-small cell lung cancer (NSCLC) undergoing neoadjuvant immunochemotherapy (NAIC) using quantification of intratumoral heterogeneity from pre-treatment CT image. Methods: This retrospective study included 178 patients with NSCLC who underwent NAIC at 4 different centers. The training set comprised 108 patients from center A, while the external validation set consisted of 70 patients from center B, center C, and center D. The traditional radiomics model was contrasted using radiomics features. The radiomics features of each pixel within the tumor region of interest (ROI) were extracted. The optimal division of tumor subregions was determined using the K-means unsupervised clustering method. The internal tumor heterogeneity habitat model was developed using the habitats features from each tumor sub-region. The LR algorithm was employed in this study to construct a machine learning prediction model. The diagnostic performance of the model was evaluated using criteria such as area under the receiver operating characteristic curve (AUC), accuracy, specificity, sensitivity, positive predictive value (PPV), and negative predictive value (NPV). Results: In the training cohort, the traditional radiomics model achieved an AUC of 0.778 [95% confidence interval (CI): 0.688-0.868], while the tumor internal heterogeneity habitat model achieved an AUC of 0.861 (95% CI: 0.789-0.932). The tumor internal heterogeneity habitat model exhibits a higher AUC value. It demonstrates an accuracy of 0.815, surpassing the accuracy of 0.685 achieved by traditional radiomics models. In the external validation cohort, the AUC values of the two models were 0.723 (CI: 0.591-0.855) and 0.781 (95% CI: 0.673-0.889), respectively. The habitat model continues to exhibit higher AUC values. In terms of accuracy evaluation, the tumor heterogeneity habitat model outperforms the traditional radiomics model, achieving a score of 0.743 compared to 0.686. Conclusion: The quantitative analysis of intratumoral heterogeneity using CT to predict pCR in NSCLC patients undergoing NAIC holds the potential to inform clinical decision-making for resectable NSCLC patients, prevent overtreatment, and enable personalized and precise cancer management.

Assuntos

Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Terapia Neoadjuvante , Tomografia Computadorizada por Raios X , Humanos , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Carcinoma Pulmonar de Células não Pequenas/terapia , Carcinoma Pulmonar de Células não Pequenas/diagnóstico por imagem , Carcinoma Pulmonar de Células não Pequenas/patologia , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/terapia , Neoplasias Pulmonares/diagnóstico por imagem , Neoplasias Pulmonares/patologia , Masculino , Feminino , Terapia Neoadjuvante/métodos , Pessoa de Meia-Idade , Estudos Retrospectivos , Idoso , Tomografia Computadorizada por Raios X/métodos , Resultado do Tratamento , Aprendizado de Máquina , Imunoterapia/métodos , Adulto , Resposta Patológica Completa

13.

Magnetically driven capsules with multimodal response and multifunctionality for biomedical applications.

Sun, Yuxuan; Zhang, Wang; Gu, Junnan; Xia, Liangyu; Cao, Yinghao; Zhu, Xinhui; Wen, Hao; Ouyang, Shaowei; Liu, Ruiqi; Li, Jialong; Jiang, Zhenxing; Cheng, Denglong; Lv, Yiliang; Han, Xiaotao; Qiu, Wu; Cai, Kailin; Song, Enmin; Cao, Quanliang; Li, Liang.

Nat Commun ; 15(1): 1839, 2024 Feb 29.

Artigo em Inglês | MEDLINE | ID: mdl-38424039

RESUMO

Untethered capsules hold clinical potential for the diagnosis and treatment of gastrointestinal diseases. Although considerable progress has been achieved recently in this field, the constraints imposed by the narrow spatial structure of the capsule and complex gastrointestinal tract environment cause many open-ended problems, such as poor active motion and limited medical functions. In this work, we describe the development of small-scale magnetically driven capsules with a distinct magnetic soft valve made of dual-layer ferromagnetic soft composite films. A core technological advancement achieved is the flexible opening and closing of the magnetic soft valve by using the competitive interactions between magnetic gradient force and magnetic torque, laying the foundation for the functional integration of both drug release and sampling. Meanwhile, we propose a magnetic actuation strategy based on multi-frequency response control and demonstrate that it can achieve effective decoupled regulation of the capsule's global motion and local responses. Finally, through a comprehensive approach encompassing ideal models, animal ex vivo models, and in vivo assessment, we demonstrate the versatility of the developed magnetic capsules and their multiple potential applications in the biomedical field, such as targeted drug delivery and sampling, selective dual-drug release, and light/thermal-assisted therapy.

Assuntos

Sistemas de Liberação de Medicamentos , Gastroenteropatias , Animais , Fenômenos Físicos

14.

Wireless capsule endoscopy video reduction based on camera motion estimation.

Liu, Hong; Pan, Ning; Lu, Heng; Song, Enmin; Wang, Qian; Hung, Chih-Cheng.

J Digit Imaging ; 26(2): 287-301, 2013 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-22868484

RESUMO

Wireless capsule endoscopy (WCE) is a novel technology aiming for investigating the diseases and abnormalities in small intestine. The major drawback of WCE examination is that it takes a long time to examine the whole WCE video. In this paper, we present a new reduction scheme for WCE video to reduce the examination time. To achieve this task, a WCE video motion model is proposed. Under this motion model, the WCE imaging motion is estimated in two stages (the coarse level and the fine level). In the coarse level, the WCE camera motion is estimated with a combination of Bee Algorithm and Mutual Information. In the fine level, the local gastrointestinal tract motion is estimated with SIFT flow. Based on the result of WCE imaging motion estimation, the reduction scheme preserves key images in WCE video with scene changes. From experimental results, we notice that the proposed motion model is suitable for the motion estimation in successive WCE images. Through the comparison with APRS and FCM-NMF scheme, our scheme can produce an acceptable reduction sequence for browsing and examination.

Assuntos

Artefatos , Endoscopia por Cápsula/métodos , Gastroenteropatias/diagnóstico , Movimento (Física) , Algoritmos , Cápsulas Endoscópicas , Humanos , Modelos Teóricos , Sensibilidade e Especificidade

15.

FSA-Net: Rethinking the attention mechanisms in medical image segmentation from releasing global suppressed information.

Zhan, Bangcheng; Song, Enmin; Liu, Hong.

Comput Biol Med ; 161: 106932, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-37230013

RESUMO

Attention mechanism-based medical image segmentation methods have developed rapidly recently. For the attention mechanisms, it is crucial to accurately capture the distribution weights of the effective features contained in the data. To accomplish this task, most attention mechanisms prefer using the global squeezing approach. However, it will lead to a problem of over-focusing on the global most salient effective features of the region of interest, while suppressing the secondary salient ones. Making partial fine-grained features are abandoned directly. To address this issue, we propose to use a multiple-local perception method to aggregate global effective features, and design a fine-grained medical image segmentation network, named FSA-Net. This network consists of two key components: 1) the novel Separable Attention Mechanisms which replace global squeezing with local squeezing to release the suppressed secondary salient effective features. 2) a Multi-Attention Aggregator (MAA) which can fuse multi-level attention to efficiently aggregate task-relevant semantic information. We conduct extensive experimental evaluations on five publicly available medical image segmentation datasets: MoNuSeg, COVID-19-CT100, GlaS, CVC-ClinicDB, ISIC2018, and DRIVE datasets. Experimental results show that FSA-Net outperforms state-of-the-art methods in medical image segmentation.

Assuntos

COVID-19 , Humanos , COVID-19/diagnóstico por imagem , Semântica , Processamento de Imagem Assistida por Computador

16.

A 3D Cross-Modality Feature Interaction Network With Volumetric Feature Alignment for Brain Tumor and Tissue Segmentation.

Zhuang, Yuzhou; Liu, Hong; Song, Enmin; Hung, Chih-Cheng.

IEEE J Biomed Health Inform ; 27(1): 75-86, 2023 01.

Artigo em Inglês | MEDLINE | ID: mdl-36251915

RESUMO

Accurate volumetric segmentation of brain tumors and tissues is beneficial for quantitative brain analysis and brain disease identification in multi-modal Magnetic Resonance (MR) images. Nevertheless, due to the complex relationship between modalities, 3D Fully Convolutional Networks (3D FCNs) using simple multi-modal fusion strategies hardly learn the complex and nonlinear complementary information between modalities. Meanwhile, the indiscriminative feature aggregation between low-level and high-level features easily causes volumetric feature misalignment in 3D FCNs. On the other hand, the 3D convolution operations of 3D FCNs are excellent at modeling local relations but typically inefficient at capturing global relations between distant regions in volumetric images. To tackle these issues, we propose an Aligned Cross-Modality Interaction Network (ACMINet) for segmenting the regions of brain tumors and tissues from MR images. In this network, the cross-modality feature interaction module is first designed to adaptively and efficiently fuse and refine multi-modal features. Secondly, the volumetric feature alignment module is developed for dynamically aligning low-level and high-level features by the learnable volumetric feature deformation field. Thirdly, we propose the volumetric dual interaction graph reasoning module for graph-based global context modeling in spatial and channel dimensions. Our proposed method is applied to brain glioma, vestibular schwannoma, and brain tissue segmentation tasks, and we performed extensive experiments on BraTS2018, BraTS2020, Vestibular Schwannoma, and iSeg-2017 datasets. Experimental results show that ACMINet achieves state-of-the-art segmentation performance on all four benchmark datasets and obtains the highest DSC score of hard-segmented enhanced tumor region on the validation leaderboard of the BraTS2020 challenge.

Assuntos

Neoplasias Encefálicas , Neuroma Acústico , Humanos , Redes Neurais de Computação , Neuroma Acústico/patologia , Neoplasias Encefálicas/patologia , Imageamento por Ressonância Magnética/métodos , Encéfalo/patologia , Processamento de Imagem Assistida por Computador/métodos

17.

Automatic 3D CT liver segmentation based on fast global minimization of probabilistic active contour.

Jin, Renchao; Wang, Manyang; Xu, Lijun; Lu, Jiayi; Song, Enmin; Ma, Guangzhi.

Med Phys ; 50(4): 2100-2120, 2023 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-36413182

RESUMO

PURPOSE: Automatic liver segmentation from computed tomography (CT) images is an essential preprocessing step for computer-aided diagnosis of liver diseases. However, due to the large differences in liver shapes, low-contrast to adjacent tissues, and existence of tumors or other abnormalities, liver segmentation has been very challenging. This study presents an accurate and fast liver segmentation method based on a novel probabilistic active contour (PAC) model and its fast global minimization scheme (3D-FGMPAC), which is explainable as compared with deep learning methods. METHODS: The proposed method first constructs a slice-indexed-histogram to localize the volume of interest (VOI) and estimate the probability that a voxel belongs to the liver according its intensity. The probabilistic image would be used to initialize the 3D PAC model. Secondly, a new contour indicator function, which is a component of the model, is produced by combining the gradient-based edge detection and Hessian-matrix-based surface detection. Then, a fast numerical scheme derived for the 3D PAC model is performed to evolve the initial probabilistic image into the global minimizer of the model, which is a smoothed probabilistic image showing a distinctly highlighted liver. Next, a simple region-growing strategy is applied to extract the whole liver mask from the smoothed probabilistic image. Finally, a B-spline surface is constructed to fit the patch of the rib cage to prevent possible leakage into adjacent intercostal tissues. RESULTS: The proposed method is evaluated on two public datasets. The average Dice score, volume overlap error, volume difference, symmetric surface distance and volume processing time are 0.96, 7.35%, 0.02%, 1.17 mm and 19.8 s for the Sliver07 dataset, and 0.95, 8.89%, - 0.02 % $-0.02\%$ , 1.45 mm and 23.08 s for the 3Dircadb dataset, respectively. CONCLUSIONS: The proposed fully-automatic approach can effectively segment the liver from low-contrast and complex backgrounds. The quantitative and qualitative results demonstrate that the proposed segmentation method outperforms state-of-the-art traditional automatic liver segmentation algorithms and achieves very competitive performance compared with recent deep leaning-based methods.

Assuntos

Neoplasias Hepáticas , Fígado , Humanos , Fígado/diagnóstico por imagem , Fígado/patologia , Abdome , Neoplasias Hepáticas/diagnóstico por imagem , Neoplasias Hepáticas/patologia , Algoritmos , Tomografia Computadorizada por Raios X/métodos , Processamento de Imagem Assistida por Computador/métodos

18.

A bidirectional registration neural network for cardiac motion tracking using cine MRI images.

Lu, Jiayi; Jin, Renchao; Wang, Manyang; Song, Enmin; Ma, Guangzhi.

Comput Biol Med ; 160: 107001, 2023 06.

Artigo em Inglês | MEDLINE | ID: mdl-37187138

RESUMO

Using cine magnetic resonance imaging (cine MRI) images to track cardiac motion helps users to analyze the myocardial strain, and is of great importance in clinical applications. At present, most of the automatic deep learning-based motion tracking methods compare two images without considering temporal information between MRI frames, which easily leads to the lack of consistency of the generated motion fields. Even though a small number of works take into account the temporal factor, they are usually computationally intensive or have limitations on image length. To solve this problem, we propose a bidirectional convolution neural network for motion tracking of cardiac cine MRI images. This network leverages convolutional blocks to extract spatial features from three-dimensional (3D) image registration pairs, and models the temporal relations through a bidirectional recurrent neural network to obtain the Lagrange motion field between the reference image and other images. Compared with previous pairwise registration methods, the proposed method can automatically learn spatiotemporal information from multiple images with fewer parameters. We evaluated our model on three public cardiac cine MRI datasets. The experimental results demonstrated that the proposed method can significantly improve the motion tracking accuracy. The average Dice coefficient between estimated segmentation and manual segmentation has reached almost 0.85 on the widely used Automatic Cardiac Diagnostic Challenge (ACDC) dataset.

Assuntos

Processamento de Imagem Assistida por Computador , Imagem Cinética por Ressonância Magnética , Coração/diagnóstico por imagem , Processamento de Imagem Assistida por Computador/métodos , Imageamento por Ressonância Magnética , Imagem Cinética por Ressonância Magnética/métodos , Movimento (Física) , Redes Neurais de Computação , Humanos

19.

A discontinuity-preserving regularization for deep learning-based cardiac image registration.

Lu, Jiayi; Jin, Renchao; Wang, Manyang; Song, Enmin; Ma, Guangzhi.

Phys Med Biol ; 68(9)2023 05 03.

Artigo em Inglês | MEDLINE | ID: mdl-37068486

RESUMO

Objective. Sliding motion may occur between organs in anatomical regions due to respiratory motion and heart beating. This issue is often neglected in previous studies, resulting in poor image registration performance. A new approach is proposed to handle discontinuity at the boundary and improve registration accuracy.Approach. The proposed discontinuity-preserving regularization (DPR) term can maintain local discontinuities. It leverages the segmentation mask to find organ boundaries and then relaxes the displacement field constraints in these boundary regions. A weakly supervised method using mask dissimilarity loss (MDL) is also proposed. It employs a simple formula to calculate the similarity between the fixed image mask and the deformed moving image mask. These two strategies are added to the loss function during network training to guide the model better to update parameters. Furthermore, during inference time, no segmentation mask information is needed.Main results. Adding the proposed DPR term increases the Dice coefficients by 0.005, 0.009, and 0.081 for three existing registration neural networks CRNet, VoxelMorph, and ViT-V-Net, respectively. It also shows significant improvements in other metrics, including Hausdorff Distance and Average Surface Distance. All quantitative indicator results with MDL have been slightly improved within 1%. After applying these two regularization terms, the generated displacement field is more reasonable at the boundary, and the deformed moving image is closer to the fixed image.Significance. This study demonstrates that the proposed regularization terms can effectively handle discontinuities at the boundaries of organs and improve the accuracy of deep learning-based cardiac image registration methods. Besides, they are generic to be extended to other networks.

Assuntos

Algoritmos , Aprendizado Profundo , Redes Neurais de Computação , Movimento (Física) , Coração/diagnóstico por imagem , Processamento de Imagem Assistida por Computador/métodos

20.

A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.

Liu, Hong; Zhuang, Yuzhou; Song, Enmin; Xu, Xiangyang; Ma, Guangzhi; Cetinkaya, Coskun; Hung, Chih-Cheng.

Med Phys ; 50(9): 5460-5478, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-36864700

RESUMO

BACKGROUND: Multi-modal learning is widely adopted to learn the latent complementary information between different modalities in multi-modal medical image segmentation tasks. Nevertheless, the traditional multi-modal learning methods require spatially well-aligned and paired multi-modal images for supervised training, which cannot leverage unpaired multi-modal images with spatial misalignment and modality discrepancy. For training accurate multi-modal segmentation networks using easily accessible and low-cost unpaired multi-modal images in clinical practice, unpaired multi-modal learning has received comprehensive attention recently. PURPOSE: Existing unpaired multi-modal learning methods usually focus on the intensity distribution gap but ignore the scale variation problem between different modalities. Besides, within existing methods, shared convolutional kernels are frequently employed to capture common patterns in all modalities, but they are typically inefficient at learning global contextual information. On the other hand, existing methods highly rely on a large number of labeled unpaired multi-modal scans for training, which ignores the practical scenario when labeled data is limited. To solve the above problems, we propose a modality-collaborative convolution and transformer hybrid network (MCTHNet) using semi-supervised learning for unpaired multi-modal segmentation with limited annotations, which not only collaboratively learns modality-specific and modality-invariant representations, but also could automatically leverage extensive unlabeled scans for improving performance. METHODS: We make three main contributions to the proposed method. First, to alleviate the intensity distribution gap and scale variation problems across modalities, we develop a modality-specific scale-aware convolution (MSSC) module that can adaptively adjust the receptive field sizes and feature normalization parameters according to the input. Secondly, we propose a modality-invariant vision transformer (MIViT) module as the shared bottleneck layer for all modalities, which implicitly incorporates convolution-like local operations with the global processing of transformers for learning generalizable modality-invariant representations. Third, we design a multi-modal cross pseudo supervision (MCPS) method for semi-supervised learning, which enforces the consistency between the pseudo segmentation maps generated by two perturbed networks to acquire abundant annotation information from unlabeled unpaired multi-modal scans. RESULTS: Extensive experiments are performed on two unpaired CT and MR segmentation datasets, including a cardiac substructure dataset derived from the MMWHS-2017 dataset and an abdominal multi-organ dataset consisting of the BTCV and CHAOS datasets. Experiment results show that our proposed method significantly outperforms other existing state-of-the-art methods under various labeling ratios, and achieves a comparable segmentation performance close to single-modal methods with fully labeled data by only leveraging a small portion of labeled data. Specifically, when the labeling ratio is 25%, our proposed method achieves overall mean DSC values of 78.56% and 76.18% in cardiac and abdominal segmentation, respectively, which significantly improves the average DSC value of two tasks by 12.84% compared to single-modal U-Net models. CONCLUSIONS: Our proposed method is beneficial for reducing the annotation burden of unpaired multi-modal medical images in clinical applications.

Assuntos

Algoritmos , Coração , Aprendizado de Máquina Supervisionado , Processamento de Imagem Assistida por Computador

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA