Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Network ; : 1-34, 2024 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-38743436

RESUMO

Image denoising is one of the significant approaches for extracting valuable information in the required images without any errors. During the process of image transmission in the wireless medium, a wide variety of noise is presented to affect the image quality. For efficient analysis, an effective denoising approach is needed to enhance the quality of the images. The main scope of this research paper is to correct errors and remove the effects of channel degradation. A corrupted image denoising approach is developed in wireless channels to eliminate the bugs. The required images are gathered from wireless channels at the receiver end. Initially, the collected images are decomposed into several regions using Adaptive Lifting Wavelet Transform (ALWT) and then the "Symmetric Convolution-based Residual Attention Network (SC-RAN)" is employed, where the residual images are obtained by separating the clean image from the noisy images. The parameters present are optimized using Hybrid Energy Golden Tortoise Beetle Optimizer (HEGTBO) to maximize efficiency. The image denoising is performed over the obtained residual images and noisy images to get the final denoised images. The numerical findings of the developed model attain 31.69% regarding PSNR metrics. Thus, the analysis of the developed model shows significant improvement.

2.
BMC Med Imaging ; 24(1): 157, 2024 Jun 24.
Artigo em Inglês | MEDLINE | ID: mdl-38914956

RESUMO

BACKGROUND: For prostate electrosurgery, where real-time surveillance screens are relied upon for operations, manual identification of the prostate capsule remains the primary method. With the need for rapid and accurate detection becoming increasingly urgent, we set out to develop a deep learning approach for detecting the prostate capsule using endoscopic optical images. METHODS: Our method involves utilizing the Simple, Parameter-Free Attention Module(SimAM) residual attention fusion module to enhance the extraction of texture and detail information, enabling better feature extraction capabilities. This enhanced detail information is then hierarchically transferred from lower to higher levels to aid in the extraction of semantic information. By employing a forward feature-by-feature hierarchical fusion network based on the 3D residual attention mechanism, we have proposed an improved single-shot multibox detector model. RESULTS: Our proposed model achieves a detection precision of 83.12% and a speed of 0.014 ms on NVIDIA RTX 2060, demonstrating its effectiveness in rapid detection. Furthermore, when compared to various existing methods including Faster Region-based Convolutional Neural Network (Faster R-CNN), Single Shot Multibox Detector (SSD), EfficientDet and others, our method Attention based Feature Fusion Single Shot Multibox Detector (AFFSSD) stands out with the highest mean Average Precision (mAP) and faster speed, ranking only below You Only Look Once version 7 (YOLOv7). CONCLUSIONS: This network excels in extracting regional features from images while retaining the spatial structure, facilitating the rapid detection of medical images.


Assuntos
Aprendizado Profundo , Imageamento Tridimensional , Humanos , Masculino , Imageamento Tridimensional/métodos , Próstata/diagnóstico por imagem , Neoplasias da Próstata/diagnóstico por imagem
3.
Sensors (Basel) ; 24(11)2024 May 26.
Artigo em Inglês | MEDLINE | ID: mdl-38894221

RESUMO

Aiming at the problems of incomplete dehazing, color distortion, and loss of detail and edge information encountered by existing algorithms when processing images of underground coal mines, an image dehazing algorithm for underground coal mines, named CAB CA DSConv Fusion gUNet (CCDF-gUNet), is proposed. First, Dynamic Snake Convolution (DSConv) is introduced to replace traditional convolutions, enhancing the feature extraction capability. Second, residual attention convolution blocks are constructed to simultaneously focus on both local and global information in images. Additionally, the Coordinate Attention (CA) module is utilized to learn the coordinate information of features so that the model can better capture the key information in images. Furthermore, to simultaneously focus on the detail and structural consistency of images, a fusion loss function is introduced. Finally, based on the test verification of the public dataset Haze-4K, the Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), and Mean Squared Error (MSE) are 30.72 dB, 0.976, and 55.04, respectively, and on a self-made underground coal mine dataset, they are 31.18 dB, 0.971, and 49.66, respectively. The experimental results show that the algorithm performs well in dehazing, effectively avoids color distortion, and retains image details and edge information, providing some theoretical references for image processing in coal mine surveillance videos.

4.
Eur Radiol ; 33(8): 5385-5397, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-36892643

RESUMO

OBJECTIVES: Hippocampal characterization is one of the most significant hallmarks of Alzheimer's disease (AD); rather, the single-level feature is insufficient. A comprehensive hippocampal characterization is pivotal for developing a well-performing biomarker for AD. To verify whether a comprehensive characterization of hippocampal features of gray matter volume, segmentation probability, and radiomics features could better distinguish AD from normal control (NC), and to investigate whether the classification decision score could serve as a robust and individualized brain signature. METHODS: A total of 3238 participants' structural MRI from four independent databases were employed to conduct a 3D residual attention network (3DRA-Net) to classify NC, mild cognitive impairment (MCI), and AD. The generalization was validated under inter-database cross-validation. The neurobiological basis of the classification decision score as a neuroimaging biomarker was systematically investigated by association with clinical profiles, as well as longitudinal trajectory analysis to reveal AD progression. All image analyses were performed only upon the single modality of T1-weighted MRI. RESULTS: Our study exhibited an outstanding performance (ACC = 91.6%, AUC = 0.95) of the comprehensive characterization of hippocampal features in distinguishing AD (n = 282) from NC (n = 603) in Alzheimer's Disease Neuroimaging Initiative cohort, and ACC = 89.2% and AUC = 0.93 under external validation. More importantly, the constructed score was significantly correlated with clinical profiles (p < 0.05), and dynamically altered over the AD longitudinal progression, provided compelling evidence of a solid neurobiological basis. CONCLUSIONS: This systemic study highlights the potential of the comprehensive characterization of hippocampal features to provide an individualized, generalizable, and biologically plausible neuroimaging biomarker for early detection of AD. KEY POINTS: • The comprehensive characterization of hippocampal features exhibited ACC = 91.6% (AUC = 0.95) in classifying AD from NC under intra-database cross-validation, and ACC = 89.2% (AUC = 0.93) in external validation. • The constructed classification score was significantly associated with clinical profiles, and dynamically altered over the AD longitudinal progression, which highlighted its potential of being an individualized, generalizable, and biologically plausible neuroimaging biomarker for early detection of AD.


Assuntos
Doença de Alzheimer , Aprendizado Profundo , Humanos , Doença de Alzheimer/diagnóstico por imagem , Neuroimagem/métodos , Imageamento por Ressonância Magnética/métodos , Hipocampo/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Biomarcadores
5.
Sensors (Basel) ; 23(17)2023 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-37687922

RESUMO

Semantic segmentation and depth estimation are crucial components in the field of autonomous driving for scene understanding. Jointly learning these tasks can lead to a better understanding of scenarios. However, using task-specific networks to extract global features from task-shared networks can be inadequate. To address this issue, we propose a multi-task residual attention network (MTRAN) that consists of a global shared network and two attention networks dedicated to semantic segmentation and depth estimation. The convolutional block attention module is used to highlight the global feature map, and residual connections are added to prevent network degradation problems. To ensure manageable task loss and prevent specific tasks from dominating the training process, we introduce a random-weighted strategy into the impartial multi-task learning method. We conduct experiments to demonstrate the effectiveness of the proposed method.

6.
Sensors (Basel) ; 23(19)2023 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-37836976

RESUMO

Due to the outstanding penetrating detection performance of low-frequency electromagnetic waves, through-wall radar (TWR) has gained widespread applications in various fields, including public safety, counterterrorism operations, and disaster rescue. TWR is required to accomplish various tasks, such as people detection, people counting, and positioning in practical applications. However, most current research primarily focuses on one or two tasks. In this paper, we propose a multitask network that can simultaneously realize people counting, action recognition, and localization. We take the range-time-Doppler (RTD) spectra obtained from one-dimensional (1D) radar signals as datasets and convert the information related to the number, motion, and location of people into confidence matrices as labels. The convolutional layers and novel attention modules automatically extract deep features from the data and output the number, motion category, and localization results of people. We define the total loss function as the sum of individual task loss functions. Through the loss function, we transform the positioning problem into a multilabel classification problem, where a certain position in the distance confidence matrix represents a certain label. On the test set consisting of 10,032 samples from through-wall scenarios with a 24 cm thick brick wall, the accuracy of people counting can reach 96.94%, and the accuracy of motion recognition is 96.03%, with an average distance error of 0.12 m.

7.
Anal Biochem ; 654: 114802, 2022 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-35809650

RESUMO

Knowledge of RNA solvent accessibility has recently become attractive due to the increasing awareness of its importance for key biological process. Accurately predicting the solvent accessibility of RNA is crucial for understanding its 3D structure and biological function. In this study, we develop a novel computational method, termed M2pred, for accurately predicting the solvent accessibility of RNA from sequence-based multi-scale context feature. In M2pred, three single-view features, i.e., base-pairing probabilities, position-specific frequency matrix, and a binary one-hot encoding, are first generated as three feature sources, and immediately concatenated to engender a super feature. Secondly, for the super feature, the matrix-format features of each nucleotide are extracted using an initialized sliding window technique, and regularly stacked into a cube-format feature. Then, using multi-scale context feature extraction strategy, a pyramid feature constructed of contextual feature of four scales related to target nucleotides is extracted from the cube-format feature. Finally, a customized multi-shot neural network framework, which is equipped with four different scales of receptive fields mainly integrating several residual attention blocks, is designed to dig discrimination information from the contextual pyramid feature. Experimental results demonstrate that the proposed M2pred achieve a high prediction performance and outperforms existing state-of-the-art prediction methods of RNA solvent accessibility.


Assuntos
Redes Neurais de Computação , RNA , Nucleotídeos , RNA/química , Solventes/química
8.
Sensors (Basel) ; 22(15)2022 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-35957336

RESUMO

A convolutional neural network based on an improved residual structure is proposed to implement a lightweight classification model for the recognition of complex pavement conditions, which uses RGB-thermal as input and embeds an attention module to adjust the spatial, as well as channel, information of the images. The best prediction accuracy of the proposed model is 98.88%, while the RGB-thermal is used as input and an attention mechanism is used. The attention mechanism increases the attention to detail of the image and regulates the use of image channels, which enhances the final performance of the model. It is also compared with state-of-the-art (SOTA) deep learning models, indicating our model has fewer parameters, shorter training time, and higher recognition accuracy compared to existing image classification models. A visualization method incorporating gradient-weighted class activation mapping (Grad-CAM) is proposed to analyze the classification results, comparing the data the model learns from the images under different input data.


Assuntos
Redes Neurais de Computação
9.
Sensors (Basel) ; 22(22)2022 Nov 11.
Artigo em Inglês | MEDLINE | ID: mdl-36433298

RESUMO

Melanoma is a main factor that leads to skin cancer, and early diagnosis and treatment can significantly reduce the mortality of patients. Skin lesion boundary segmentation is a key to accurately localizing a lesion in dermoscopic images. However, the irregular shape and size of the lesions and the blurred boundary of the lesions pose significant challenges for researchers. In recent years, pixel-level semantic segmentation strategies based on convolutional neural networks have been widely used, but many methods still suffer from the inaccurate segmentation of fuzzy boundaries. In this paper, we proposed a multi-scale hybrid attentional convolutional neural network (MHAU-Net) for the precise localization and segmentation of skin lesions. MHAU-Net has four main components: multi-scale resolution input, hybrid residual attention (HRA), dilated convolution, and atrous spatial pyramid pooling. Multi-scale resolution inputs provide richer visual information, and HRA solves the problem of blurred boundaries and enhances the segmentation results. The Dice, mIoU, average specificity, and sensitivity on the ISIC2018 task 1 validation set were 93.69%, 90.02%, 92.7% and 93.9%, respectively. The segmentation metrics are significantly better than the latest DCSAU-Net, UNeXt, and U-Net, and excellent segmentation results are achieved on different datasets. We performed model robustness validations on the Kvasir-SEG dataset with an overall sensitivity and average specificity of 95.91% and 96.28%, respectively.


Assuntos
Dermatopatias , Neoplasias Cutâneas , Humanos , Processamento de Imagem Assistida por Computador/métodos , Algoritmos , Redes Neurais de Computação , Neoplasias Cutâneas/diagnóstico por imagem , Neoplasias Cutâneas/patologia , Progressão da Doença
10.
J Digit Imaging ; 35(5): 1131-1142, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35789447

RESUMO

Magnetic resonance (MR) imaging is the most sensitive clinical tool in the diagnosis and monitoring of multiple sclerosis (MS) alterations. Spinal cord evaluation has gained interest in this clinical scenario in recent years, but, unlike the brain, there is a more limited choice of algorithms to assist spinal cord segmentation. Our goal was to investigate and develop an automatic MR cervical cord segmentation method, enabling automated and seamless spinal cord atrophy assessment and setting the stage for the development of an aggregated algorithm for the extraction of lesion-related imaging biomarkers. The algorithm was developed using a real-world MR imaging dataset of 121 MS patients (96 cases used as a training dataset and 25 cases as a validation dataset). Transversal, 3D T1-weighted gradient echo MR images (TE/TR/FA = 1.7-2.7 ms/5.6-8.2 ms/12°) were acquired in a 3 T system (Signa HD, GEHC) as standard of care in our clinical practice. Experienced radiologists supervised the manual labelling, which was considered the ground-truth. The 2D convolutional neural network consisted of a hybrid residual attention-aware segmentation method trained to delineate the cervical spinal cord. The training was conducted using a focal loss function, based on the Tversky index to address label imbalance, and an automatic optimal learning rate finder. Our automated model provided an accurate segmentation, achieving a validation DICE coefficient of 0.904 ± 0.101 compared with the manual delineation. An automatic method for cervical spinal cord segmentation on T1-weighted MR images was successfully implemented. It will have direct implications serving as the first step for accelerating the process for MS staging and follow-up through imaging biomarkers.


Assuntos
Medula Cervical , Esclerose Múltipla , Humanos , Medula Cervical/diagnóstico por imagem , Medula Cervical/patologia , Esclerose Múltipla/diagnóstico por imagem , Esclerose Múltipla/patologia , Processamento de Imagem Assistida por Computador/métodos , Redes Neurais de Computação , Imageamento por Ressonância Magnética/métodos , Medula Espinal/patologia , Atenção
11.
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi ; 39(2): 301-310, 2022 Apr 25.
Artigo em Chinês | MEDLINE | ID: mdl-35523551

RESUMO

Electrocardiogram (ECG) can visually reflect the physiological electrical activity of human heart, which is important in the field of arrhythmia detection and classification. To address the negative effect of label imbalance in ECG data on arrhythmia classification, this paper proposes a nested long short-term memory network (NLSTM) model for unbalanced ECG signal classification. The NLSTM is built to learn and memorize the temporal characteristics in complex signals, and the focal loss function is used to reduce the weights of easily identifiable samples. Then the residual attention mechanism is used to modify the assigned weights according to the importance of sample characteristic to solve the sample imbalance problem. Then the synthetic minority over-sampling technique is used to perform a simple manual oversampling process on the Massachusetts institute of technology and Beth Israel hospital arrhythmia (MIT-BIH-AR) database to further increase the classification accuracy of the model. Finally, the MIT-BIH arrhythmia database is applied to experimentally verify the above algorithms. The experimental results show that the proposed method can effectively solve the issues of imbalanced samples and unremarkable features in ECG signals, and the overall accuracy of the model reaches 98.34%. It also significantly improves the recognition and classification of minority samples and has provided a new feasible method for ECG-assisted diagnosis, which has practical application significance.


Assuntos
Memória de Curto Prazo , Redes Neurais de Computação , Algoritmos , Arritmias Cardíacas/diagnóstico , Eletrocardiografia , Humanos , Processamento de Sinais Assistido por Computador
12.
Sensors (Basel) ; 20(22)2020 Nov 18.
Artigo em Inglês | MEDLINE | ID: mdl-33218040

RESUMO

Automatic sleep stage classification of multi-channel sleep signals can help clinicians efficiently evaluate an individual's sleep quality and assist in diagnosing a possible sleep disorder. To obtain accurate sleep classification results, the processing flow of results from signal preprocessing and machine-learning-based classification is typically employed. These classification results are refined based on sleep transition rules. Neural networks-i.e., machine learning algorithms-are powerful at solving classification problems. Some methods apply them to the first two processes above; however, the refinement process continues to be based on traditional methods. In this study, the sleep stage refinement process was incorporated into the neural network model to form real end-to-end processing. In addition, for multi-channel signals, the multi-branch convolutional neural network was combined with a proposed residual attention method. This approach further improved the model classification accuracy. The proposed method was evaluated on the Sleep-EDF Expanded Database (Sleep-EDFx) and University College Dublin Sleep Apnea Database (UCDDB). It achieved respective accuracy rates of 85.7% and 79.4%. The results also showed that sleep stage refinement based on a neural network is more effective than the traditional refinement method. Moreover, the proposed residual attention method was determined to have a more robust channel-information fusion ability than the respective average and concatenation methods.

13.
Neural Netw ; 169: 378-387, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37924607

RESUMO

The effective use of temporal relationships while extracting fertile spatial features is the key to video action understanding. Video action understanding is a challenging visual task because it generally necessitates not only the features of individual key frames but also the contextual understanding of the entire video and the relationships among key frames. Temporal relationships pose a challenge to video action understanding. However, existing 3D convolutional neural network approaches are limited, with a great deal of redundant spatial and temporal information. In this paper, we present a novel two-stream approach that incorporates Spatial Residual Attention and Temporal Markov (SRATM) to learn complementary features to achieve stronger video action understanding performance. Specifically, the proposed SRATM consists of spatial residual attention and temporal Markov. Firstly, the spatial residual attention network captures effective spatial feature representation. Further, the temporal Markov network enhances the model by learning the temporal relationships via conducting probabilistic logic calculation among frames in a video. Finally, we conduct extensive experiments on four video action datasets, namely, Something-Something-V1, Something-Something-V2, Diving48, and Mini-Kinetics, show that the proposed SRATM method achieves competitive results.


Assuntos
Aprendizagem , Redes Neurais de Computação , Física
14.
Magn Reson Imaging ; 111: 157-167, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-38642780

RESUMO

Deep cascaded networks have been extensively studied and applied to accelerate Magnetic Resonance Imaging (MRI) and have shown promising results. Most existing works employ a large cascading number for the sake of superior performances. However, due to the lack of proper guidance, the reconstruction performance can easily reach a plateau and even face degradation if simply increasing the cascading number. In this paper, we aim to boost the reconstruction performance from a novel perspective by proposing a parallel architecture called DIRECTION that fully exploits the guiding value of the reconstruction residual of each subnetwork. Specifically, we introduce a novel Reconstruction Residual-Based Feature Modulation Mechanism (RRFMM) which utilizes the reconstruction residual of the previous subnetwork to guide the next subnetwork at the feature level. To achieve this, a Residual Attention Modulation Block (RAMB) is proposed to generate attention maps using multi-scale residual features to modulate the image features of the corresponding scales. Equipped with this strategy, each subnetwork within the cascaded network possesses its unique optimization objective and emphasis rather than blindly updating its parameters. To further boost the performance, we introduce the Cross-Stage Feature Reuse Connection (CSFRC) and the Reconstruction Dense Connection (RDC), which can reduce information loss and enhance representative ability. We conduct sufficient experiments and evaluate our method on the fastMRI knee dataset using multiple subsampling masks. Comprehensive experimental results show that our method can markedly boost the performance of cascaded networks and significantly outperforms other compared state-of-the-art methods quantitatively and qualitatively.


Assuntos
Algoritmos , Processamento de Imagem Assistida por Computador , Imageamento por Ressonância Magnética , Imageamento por Ressonância Magnética/métodos , Humanos , Processamento de Imagem Assistida por Computador/métodos , Encéfalo/diagnóstico por imagem , Redes Neurais de Computação , Aprendizado Profundo
15.
Phys Med Biol ; 69(16)2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-39047770

RESUMO

Objective. Convolutional neural network (CNN) is developing rapidly in the field of medical image registration, and the proposed U-Net further improves the precision of registration. However, this method may discard certain important information in the process of encoding and decoding steps, consequently leading to a decline in accuracy. To solve this problem, a multi-channel semantic-aware and residual attention mechanism network (MSRA-Net) is proposed in this paper.Approach. Our proposed network achieves efficient information aggregation by cleverly extracting the features of different channels. Firstly, a context-aware module (CAM) is designed to extract valuable contextual information. And the depth-wise separable convolution is employed in the CAM to alleviate the computational burden. Then, a new multi-channel semantic-aware module (MCSAM) is designed for more comprehensive fusion of up-sampling features. Additionally, the residual attention module is introduced in the up-sampling process to extract more semantic information and minimize information loss.Main results. This study utilizes Dice score, average symmetric surface distance and negative Jacobian determinant evaluation metrics to evaluate the influence of registration. The experimental results demonstrate that our proposed MSRA-Net has the highest accuracy compared to several state-of-the-art methods. Moreover, our network has demonstrated the highest Dice score across multiple datasets, thereby indicating that the superior generalization capabilities of our model.Significance. The proposed MSRA-Net offers a novel approach to improve medical image registration accuracy, with implications for various clinical applications. Our implementation is available athttps://github.com/shy922/MSRA-Net.


Assuntos
Imageamento Tridimensional , Redes Neurais de Computação , Semântica , Imageamento Tridimensional/métodos , Humanos , Aprendizado de Máquina não Supervisionado
16.
Artif Intell Med ; 150: 102827, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38553166

RESUMO

Due to the surging of cost, artificial intelligence-assisted de novo drug design has supplanted conventional methods and become an emerging option for drug discovery. Although there have arisen many successful examples of applying generative models to the molecular field, these methods struggle to deal with conditional generation that meet chemists' practical requirements which ask for a controllable process to generate new molecules or optimize basic molecules with appointed conditions. To address this problem, a Recurrent Molecular-Generative Pretrained Transformer model is proposed, supplemented by LocalRNN and Residual Attention Layer Transformer, referred to as RM-GPT. RM-GPT rebuilds GPT model's architecture by incorporating LocalRNN and Residual Attention Layer Transformer so that it is able to extract local information and build connectivity between attention blocks. The incorporation of Transformer in these two modules enables leveraging the parallel computing advantages of multi-head attention mechanisms while extracting local structural information effectively. Through exploring and learning in a large chemical space, RM-GPT absorbs the ability to generate drug-like molecules with conditions in demand, such as desired properties and scaffolds, precisely and stably. RM-GPT achieved better results than SOTA methods on conditional generation.


Assuntos
Inteligência Artificial , Aprendizagem
17.
Int J Neural Syst ; 34(7): 2450033, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38623651

RESUMO

Surgical reconstruction of mandibular defects is a clinical routine manner for the rehabilitation of patients with deformities. The mandible plays a crucial role in maintaining the facial contour and ensuring the speech and mastication functions. The repairing and reconstruction of mandible defects is a significant yet challenging task in oral-maxillofacial surgery. Currently, the mainly available methods are traditional digitalized design methods that suffer from substantial artificial operations, limited applicability and high reconstruction error rates. An automated, precise, and individualized method is imperative for maxillofacial surgeons. In this paper, we propose a Stage-wise Residual Attention Generative Adversarial Network (SRA-GAN) for mandibular defect reconstruction. Specifically, we design a stage-wise residual attention mechanism for generator to enhance the extraction capability of mandibular remote spatial information, making it adaptable to various defects. For the discriminator, we propose a multi-field perceptual network, consisting of two parallel discriminators with different perceptual fields, to reduce the cumulative reconstruction errors. Furthermore, we design a self-encoder perceptual loss function to ensure the correctness of mandibular anatomical structures. The experimental results on a novel custom-built mandibular defect dataset demonstrate that our method has a promising prospect in clinical application, achieving the best Dice Similarity Coefficient (DSC) of 94.238% and 95% Hausdorff Distance (HD95) of 4.787.


Assuntos
Mandíbula , Reconstrução Mandibular , Redes Neurais de Computação , Humanos , Mandíbula/cirurgia , Reconstrução Mandibular/métodos , Atenção/fisiologia
18.
Front Microbiol ; 15: 1453870, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39224212

RESUMO

The synthesis of pseudo-healthy images, involving the generation of healthy counterparts for pathological images, is crucial for data augmentation, clinical disease diagnosis, and understanding pathology-induced changes. Recently, Generative Adversarial Networks (GANs) have shown substantial promise in this domain. However, the heterogeneity of intracranial infection symptoms caused by various infections complicates the model's ability to accurately differentiate between pathological and healthy regions, leading to the loss of critical information in healthy areas and impairing the precise preservation of the subject's identity. Moreover, for images with extensive lesion areas, the pseudo-healthy images generated by these methods often lack distinct organ and tissue structures. To address these challenges, we propose a three-stage method (localization, inpainting, synthesis) that achieves nearly perfect preservation of the subject's identity through precise pseudo-healthy synthesis of the lesion region and its surroundings. The process begins with a Segmentor, which identifies the lesion areas and differentiates them from healthy regions. Subsequently, a Vague-Filler fills the lesion areas to construct a healthy outline, thereby preventing structural loss in cases of extensive lesions. Finally, leveraging this healthy outline, a Generative Adversarial Network integrated with a contextual residual attention module generates a more realistic and clearer image. Our method was validated through extensive experiments across different modalities within the BraTS2021 dataset, achieving a healthiness score of 0.957. The visual quality of the generated images markedly exceeded those produced by competing methods, with enhanced capabilities in repairing large lesion areas. Further testing on the COVID-19-20 dataset showed that our model could effectively partially reconstruct images of other organs.

19.
Sci Rep ; 14(1): 20622, 2024 09 04.
Artigo em Inglês | MEDLINE | ID: mdl-39232053

RESUMO

Alzheimer's Disease (AD) causes slow death in brain cells due to shrinkage of brain cells which is more prevalent in older people. In most cases, the symptoms of AD are mistaken as age-related stresses. The most widely utilized method to detect AD is Magnetic Resonance Imaging (MRI). Along with Artificial Intelligence (AI) techniques, the efficacy of identifying diseases related to the brain has become easier. But, the identical phenotype makes it challenging to identify the disease from the neuro-images. Hence, a deep learning method to detect AD at the beginning stage is suggested in this work. The newly implemented "Enhanced Residual Attention with Bi-directional Long Short-Term Memory (Bi-LSTM) (ERABi-LNet)" is used in the detection phase to identify the AD from the MRI images. This model is used for enhancing the performance of the Alzheimer's detection in scale of 2-5%, minimizing the error rates, increasing the balance of the model, so that the multi-class problems are supported. At first, MRI images are given to "Residual Attention Network (RAN)", which is specially developed with three convolutional layers, namely atrous, dilated and Depth-Wise Separable (DWS), to obtain the relevant attributes. The most appropriate attributes are determined by these layers, and subjected to target-based fusion. Then the fused attributes are fed into the "Attention-based Bi-LSTM". The final outcome is obtained from this unit. The detection efficiency based on median is 26.37% and accuracy is 97.367% obtained by tuning the parameters in the ERABi-LNet with the help of Modified Search and Rescue Operations (MCDMR-SRO). The obtained results are compared with ROA-ERABi-LNet, EOO-ERABi-LNet, GTBO-ERABi-LNet and SRO-ERABi-LNet respectively. The ERABi_LNet thus provides enhanced accuracy and other performance metrics compared to such deep learning models. The proposed method has the better sensitivity, specificity, F1-Score and False Positive Rate compared with all the above mentioned competing models with values such as 97.49%.97.84%,97.74% and 2.616 respective;y. This ensures that the model has better learning capabilities and provides lesser false positives with balanced prediction.


Assuntos
Doença de Alzheimer , Imageamento por Ressonância Magnética , Humanos , Doença de Alzheimer/diagnóstico por imagem , Doença de Alzheimer/patologia , Imageamento por Ressonância Magnética/métodos , Aprendizado Profundo , Memória de Curto Prazo/fisiologia , Encéfalo/diagnóstico por imagem , Encéfalo/patologia , Redes Neurais de Computação , Idoso
20.
Med Biol Eng Comput ; 61(11): 3067-3085, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-37624534

RESUMO

Medical image fusion aims to integrate complementary information from multimodal medical images and has been widely applied in the field of medicine, such as clinical diagnosis, pathology analysis, and healing examinations. For the fusion task, feature extraction is a crucial step. To obtain significant information embedded in medical images, many deep learning-based algorithms have been proposed recently and achieved good fusion results. However, most of them can hardly capture the independent and underlying features, which leads to unsatisfactory fusion results. To address these issues, a multibranch residual attention reconstruction network (MBRARN) is proposed for the medical image fusion task. The proposed network mainly consists of three parts: feature extraction, feature fusion, and feature reconstruction. Firstly, the input medical images are converted into three scales by image pyramid operation and then are input into three branches of the proposed network respectively. The purpose of this procedure is to capture the local detailed information and the global structural information. Then, convolutions with residual attention modules are designed, which can not only enhance the captured outstanding features, but also make the network converge fast and stably. Finally, feature fusion is performed with the designed fusion strategy. In this step, a new more effective fusion strategy is correspondently designed for MRI-SPECT based on the Euclidean norm, called feature distance ratio (FDR). The experimental results conducted on Harvard whole brain atlas dataset demonstrate that the proposed network can achieve better results in terms of both subjective and objective evaluation, compared with some state-of-the-art medical image fusion algorithms.


Assuntos
Algoritmos , Encéfalo , Encéfalo/diagnóstico por imagem , Tomografia Computadorizada de Emissão de Fóton Único , Processamento de Imagem Assistida por Computador
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa