RESUMO
The advancement of medical imaging has profoundly impacted our understanding of the human body and various diseases. It has led to the continuous refinement of related technologies over many years. Despite these advancements, several challenges persist in the development of medical imaging, including data shortages characterized by low contrast, high noise levels, and limited image resolution. The U-Net architecture has significantly evolved to address these challenges, becoming a staple in medical imaging due to its effective performance and numerous updated versions. However, the emergence of Transformer-based models marks a new era in deep learning for medical imaging. These models and their variants promise substantial progress, necessitating a comparative analysis to comprehend recent advancements. This review begins by exploring the fundamental U-Net architecture and its variants, then examines the limitations encountered during its evolution. It then introduces the Transformer-based self-attention mechanism and investigates how modern models incorporate positional information. The review emphasizes the revolutionary potential of Transformer-based techniques, discusses their limitations, and outlines potential avenues for future research.
Assuntos
Diagnóstico por Imagem , Humanos , Diagnóstico por Imagem/métodos , Diagnóstico por Imagem/tendências , Aprendizado Profundo , Processamento de Imagem Assistida por Computador/métodos , Redes Neurais de ComputaçãoRESUMO
Efficiency and comfort in buildings rely on on well-functioning HVAC systems. However, system faults can compromise performance. Modern data-driven fault detection methods, considering diverse techniques, encounter challenges in understanding intricate interactions and adapting to dynamic conditions present in HVAC systems during occupancy periods. Implementing fault detection during active operation, which aligns with real-world scenarios and captures dynamic interactions and environmental changes, is considered highly valuable. To address this, utilizing the dynamic simulation system HVAC SIMulation PLUS (HVACSIM+), an HVAC fault model was developed using 194 sensor signals from each HVAC component within a single-story, four-room building. The advanced HVAC fault detection framework, leveraging simulated HVAC operational scenarios with the Gramian angular field (GAF) and two-dimensional convolutional neural networks (GAF-2DCNNs), offers a robust and proactive solution. By utilizing the GAF capacity to convert time-series sensor data into informative 2D images, integrated with 2DCNN for automated feature extraction, hidden temporal relationships within 1D signals are captured. After training on nine significant HVAC faults and normal conditions during occupancy, the effectiveness of the proposed GAF-2DCNN is evaluated through comparisons with support vector machine (SVM), random forest (RF), and hybrid RF-SVM, one-dimensional convolutional neural networks (1D-CNNs). The results demonstrates an impressive overall accuracy of 97%, accompanied by precision, recall, and F1 scores that surpass 90% for individual HVAC faults. Through the introduction of the unified approach that integrates HVACSIM+ simulated data and GAF-2DCNN, a notable enhancement in robustness and reliability for handling substantial HVAC faults is achieved.
RESUMO
The brain-computer interface (BCI) has many applications in various fields. In EEG-based research, an essential step is signal denoising. In this paper, a generative adversarial network (GAN)-based denoising method is proposed to denoise the multichannel EEG signal automatically. A new loss function is defined to ensure that the filtered signal can retain as much effective original information and energy as possible. This model can imitate and integrate artificial denoising methods, which reduces processing time; hence it can be used for a large amount of data processing. Compared to other neural network denoising models, the proposed model has one more discriminator, which always judges whether the noise is filtered out. The generator is constantly changing the denoising way. To ensure the GAN model generates EEG signals stably, a new normalization method called sample entropy threshold and energy threshold-based (SETET) normalization is proposed to check the abnormal signals and limit the range of EEG signals. After the denoising system is established, although the denoising model uses the different subjects' data for training, it can still apply to the new subjects' data denoising. The experiments discussed in this paper employ the HaLT public dataset. Correlation and root mean square error (RMSE) are used as evaluation criteria. Results reveal that the proposed automatic GAN denoising network achieves the same performance as the manual hybrid artificial denoising method. Moreover, the GAN network makes the denoising process automatic, representing a significant reduction in time.
Assuntos
Interfaces Cérebro-Computador , Processamento de Sinais Assistido por Computador , Algoritmos , Eletroencefalografia , Humanos , Redes Neurais de ComputaçãoRESUMO
INTRODUCTION: Obstructive sleep apnea (OSA) can cause serious health problems such as hypertension or cardiovascular disease. The manual detection of apnea is a time-consuming task, and automatic diagnosis is much more desirable. The contribution of this work is to detect OSA using a multi-error-reduction (MER) classification system with multi-domain features from bio-signals. METHODS: Time-domain, frequency-domain, and non-linear analysis features are extracted from oxygen saturation (SaO2), ECG, airflow, thoracic, and abdominal signals. To analyse the significance of each feature, we design a two-stage feature selection. Stage 1 is the statistical analysis stage, and Stage 2 is the final feature subset selection stage using machine learning methods. In Stage 1, two statistical analyses (the one-way analysis of variance (ANOVA) and the rank-sum test) provide a list of the significance level of each kind of feature. Then, in Stage 2, the support vector machine (SVM) algorithm is used to select a final feature subset based on the significance list. Next, an MER classification system is constructed, which applies a stacking with a structure that consists of base learners and an artificial neural network (ANN) meta-learner. RESULTS: The Sleep Heart Health Study (SHHS) database is used to provide bio-signals. A total of 66 features are extracted. In the experiment that involves a duration parameter, 19 features are selected as the final feature subset because they provide a better and more stable performance. The SVM model shows good performance (accuracy = 81.68%, sensitivity = 97.05%, and specificity = 66.54%). It is also found that classifiers have poor performance when they predict normal events in less than 60 s. In the next experiment stage, the time-window segmentation method with a length of 60s is used. After the above two-stage feature selection procedure, 48 features are selected as the final feature subset that give good performance (accuracy = 90.80%, sensitivity = 93.95%, and specificity = 83.82%). To conduct the classification, Gradient Boosting, CatBoost, Light GBM, and XGBoost are used as base learners, and the ANN is used as the meta-learner. The performance of this MER classification system has the accuracy of 94.66%, the sensitivity of 96.37%, and the specificity of 90.83%.
Assuntos
Síndromes da Apneia do Sono/diagnóstico , Algoritmos , Técnicas Biossensoriais/métodos , Humanos , Aprendizado de Máquina , Polissonografia/métodos , Sensibilidade e Especificidade , Sono/fisiologia , Apneia Obstrutiva do Sono/diagnóstico , Máquina de Vetores de SuporteRESUMO
The malfunctioning of the heating, ventilating, and air conditioning (HVAC) system is considered to be one of the main challenges in modern buildings. Due to the complexity of the building management system (BMS) with operational data input from a large number of sensors used in HVAC system, the faults can be very difficult to detect in the early stage. While numerous fault detection and diagnosis (FDD) methods with the use of statistical modeling and machine learning have revealed prominent results in recent years, early detection remains a challenging task since many current approaches are unfeasible for diagnosing some HVAC faults and have accuracy performance issues. In view of this, this study presents a novel hybrid FDD approach by combining random forest (RF) and support vector machine (SVM) classifiers for the application of FDD for the HVAC system. Experimental results demonstrate that our proposed hybrid random forest-support vector machine (HRF-SVM) outperforms other methods with higher prediction accuracy (98%), despite that the fault symptoms were insignificant. Furthermore, the proposed framework can reduce the significant number of sensors required and work well with the small number of faulty training data samples available in real-world applications.
Assuntos
Ar Condicionado , Máquina de Vetores de Suporte , Calefação , Aprendizado de Máquina , Modelos EstatísticosRESUMO
People with sleep apnea (SA) are at increased risk of having stroke and cardiovascular diseases. Polysomnography (PSG) is used to detect SA. This paper conducts feature selection from PSG signals and uses a support vector machine (SVM) to detect SA. To analyze SA, the Physionet Apnea Database was used to obtain various features. Electrocardiography (ECG), oxygen saturation (SaO2), airflow, abdominal, and thoracic signals were used to provide various frequency-, time-domain and non-linear features (n = 87). To analyse the significance of these features, firstly, two evaluation measures, the rank-sum method and the analysis of variance (ANOVA) were used to evaluate the significance of the features. These features were then classified according to their significance. Finally, different class feature sets were presented as inputs for an SVM classifier to detect the onset of SA. The hill-climbing feature selection algorithm and the k-fold cross-validation method were applied to evaluate each classification performance. Through the experiments, we discovered that the best feature set (including the top-five significant features) obtained the best classification performance. Furthermore, we plotted receiver operating characteristic (ROC) curves to examine the performance of the SVM, and the results showed the SVM with Linear kernel (regularization parameter = 1) outperformed other classifiers (area under curve = 95.23%, sensitivity = 94.29%, specificity = 96.17%). The results confirm that feature subsets based on multiple bio-signals have the potential to identify patients with SA. The use of a smaller subset avoids dimensionality problems and reduces the computational load.
Assuntos
Síndromes da Apneia do Sono , Algoritmos , Eletrocardiografia , Humanos , Polissonografia , Síndromes da Apneia do Sono/diagnóstico , Máquina de Vetores de SuporteRESUMO
Computer-aided algorithm plays an important role in disease diagnosis through medical images. As one of the major cancers, lung cancer is commonly detected by computer tomography. To increase the survival rate of lung cancer patients, an early-stage diagnosis is necessary. In this paper, we propose a new structure, multi-level cross residual convolutional neural network (ML-xResNet), to classify the different types of lung nodule malignancies. ML-xResNet is constructed by three-level parallel ResNets with different convolution kernel sizes to extract multi-scale features of the inputs. Moreover, the residuals are connected not only with the current level but also with other levels in a crossover manner. To illustrate the performance of ML-xResNet, we apply the model to process ternary classification (benign, indeterminate, and malignant lung nodules) and binary classification (benign and malignant lung nodules) of lung nodules, respectively. Based on the experiment results, the proposed ML-xResNet achieves the best results of 85.88% accuracy for ternary classification and 92.19% accuracy for binary classification, without any additional handcrafted preprocessing algorithm.
Assuntos
Neoplasias Pulmonares , Nódulo Pulmonar Solitário , Algoritmos , Humanos , Pulmão , Neoplasias Pulmonares/diagnóstico por imagem , Redes Neurais de Computação , Nódulo Pulmonar Solitário/diagnóstico por imagem , Tomografia Computadorizada por Raios XRESUMO
To solve the real-time complex mission-planning problem for Multiple heterogeneous Unmanned Aerial Vehicles (UAVs) in the dynamic environments, this paper addresses a new approach by effectively adapting the Consensus-Based Bundle Algorithms (CBBA) under the constraints of task timing, limited UAV resources, diverse types of tasks, dynamic addition of tasks, and real-time requirements. We introduce the dynamic task generation mechanism, which satisfied the task timing constraints. The tasks that require the cooperation of multiple UAVs are simplified into multiple sub-tasks to perform by a single UAV independently. We also introduce the asynchronous task allocation mechanism. This mechanism reduces the computational complexity of the algorithm and the communication time between UAVs. The partial task redistribution mechanism has been adopted for achieving the dynamic task allocation. The real-time performance of the algorithm is assured on the premise of optimal results. The feasibility and real-time performance of the algorithm are validated by conducting dynamic simulation experiments.
RESUMO
This paper presents a smart "e-nose" device to monitor indoor hazardous air. Indoor hazardous odor is a threat for seniors, infants, children, pregnant women, disabled residents, and patients. To overcome the limitations of using existing non-intelligent, slow-responding, deficient gas sensors, we propose a novel artificial-intelligent-based multiple hazard gas detector (MHGD) system that is mounted on a motor vehicle-based robot which can be remotely controlled. First, we optimized the sensor array for the classification of three hazardous gases, including cigarette smoke, inflammable ethanol, and off-flavor from spoiled food, using an e-nose with a mixing chamber. The mixing chamber can prevent the impact of environmental changes. We compared the classification results of all combinations of sensors, and selected the one with the highest accuracy (98.88%) as the optimal sensor array for the MHGD. The optimal sensor array was then mounted on the MHGD to detect and classify the target gases without a mixing chamber but in a controlled environment. Finally, we tested the MHGD under these conditions, and achieved an acceptable accuracy (70.00%).
RESUMO
BACKGROUND: One of the key challenges of the biomedical cyber-physical system is to combine cognitive neuroscience with the integration of physical systems to assist people with disabilities. Electroencephalography (EEG) has been explored as a non-invasive method of providing assistive technology by using brain electrical signals. METHODS: This paper presents a unique prototype of a hybrid brain computer interface (BCI) which senses a combination classification of mental task, steady state visual evoked potential (SSVEP) and eyes closed detection using only two EEG channels. In addition, a microcontroller based head-mounted battery-operated wireless EEG sensor combined with a separate embedded system is used to enhance portability, convenience and cost effectiveness. This experiment has been conducted with five healthy participants and five patients with tetraplegia. RESULTS: Generally, the results show comparable classification accuracies between healthy subjects and tetraplegia patients. For the offline artificial neural network classification for the target group of patients with tetraplegia, the hybrid BCI system combines three mental tasks, three SSVEP frequencies and eyes closed, with average classification accuracy at 74% and average information transfer rate (ITR) of the system of 27 bits/min. For the real-time testing of the intentional signal on patients with tetraplegia, the average success rate of detection is 70% and the speed of detection varies from 2 to 4 s.
Assuntos
Pesquisa Biomédica/instrumentação , Interfaces Cérebro-Computador , Eletroencefalografia/instrumentação , Fenômenos Eletrofisiológicos , Tecnologia sem Fio , Adulto , Idoso , Idoso de 80 Anos ou mais , Algoritmos , Encéfalo/fisiologia , Encéfalo/fisiopatologia , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Traumatismos da Medula Espinal/fisiopatologiaRESUMO
In medical image analysis, anatomical landmarks usually contain strong prior knowledge of their structural information. In this paper, we propose to promote medical landmark localization by modeling the underlying landmark distribution via normalizing flows. Specifically, we introduce the flow-based landmark distribution prior as a learnable objective function into a regression-based landmark localization framework. Moreover, we employ an integral operation to make the mapping from heatmaps to coordinates differentiable to further enhance heatmap-based localization with the learned distribution prior. Our proposed Normalizing Flow-based Distribution Prior (NFDP) employs a straightforward backbone and non-problem-tailored architecture (i.e., ResNet18), which delivers high-fidelity outputs across three X-ray-based landmark localization datasets. Remarkably, the proposed NFDP can do the job with minimal additional computational burden as the normalizing flows module is detached from the framework on inferencing. As compared to existing techniques, our proposed NFDP provides a superior balance between prediction accuracy and inference speed, making it a highly efficient and effective approach. The source code of this paper is available at https://github.com/jacksonhzx95/NFDP.
Assuntos
Algoritmos , Pontos de Referência Anatômicos , Processamento de Imagem Assistida por Computador , Humanos , Processamento de Imagem Assistida por Computador/métodos , Pontos de Referência Anatômicos/diagnóstico por imagem , Tomografia Computadorizada por Raios X/métodosRESUMO
OBJECTIVE: Scoliosis is a spinal deformation in which the spine takes a lateral curvature, generating an angle in the coronal plane. The conventional method for detecting scoliosis is measurement of the Cobb angle in spine images obtained by anterior X-ray scanning. Ultrasound imaging of the spine is found to be less ionising than traditional radiographic modalities. For posterior ultrasound scanning, alternate indices of the spinous process angle (SPA) and ultrasound curve angle (UCA) were developed and have proven comparable to those of the traditional Cobb angle. In SPA, the measurements are made using the spinous processes as an anatomical reference, leading to an underestimation of the traditionally used Cobb angles. Alternatively, in UCA, more lateral features of the spine are employed for measurement of the main thoracic and thoracolumbar angles; however, clear identification of bony features is required. The current practice of UCA angle measurement is manual. This research attempts to automate the process so that the errors related to human intervention can be avoided and the scalability of ultrasound scoliosis diagnosis can be improved. The key objective is to develop an automatic scoliosis diagnosis system using 3-D ultrasound imaging. METHODS: The novel diagnosis system is a three-step process: (i) finding the ultrasound spine image with the most visible lateral features using the convolutional RankNet algorithm; (ii) segmenting the bony features from the noisy ultrasound images using joint spine segmentation and noise removal; and (iii) calculating the UCA automatically using a newly developed centroid pairing and inscribed rectangle slope method. RESULTS: The proposed method was evaluated on 109 patients with scoliosis of different severity. The results obtained had a good correlation with manually measured UCAs (R2=0.9784 for the main thoracic angle andR2=0.9671 for the thoracolumbar angle) and a clinically acceptable mean absolute difference of the main thoracic angle (2.82 ± 2.67°) and thoracolumbar angle (3.34 ± 2.83°). CONCLUSION: The proposed method establishes a very promising approach for enabling the applications of economic 3-D ultrasound volume projection imaging for mass screening of scoliosis.
Assuntos
Escoliose , Humanos , Escoliose/diagnóstico por imagem , Coluna Vertebral/diagnóstico por imagem , Ultrassonografia/métodos , Radiografia , Imageamento TridimensionalRESUMO
The Brain Computer Interface (BCI) is the communication between the human brain and the computer. Electroencephalogram (EEG) is one of the biomedical signals which can be obtained by attaching electrodes to the scalp. Some EEG related applications can be developed to help disabled people, such as EEG based wheelchair or robotic arm. A hybrid BCI real-time control system is proposed to control a multi-tasks BCI robot. In this system, a sliding window based online data segmentation strategy is proposed to segment training data, which enable the system to learn the dynamic features when the subject's brain state transfer from a rest state to a task execution state. The features help the system achieve real-time control and ensure the continuity of executing actions. In addition, Common Spatial Pattern (CSP) can better extract the spatial features of these continuous actions from the dynamic data to ensure that multiple control commands are accurately classified. In the experiment, three subjects' EEG data is collected, trained and tested the performance and reliability of the proposed control system. The system records the robot's spending time, moving distance, and the number of objects pushing down. Experimental results are given to show the feasibility of the real-time control system. Compared to real-time remote controller, the proposed system can achieve similar performance. Thus, the proposed hybrid BCI real-time control system is able to control the robot in the real-time environment and can be used to develop robot-aided arm training methods based on neurological rehabilitation principles for stroke and brain injury patients.
Assuntos
Interfaces Cérebro-Computador , Procedimentos Cirúrgicos Robóticos , Robótica , Humanos , Reprodutibilidade dos Testes , Eletroencefalografia/métodosRESUMO
The growth of population and the various intensive life pressures everyday deepen the competitions among people. Tens of millions of people each year suffer from depression and only a fraction receives adequate treatment. The development of social networks such as Facebook, Twitter, Weibo, and QQ provides more convenient communication and provides a new emotional vent window. People communicate with their friends, sharing their opinions, and shooting videos to reflect their feelings. It provides an opportunity to detect depression in social networks. Although depression detection using social networks has reflected the established connectivity across users, fewer researchers consider the data security and privacy-preserving schemes. Therefore, we advocate the federated learning technique as an efficient and scalable method, where it enables the handling of a massive number of edge devices in parallel. In this study, we conduct the depression analysis on the basis of an online microblog called Weibo. A novel algorithm termed as CNN Asynchronous Federated optimization (CAFed) is proposed based on federated learning to improve the communication cost and convergence rate. It is shown that our proposed method can effectively protect users' privacy under the premise of ensuring the accuracy of prediction. The proposed method converges faster than the Federated Averaging (FedAvg) for non-convex problems. Federated learning techniques can identify quality solutions of mental health problems among Weibo users.
RESUMO
In recent years, cardiovascular diseases (CVDs) have become one of the leading causes of mortality globally. At early stages, CVDs appear with minor symptoms and progressively get worse. The majority of people experience symptoms such as exhaustion, shortness of breath, ankle swelling, fluid retention, and other symptoms when starting CVD. Coronary artery disease (CAD), arrhythmia, cardiomyopathy, congenital heart defect (CHD), mitral regurgitation, and angina are the most common CVDs. Clinical methods such as blood tests, electrocardiography (ECG) signals, and medical imaging are the most effective methods used for the detection of CVDs. Among the diagnostic methods, cardiac magnetic resonance imaging (CMRI) is increasingly used to diagnose, monitor the disease, plan treatment and predict CVDs. Coupled with all the advantages of CMR data, CVDs diagnosis is challenging for physicians as each scan has many slices of data, and the contrast of it might be low. To address these issues, deep learning (DL) techniques have been employed in the diagnosis of CVDs using CMR data, and much research is currently being conducted in this field. This review provides an overview of the studies performed in CVDs detection using CMR images and DL techniques. The introduction section examined CVDs types, diagnostic methods, and the most important medical imaging techniques. The following presents research to detect CVDs using CMR images and the most significant DL methods. Another section discussed the challenges in diagnosing CVDs from CMRI data. Next, the discussion section discusses the results of this review, and future work in CVDs diagnosis from CMR images and DL techniques are outlined. Finally, the most important findings of this study are presented in the conclusion section.
Assuntos
Doenças Cardiovasculares , Doença da Artéria Coronariana , Aprendizado Profundo , Humanos , Doenças Cardiovasculares/diagnóstico por imagem , Imageamento por Ressonância Magnética , Coração , Doença da Artéria Coronariana/diagnósticoRESUMO
Parkinson's disease (PD) is the most common form of Parkinsonism, which is a group of neurological disorders with PD-like motor impairments. The disease affects over 6 million people worldwide and is characterized by motor and non-motor symptoms. The affected person has trouble in controlling movements, which may affect simple daily-life tasks, such as typing on a computer. We propose the application of a modified SqueezeNet convolutional neural network (CNN) for detecting PD based on the subject's key-typing patterns. First, the data are pre-processed using data standardization and the Synthetic Minority Oversampling Technique (SMOTE), and then a Continuous Wavelet Transformation is applied to generate spectrograms used for training and testing a modified SqueezeNet model. The modified SqueezeNet model achieved an accuracy of 90%, representing a noticeable improvement in comparison to other approaches.
RESUMO
Volume Projection Imaging from ultrasound data is a promising technique to visualize spine features and diagnose Adolescent Idiopathic Scoliosis. In this paper, we present a novel multi-task framework to reduce the scan noise in volume projection images and to segment different spine features simultaneously, which provides an appealing alternative for intelligent scoliosis assessment in clinical applications. Our proposed framework consists of two streams: i) A noise removal stream based on generative adversarial networks, which aims to achieve effective scan noise removal in a weakly-supervised manner, i.e., without paired noisy-clean samples for learning; ii) A spine segmentation stream, which aims to predict accurate bone masks. To establish the interaction between these two tasks, we propose a selective feature-sharing strategy to transfer only the beneficial features, while filtering out the useless or harmful information. We evaluate our proposed framework on both scan noise removal and spine segmentation tasks. The experimental results demonstrate that our proposed method achieves promising performance on both tasks, which provides an appealing approach to facilitating clinical diagnosis.
Assuntos
Escoliose , Adolescente , Algoritmos , Humanos , Processamento de Imagem Assistida por Computador/métodos , Escoliose/diagnóstico por imagem , Coluna Vertebral/diagnóstico por imagem , UltrassonografiaRESUMO
Autism spectrum disorder (ASD) is a brain condition characterized by diverse signs and symptoms that appear in early childhood. ASD is also associated with communication deficits and repetitive behavior in affected individuals. Various ASD detection methods have been developed, including neuroimaging modalities and psychological tests. Among these methods, magnetic resonance imaging (MRI) imaging modalities are of paramount importance to physicians. Clinicians rely on MRI modalities to diagnose ASD accurately. The MRI modalities are non-invasive methods that include functional (fMRI) and structural (sMRI) neuroimaging methods. However, diagnosing ASD with fMRI and sMRI for specialists is often laborious and time-consuming; therefore, several computer-aided design systems (CADS) based on artificial intelligence (AI) have been developed to assist specialist physicians. Conventional machine learning (ML) and deep learning (DL) are the most popular schemes of AI used for diagnosing ASD. This study aims to review the automated detection of ASD using AI. We review several CADS that have been developed using ML techniques for the automated diagnosis of ASD using MRI modalities. There has been very limited work on the use of DL techniques to develop automated diagnostic models for ASD. A summary of the studies developed using DL is provided in the Supplementary Appendix. Then, the challenges encountered during the automated diagnosis of ASD using MRI and AI techniques are described in detail. Additionally, a graphical comparison of studies using ML and DL to diagnose ASD automatically is discussed. We suggest future approaches to detecting ASDs using AI techniques and MRI neuroimaging.
RESUMO
Neuron image segmentation has wide applications and important potential values for neuroscience research. Due to the complexity of the submicroscopic structure of neurons cells and the defects of the image quality such as anisotropy, boundary loss and blurriness in electron microscopy-based (EM) imaging, and one faces a challenge in efficient automated segmenting large-scale neuron image 3D datasets, which is an essential prerequisite front-end process for the reconstruction of neuron circuits. Here, a neuron image segmentation method by combining Chan-Vest (CV) model with Deep Boltzmann Machine (DBM) is proposed, and a generative model is used to model and generate the target shape, it take this as a prior information to add global target shape feature constraint to the energy function of CV model, and the shape priori information is fused to assist neuron image segmentation. We applied our method to two 3D-EM datasets from different types of nerve tissue and achieved the best performance consistently across two classical evaluation metrics of neuron segmentation accuracy, namely Variation of Information (VoI) and Adaptive Rand Index (ARI). Experimental results show that the fusion algorithm has high segmentation accuracy, strong robustness, and can characterize the sub-microstructure information of neuron images well.
Assuntos
Algoritmos , Processamento de Imagem Assistida por Computador , Microscopia Eletrônica , NeurôniosRESUMO
3D ultrasound imaging has become one of the common diagnosis ways to assess scoliosis since it is radiation-free, real-time, and low-cost. Spine curvature angle measurement is an important step to assess scoliosis precisely. One way to calculate the angle is using the vertebrae features of the 2-D coronal images to identify the most tilted vertebrae. To do the measurement, the segmentation of the transverse vertebrae is an important step. In this paper, we propose a dual-task ultrasound transverse vertebrae segmentation network (D-TVNet) based on U-Net. First, we arrange an auxiliary shape regularization network to learn the contour segmentation of the bones. It improves the boundary segmentation and anti-interference ability of the U-Net by fusing some of the features of the auxiliary task and the main task. Then, we introduce the atrous spatial pyramid pooling (ASPP) module to the end of the down-sampling stage of the main task stream to improve the relative feature extraction ability. To further improve the boundary segmentation, we extendedly fuse the down-sampling output features of the auxiliary network in the ASPP. The experiment results show that the proposed D-TVNet achieves the best dice score of 86.68% and the mean dice score of 86.17% based on cross-validation, which is an improvement of 5.17% over the baseline U-Net. An automatic ultrasound spine bone segmentation network with promising results has been achieved.