Pesquisa | Portal Regional da BVS

1.

Simultaneously mapping the 3D distributions of multiple heavy metals in an industrial site using deep learning and multisource auxiliary data.

Peng, Yuxuan; Zhao, Yongcun; Chen, Jian; Xie, Enze; Yan, Guojing; Zou, Tingrun; Xu, Xianghua.

J Hazard Mater ; 480: 136000, 2024 Sep 30.

Artigo em Inglês | MEDLINE | ID: mdl-39357360

RESUMO

Three-dimensional (3D) distributions of multiple soil pollutants in industrial site are crucial for risk assessment and remediation. Yet, their 3D prediction accuracies are often low because of the strong variability of pollutants and availability of 3D covariate data. This study proposed a patch-based multi-task convolution neural network (MT-CNN) model for simultaneously predicting the 3D distributions of Zn, Pb, Ni, and Cu at an industrial site. By integrating neighborhood patches from multisource covariates, the MT-CNN model captured both horizontal and vertical pollution information, and outperformed the widely-used methods such as random forest (RF), ordinary Kriging (OK), and inverse distance weighting (IDW) for all the 4 heavy metals, with R2 values of 0.58, 0.56, 0.29 and 0.23 for Zn, Pb, Ni and Cu, respectively. Besides, the MT-CNN model achieved more stable predictions with reasonable accuracy, in comparison with the single-task CNN model. These results highlighted the potential of the proposed MT-CNN in simultaneously mapping the 3D distributions of multiple pollutants, while balancing the model training, maintaining and accuracy for low-cost rapid assessment of soil pollution at industrial sites.

2.

M3T-LM: A multi-modal multi-task learning model for jointly predicting patient length of stay and mortality.

Chen, Junde; Li, Qing; Liu, Feng; Wen, Yuxin.

Comput Biol Med ; 183: 109237, 2024 Oct 07.

Artigo em Inglês | MEDLINE | ID: mdl-39378581

RESUMO

Ensuring accurate predictions of inpatient length of stay (LoS) and mortality rates is essential for enhancing hospital service efficiency, particularly in light of the constraints posed by limited healthcare resources. Integrative analysis of heterogeneous clinic record data from different sources can hold great promise for improving the prognosis and diagnosis level of LoS and mortality. Currently, most existing studies solely focus on single data modality or tend to single-task learning, i.e., training LoS and mortality tasks separately. This limits the utilization of available multi-modal data and prevents the sharing of feature representations that could capture correlations between different tasks, ultimately hindering the model's performance. To address the challenge, this study proposes a novel Multi-Modal Multi-Task learning model, termed as M3T-LM, to integrate clinic records to predict inpatients' LoS and mortality simultaneously. The M3T-LM framework incorporates multiple data modalities by constructing sub-models tailored to each modality. Specifically, a novel attention-embedded one-dimensional (1D) convolutional neural network (CNN) is designed to handle numerical data. For clinical notes, they are converted into sequence data, and then two long short-term memory (LSTM) networks are exploited to model on textual sequence data. A two-dimensional (2D) CNN architecture, noted as CRXMDL, is designed to extract high-level features from chest X-ray (CXR) images. Subsequently, multiple sub-models are integrated to formulate the M3T-LM to capture the correlations between patient LoS and modality prediction tasks. The efficiency of the proposed method is validated on the MIMIC-IV dataset. The proposed method attained a test MAE of 5.54 for LoS prediction and a test F1 of 0.876 for mortality prediction. The experimental results demonstrate that our approach outperforms state-of-the-art (SOTA) methods in tackling mixed regression and classification tasks.

3.

Simultaneously predicting SPAD and water content in rice leaves using hyperspectral imaging with deep multi-task regression and transfer component analysis.

Zhai, Yuanning; Wang, Jun; Zhou, Lei; Zhang, Xincheng; Ren, Yun; Qi, Hengnian; Zhang, Chu.

J Sci Food Agric ; 2024 Sep 02.

Artigo em Inglês | MEDLINE | ID: mdl-39221962

RESUMO

BACKGROUND: Water content and chlorophyll content are important indicators for monitoring rice growth status. Simultaneous detection of water content and chlorophyll content is of significance. Different varieties of rice show differences in phenotype, resulting in the difficulties of establishing a universal model. In this study, hyperspectral imaging was used to detect the Soil and Plant Analyzer Development (SPAD) values and water content of fresh rice leaves of three rice varieties (Jiahua 1, Xiushui 121 and Xiushui 134). RESULTS: Both partial least squares regression and convolutional neural networks were used to establish single-task and multi-task models. Transfer component analysis (TCA) was used as transfer learning to learn the common features to achieve an approximate identical distribution between any two varieties. Single-task and multi-task models were also built using the features of the source domain, and these models were applied to the target domain. These results indicated that for models of each rice variety the prediction accuracy of most multi-task models was close to that of single-task models. As for TCA, the results showed that the single-task model achieved good performance for all transfer learning tasks. CONCLUSION: Compared with the original model, good and differentiated results were obtained for the models using features learned by TCA for both the source domain and target domain. The multi-task models could be constructed to predict SPAD values and water content simultaneously and then transferred to another rice variety, which could improve the efficiency of model construction and realize rapid detection of rice growth indicators. © 2024 Society of Chemical Industry.

4.

Automated analysis of fetal heart rate baseline/acceleration/deceleration using MTU-Net3 + model.

Wang, Minghan; Li, Guangfei; Yang, Yimin; Yang, Yongxiu; Feng, Yongkang; Li, Yashuang; Liu, Guoli; Hao, Dongmei.

Biomed Eng Lett ; 14(5): 1037-1048, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39220035

RESUMO

In clinical practice, obstetricians use visual interpretation of fetal heart rate (FHR) to diagnose fetal conditions, but inconsistencies among interpretations can hinder accuracy. This study introduces MTU-Net3+, a deep learning model designed for automated, multi-task FHR analysis, aiming to improve diagnostic accuracy and efficiency. The proposed MTU-Net3 + was built upon the UNet3 + architecture, incorporating an encoder, a decoder, full-scale skip connections, and a deep supervision module, and further integrates a self-attention mechanism and bidirectional Long Short-Term Memory layers to enhance its performance. The MTU-Net3 + model accepts the preprocessed 20-minute FHR signals as input, outputting categorical probabilities and baseline values for each time point. The proposed MTU-Net3 + model was trained on a subset of a public database, and was tested on the remaining data of the public database and a private database. In the remaining public datasets, this model achieved F1 scores of 84.21% for deceleration (F1.Dec) and 61.33% for acceleration (F1.Acc), with a Root Mean Square Baseline Difference (RMSD.BL) of 3.46 bpm, 0% of points with an absolute difference exceeding 15 bpm(D15bpm), a Synthetic Inconsistency Coefficient (SI) of 44.82%, and a Morphological Analysis Discordance Index (MADI) of 7.00%. On the private dataset, the model recorded an RMSD.BL of 1.37 bpm, 0% D15bpm, F1.Dec of 100%, F1.Acc of 87.50%, an SI of 12.20% and a MADI of 2.79%. The MTU-Net3 + model proposed in this study performed well in automated FHR analysis, demonstrating its potential as an effective tool in the field of fetal health assessment.

5.

A multi-view multi-label fast model for Auricularia cornea phenotype identification and classification.

Xu, Yinghang; Qu, Shizheng; Liu, Huan; Zhang, Lina; Liu, Yunfei; Wang, Lu; Li, Zhuoshi.

Sci Rep ; 14(1): 21136, 2024 09 10.

Artigo em Inglês | MEDLINE | ID: mdl-39256414

RESUMO

The identification and classification of various phenotypic features of Auricularia cornea fruit bodies are crucial for quality grading and breeding efforts. The phenotypic features of Auricularia cornea fruit bodies encompass size, number, shape, color, pigmentation, and damage. These phenotypic features are distributed across various views of the fruit bodies, making the task of achieving both rapid and accurate identification and classification challenging. This paper proposes a novel multi-view multi-label fast network that integrates two different views of the Auricularia cornea fruiting body, enabling rapid and precise identification and classification of six phenotypic features simultaneously. Initially, a multi-view feature extraction model based on partial convolution was constructed. This model incorporates channel attention mechanisms to achieve rapid phenotypic feature extraction of the Auricularia cornea fruiting body. Subsequently, an efficient multi-task classifier was designed, based on class-specific residual attention, to ensure accurate classification of phenotypic features. Finally, task weights were dynamically adjusted based on heteroscedastic uncertainty, reducing the training complexity of the multi-task classification. The proposed network achieved a classification accuracy of 94.66% and an inference speed of 11.9 ms on an image dataset of dried Auricularia cornea fruiting bodies with three views and six labels. The results demonstrate that the proposed network can efficiently and accurately identify and classify all phenotypic features of Auricularia cornea.

Assuntos

Fenótipo , Basidiomycota/classificação , Basidiomycota/fisiologia , Carpóforos , Processamento de Imagem Assistida por Computador/métodos , Algoritmos , Redes Neurais de Computação

6.

Detailed delineation of the fetal brain in diffusion MRI via multi-task learning.

Karimi, Davood; Calixto, Camilo; Snoussi, Haykel; Cortes-Albornoz, Maria Camila; Velasco-Annis, Clemente; Rollins, Caitlin; Jaimes, Camilo; Gholipour, Ali; Warfield, Simon K.

bioRxiv ; 2024 Aug 30.

Artigo em Inglês | MEDLINE | ID: mdl-39257731

RESUMO

Diffusion-weighted MRI is increasingly used to study the normal and abnormal development of fetal brain inutero. Recent studies have shown that dMRI can offer invaluable insights into the neurodevelopmental processes in the fetal stage. However, because of the low data quality and rapid brain development, reliable analysis of fetal dMRI data requires dedicated computational methods that are currently unavailable. The lack of automated methods for fast, accurate, and reproducible data analysis has seriously limited our ability to tap the potential of fetal brain dMRI for medical and scientific applications. In this work, we developed and validated a unified computational framework to (1) segment the brain tissue into white matter, cortical/subcortical gray matter, and cerebrospinal fluid, (2) segment 31 distinct white matter tracts, and (3) parcellate the brain's cortex and delineate the deep gray nuclei and white matter structures into 96 anatomically meaningful regions. We utilized a set of manual, semi-automatic, and automatic approaches to annotate 97 fetal brains. Using these labels, we developed and validated a multi-task deep learning method to perform the three computations. Our evaluations show that the new method can accurately carry out all three tasks, achieving a mean Dice similarity coefficient of 0.865 on tissue segmentation, 0.825 on white matter tract segmentation, and 0.819 on parcellation. The proposed method can greatly advance the field of fetal neuroimaging as it can lead to substantial improvements in fetal brain tractography, tract-specific analysis, and structural connectivity assessment.

7.

A Novel Deep Learning Model for Breast Tumor Ultrasound Image Classification with Lesion Region Perception.

Wei, Jinzhu; Zhang, Haoyang; Xie, Jiang.

Curr Oncol ; 31(9): 5057-5079, 2024 Aug 28.

Artigo em Inglês | MEDLINE | ID: mdl-39330002

RESUMO

Multi-task learning (MTL) methods are widely applied in breast imaging for lesion area perception and classification to assist in breast cancer diagnosis and personalized treatment. A typical paradigm of MTL is the shared-backbone network architecture, which can lead to information sharing conflicts and result in the decline or even failure of the main task's performance. Therefore, extracting richer lesion features and alleviating information-sharing conflicts has become a significant challenge for breast cancer classification. This study proposes a novel Multi-Feature Fusion Multi-Task (MFFMT) model to effectively address this issue. Firstly, in order to better capture the local and global feature relationships of lesion areas, a Contextual Lesion Enhancement Perception (CLEP) module is designed, which integrates channel attention mechanisms with detailed spatial positional information to extract more comprehensive lesion feature information. Secondly, a novel Multi-Feature Fusion (MFF) module is presented. The MFF module effectively extracts differential features that distinguish between lesion-specific characteristics and the semantic features used for tumor classification, and enhances the common feature information of them as well. Experimental results on two public breast ultrasound imaging datasets validate the effectiveness of our proposed method. Additionally, a comprehensive study on the impact of various factors on the model's performance is conducted to gain a deeper understanding of the working mechanism of the proposed framework.

Assuntos

Neoplasias da Mama , Aprendizado Profundo , Humanos , Neoplasias da Mama/diagnóstico por imagem , Feminino , Ultrassonografia Mamária/métodos , Interpretação de Imagem Assistida por Computador/métodos

8.

DetSegDiff: A joint periodontal landmark detection and segmentation in intraoral ultrasound using edge-enhanced diffusion-based network.

Kumaralingam, Logiraj; Dinh, Hoang B V; Nguyen, Kim-Cuong T; Punithakumar, Kumaradevan; La, Thanh-Giang; Lou, Edmond H M; Major, Paul W; Le, Lawrence H.

Comput Biol Med ; 182: 109174, 2024 Sep 24.

Artigo em Inglês | MEDLINE | ID: mdl-39321583

RESUMO

Individuals with malocclusion require an orthodontic diagnosis and treatment plan based on the severity of their condition. Assessing and monitoring changes in periodontal structures before, during, and after orthodontic procedures is crucial, and intraoral ultrasound (US) imaging has been shown a promising diagnostic tool in imaging periodontium. However, accurately delineating and analyzing periodontal structures in US videos is a challenging task for clinicians, as it is time-consuming and subject to interpretation errors. This paper introduces DetSegDiff, an edge-enhanced diffusion-based network developed to simultaneously detect the cementoenamel junction (CEJ) and segment alveolar bone structure in intraoral US videos. An edge feature encoder is designed to enhance edge and texture information for precise delineation of periodontal structures. Additionally, we employed the spatial squeeze-attention module (SSAM) to extract more representative features to perform both detection and segmentation tasks at global and local levels. This study used 169 videos from 17 orthodontic patients for training purposes and was subsequently tested on 41 videos from 4 additional patients. The proposed method achieved a mean distance difference of 0.17 ± 0.19 mm for the CEJ and an average Dice score of 90.1% for alveolar bone structure. As there is a lack of multi-task benchmark networks, thorough experiments were undertaken to assess and benchmark the proposed method against state-of-the-art (SOTA) detection and segmentation individual networks. The experimental results demonstrated that DetSegDiff outperformed SOTA approaches, confirming the feasibility of using automated diagnostic systems for orthodontists.

9.

Detailed delineation of the fetal brain in diffusion MRI via multi-task learning.

Karimi, Davood; Calixto, Camilo; Snoussi, Haykel; Cortes-Albornoz, Maria Camila; Velasco-Annis, Clemente; Rollins, Caitlin; Jaimes, Camilo; Gholipour, Ali; Warfield, Simon K.

ArXiv ; 2024 Sep 12.

Artigo em Inglês | MEDLINE | ID: mdl-39314513

RESUMO

Diffusion-weighted MRI is increasingly used to study the normal and abnormal development of fetal brain inutero. Recent studies have shown that dMRI can offer invaluable insights into the neurodevelopmental processes in the fetal stage. However, because of the low data quality and rapid brain development, reliable analysis of fetal dMRI data requires dedicated computational methods that are currently unavailable. The lack of automated methods for fast, accurate, and reproducible data analysis has seriously limited our ability to tap the potential of fetal brain dMRI for medical and scientific applications. In this work, we developed and validated a unified computational framework to (1) segment the brain tissue into white matter, cortical/subcortical gray matter, and cerebrospinal fluid, (2) segment 31 distinct white matter tracts, and (3) parcellate the brain's cortex and delineate the deep gray nuclei and white matter structures into 96 anatomically meaningful regions. We utilized a set of manual, semi-automatic, and automatic approaches to annotate 97 fetal brains. Using these labels, we developed and validated a multi-task deep learning method to perform the three computations. Our evaluations show that the new method can accurately carry out all three tasks, achieving a mean Dice similarity coefficient of 0.865 on tissue segmentation, 0.825 on white matter tract segmentation, and 0.819 on parcellation. The proposed method can greatly advance the field of fetal neuroimaging as it can lead to substantial improvements in fetal brain tractography, tract-specific analysis, and structural connectivity assessment.

10.

AnyFace++: Deep Multi-Task, Multi-Domain Learning for Efficient Face AI.

Rakhimzhanova, Tomiris; Kuzdeuov, Askat; Varol, Huseyin Atakan.

Sensors (Basel) ; 24(18)2024 Sep 15.

Artigo em Inglês | MEDLINE | ID: mdl-39338738

RESUMO

Accurate face detection and subsequent localization of facial landmarks are mandatory steps in many computer vision applications, such as emotion recognition, age estimation, and gender identification. Thanks to advancements in deep learning, numerous facial applications have been developed for human faces. However, most have to employ multiple models to accomplish several tasks simultaneously. As a result, they require more memory usage and increased inference time. Also, less attention is paid to other domains, such as animals and cartoon characters. To address these challenges, we propose an input-agnostic face model, AnyFace++, to perform multiple face-related tasks concurrently. The tasks are face detection and prediction of facial landmarks for human, animal, and cartoon faces, including age estimation, gender classification, and emotion recognition for human faces. We trained the model using deep multi-task, multi-domain learning with a heterogeneous cost function. The experimental results demonstrate that AnyFace++ generates outcomes comparable to cutting-edge models designed for specific domains.

Assuntos

Aprendizado Profundo , Face , Humanos , Face/fisiologia , Face/anatomia & histologia , Emoções/fisiologia , Feminino , Algoritmos , Masculino

11.

Identification of genetic basis of brain imaging by group sparse multi-task learning leveraging summary statistics.

Xi, Duo; Cui, Dingnan; Zhang, Mingjianan; Zhang, Jin; Shang, Muheng; Guo, Lei; Han, Junwei; Du, Lei.

Comput Struct Biotechnol J ; 23: 3288-3299, 2024 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-39296810

RESUMO

Brain imaging genetics is an evolving neuroscience topic aiming to identify genetic variations related to neuroimaging measurements of interest. Traditional linear regression methods have shown success, but their reliance on individual-level imaging and genetic data limits their applicability. Herein, we proposed S-GsMTLR, a group sparse multi-task linear regression method designed to harness summary statistics from genome-wide association studies (GWAS) of neuroimaging quantitative traits. S-GsMTLR directly employs GWAS summary statistics, bypassing the requirement for raw imaging genetic data, and applies multivariate multi-task sparse learning to these univariate GWAS results. It amalgamates the strengths of conventional sparse learning methods, including sophisticated modeling techniques and efficient feature selection. Additionally, we implemented a rapid optimization strategy to alleviate computational burdens by identifying genetic variants associated with phenotypes of interest across the entire chromosome. We first evaluated S-GsMTLR using summary statistics derived from the Alzheimer's Disease Neuroimaging Initiative. The results were remarkably encouraging, demonstrating its comparability to conventional methods in modeling and identification of risk loci. Furthermore, our method was evaluated with two additional GWAS summary statistics datasets: One focused on white matter microstructures and the other on whole brain imaging phenotypes, where the original individual-level data was unavailable. The results not only highlighted S-GsMTLR's ability to pinpoint significant loci but also revealed intriguing structures within genetic variations and loci that went unnoticed by GWAS. These findings suggest that S-GsMTLR is a promising multivariate sparse learning method in brain imaging genetics. It eliminates the need for original individual-level imaging and genetic data while demonstrating commendable modeling and feature selection capabilities.

12.

A Hierarchical Multi-Task Learning Framework for Semantic Annotation in Tabular Data.

Wu, Jie; Hou, Mengshu.

Entropy (Basel) ; 26(8)2024 Aug 04.

Artigo em Inglês | MEDLINE | ID: mdl-39202134

RESUMO

To optimize the utilization and analysis of tables, it is essential to recognize and understand their semantics comprehensively. This requirement is especially critical given that many tables lack explicit annotations, necessitating the identification of column types and inter-column relationships. Such identification can significantly augment data quality, streamline data integration, and support data analysis and mining. Current table annotation models often address each subtask independently, which may result in the neglect of constraints and contextual information, causing relational ambiguities and inference errors. To address this issue, we propose a unified multi-task learning framework capable of concurrently handling multiple tasks within a single model, including column named entity recognition, column type identification, and inter-column relationship detection. By integrating these tasks, the framework exploits their interrelations, facilitating the exchange of shallow features and the sharing of representations. Their cooperation enables each task to leverage insights from the others, thereby improving the performance of individual subtasks and enhancing the model's overall generalization capabilities. Notably, our model is designed to employ only the internal information of tabular data, avoiding reliance on external context or knowledge graphs. This design ensures robust performance even with limited input information. Extensive experiments demonstrate the superior performance of our model across various tasks, validating the effectiveness of unified multi-task learning framework in the recognition and comprehension of table semantics.

13.

Multi-task heterogeneous graph learning on electronic health records.

Chan, Tsai Hor; Yin, Guosheng; Bae, Kyongtae; Yu, Lequan.

Neural Netw ; 180: 106644, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39180906

RESUMO

Learning electronic health records (EHRs) has received emerging attention because of its capability to facilitate accurate medical diagnosis. Since the EHRs contain enriched information specifying complex interactions between entities, modeling EHRs with graphs is shown to be effective in practice. The EHRs, however, present a great degree of heterogeneity, sparsity, and complexity, which hamper the performance of most of the models applied to them. Moreover, existing approaches modeling EHRs often focus on learning the representations for a single task, overlooking the multi-task nature of EHR analysis problems and resulting in limited generalizability across different tasks. In view of these limitations, we propose a novel framework for EHR modeling, namely MulT-EHR (Multi-Task EHR), which leverages a heterogeneous graph to mine the complex relations and model the heterogeneity in the EHRs. To mitigate the large degree of noise, we introduce a denoising module based on the causal inference framework to adjust for severe confounding effects and reduce noise in the EHR data. Additionally, since our model adopts a single graph neural network for simultaneous multi-task prediction, we design a multi-task learning module to leverage the inter-task knowledge to regularize the training process. Extensive empirical studies on MIMIC-III and MIMIC-IV datasets validate that the proposed method consistently outperforms the state-of-the-art designs in four popular EHR analysis tasks - drug recommendation, and predictions of the length of stay, mortality, and readmission. Thorough ablation studies demonstrate the robustness of our method upon variations to key components and hyperparameters.

14.

Phase-Based Gait Prediction after Botulinum Toxin Treatment Using Deep Learning.

Khan, Adil; Galarraga, Omar; Garcia-Salicetti, Sonia; Vigneron, Vincent.

Sensors (Basel) ; 24(16)2024 Aug 18.

Artigo em Inglês | MEDLINE | ID: mdl-39205037

RESUMO

Gait disorders in neurological diseases are frequently associated with spasticity. Intramuscular injection of Botulinum Toxin Type A (BTX-A) can be used to treat spasticity. Providing optimal treatment with the highest possible benefit-risk ratio is a crucial consideration. This paper presents a novel approach for predicting knee and ankle kinematics after BTX-A treatment based on pre-treatment kinematics and treatment information. The proposed method is based on a Bidirectional Long Short-Term Memory (Bi-LSTM) deep learning architecture. Our study's objective is to investigate this approach's effectiveness in accurately predicting the kinematics of each phase of the gait cycle separately after BTX-A treatment. Two deep learning models are designed to incorporate categorical medical treatment data corresponding to the injected muscles: (1) within the hidden layers of the Bi-LSTM network, (2) through a gating mechanism. Since several muscles can be injected during the same session, the proposed architectures aim to model the interactions between the different treatment combinations. In this study, we conduct a comparative analysis of our prediction results with the current state of the art. The best results are obtained with the incorporation of the gating mechanism. The average prediction root mean squared error is 2.99° (R2 = 0.85) and 2.21° (R2 = 0.84) for the knee and the ankle kinematics, respectively. Our findings indicate that our approach outperforms the existing methods, yielding a significantly improved prediction accuracy.

Assuntos

Toxinas Botulínicas Tipo A , Aprendizado Profundo , Marcha , Humanos , Marcha/efeitos dos fármacos , Marcha/fisiologia , Toxinas Botulínicas Tipo A/uso terapêutico , Fenômenos Biomecânicos , Espasticidade Muscular/tratamento farmacológico , Espasticidade Muscular/fisiopatologia , Injeções Intramusculares , Masculino , Feminino

15.

Trend Prediction and Operation Alarm Model Based on PCA-Based MTL and AM for the Operating Parameters of a Water Pumping Station.

Shao, Zhiyu; Mei, Xin; Liu, Tianyuan; Li, Jingwei; Tang, Hongru.

Sensors (Basel) ; 24(16)2024 Aug 21.

Artigo em Inglês | MEDLINE | ID: mdl-39205111

RESUMO

In order to effectively predict the changing trend of operating parameters in the pump unit and carry out fault diagnosis and alarm processes, a trend prediction model is proposed in this paper based on PCA-based multi-task learning (MTL) and an attention mechanism (AM). The multi-task learning method based on PCA was used to process the operating data of the pump unit to make full use of the historical data to extract the key common features reflecting the operating state of the pump unit. The attention mechanism (AM) is introduced to dynamically allocate the weight coefficient of common feature mapping for highlighting the key common features and improving the prediction accuracy of the model when predicting the trend of data change for new working conditions. The model is tested with the actual operating data of a pumping station unit, and the calculation results of different models are compared and analyzed. The results show that the introduction of multi-task learning and attention mechanisms can improve the stability and accuracy of the trend prediction model compared with traditional single-task learning and static common feature mapping weights. According to the threshold analysis of the monitoring statistical parameters of the model, a multi-stage alarm model of pump unit operation condition monitoring can be established, which provides a theoretical basis for optimizing operation and maintenance management strategy in the process of pump station management.

16.

Simultaneous Stereo Matching and Confidence Estimation Network.

Schmähling, Tobias; Müller, Tobias; Eberhardt, Jörg; Elser, Stefan.

J Imaging ; 10(8)2024 Aug 14.

Artigo em Inglês | MEDLINE | ID: mdl-39194987

RESUMO

In this paper, we present a multi-task model that predicts disparities and confidence levels in deep stereo matching simultaneously. We do this by combining its successful model for each separate task and obtaining a multi-task model that can be trained with a proposed loss function. We show the advantages of this model compared to training and predicting disparity and confidence sequentially. This method enables an improvement of 15% to 30% in the area under the curve (AUC) metric when trained in parallel rather than sequentially. In addition, the effect of weighting the components in the loss function on the stereo and confidence performance is investigated. By improving the confidence estimate, the practicality of stereo estimators for creating distance images is increased.

17.

A comprehensive multi-task deep learning approach for predicting metabolic syndrome with genetic, nutritional, and clinical data.

Lee, Minhyuk; Park, Taesung; Shin, Ji-Yeon; Park, Mira.

Sci Rep ; 14(1): 17851, 2024 08 01.

Artigo em Inglês | MEDLINE | ID: mdl-39090161

RESUMO

Metabolic syndrome (MetS) is a complex disorder characterized by a cluster of metabolic abnormalities, including abdominal obesity, hypertension, elevated triglycerides, reduced high-density lipoprotein cholesterol, and impaired glucose tolerance. It poses a significant public health concern, as individuals with MetS are at an increased risk of developing cardiovascular diseases and type 2 diabetes. Early and accurate identification of individuals at risk for MetS is essential. Various machine learning approaches have been employed to predict MetS, such as logistic regression, support vector machines, and several boosting techniques. However, these methods use MetS as a binary status and do not consider that MetS comprises five components. Therefore, a method that focuses on these characteristics of MetS is needed. In this study, we propose a multi-task deep learning model designed to predict MetS and its five components simultaneously. The benefit of multi-task learning is that it can manage multiple tasks with a single model, and learning related tasks may enhance the model's predictive performance. To assess the efficacy of our proposed method, we compared its performance with that of several single-task approaches, including logistic regression, support vector machine, CatBoost, LightGBM, XGBoost and one-dimensional convolutional neural network. For the construction of our multi-task deep learning model, we utilized data from the Korean Association Resource (KARE) project, which includes 352,228 single nucleotide polymorphisms (SNPs) from 7729 individuals. We also considered lifestyle, dietary, and socio-economic factors that affect chronic diseases, in addition to genomic data. By evaluating metrics such as accuracy, precision, F1-score, and the area under the receiver operating characteristic curve, we demonstrate that our multi-task learning model surpasses traditional single-task machine learning models in predicting MetS.

Assuntos

Aprendizado Profundo , Síndrome Metabólica , Síndrome Metabólica/genética , Humanos , Masculino , Feminino , Pessoa de Meia-Idade , Máquina de Vetores de Suporte , Adulto , Polimorfismo de Nucleotídeo Único , Modelos Logísticos , Redes Neurais de Computação

18.

Real-time estimation of the optimal coil placement in transcranial magnetic stimulation using multi-task deep learning.

Moser, Philipp; Reishofer, Gernot; Prückl, Robert; Schaffelhofer, Stefan; Freigang, Sascha; Thumfart, Stefan; Mahdy Ali, Kariem.

Sci Rep ; 14(1): 19361, 2024 08 21.

Artigo em Inglês | MEDLINE | ID: mdl-39169126

RESUMO

Transcranial magnetic stimulation (TMS) has emerged as a promising neuromodulation technique with both therapeutic and diagnostic applications. As accurate coil placement is known to be essential for focal stimulation, computational models have been established to help find the optimal coil positioning by maximizing electric fields at the cortical target. While these numerical simulations provide realistic and subject-specific field distributions, they are computationally demanding, precluding their use in real-time applications. In this paper, we developed a novel multi-task deep neural network which simultaneously predicts the optimal coil placement for a given cortical target as well as the associated TMS-induced electric field. Trained on large amounts of preceding numerical optimizations, the Attention U-Net-based neural surrogate provided accurate coil optimizations in only 35 ms, a fraction of time compared to the state-of-the-art numerical framework. The mean errors on the position estimates were below 2 mm, i.e., smaller than previously reported manual coil positioning errors. The predicted electric fields were also highly correlated (r> 0.97) with their numerical references. In addition to healthy subjects, we validated our approach also in glioblastoma patients. We first statistically underlined the importance of using realistic heterogeneous tumor conductivities instead of simply adopting values from the surrounding healthy tissue. Second, applying the trained neural surrogate to tumor patients yielded similar accurate positioning and electric field estimates as in healthy subjects. Our findings provide a promising framework for future real-time electric field-optimized TMS applications.

Assuntos

Aprendizado Profundo , Estimulação Magnética Transcraniana , Estimulação Magnética Transcraniana/métodos , Humanos , Masculino , Glioblastoma/terapia , Feminino , Adulto , Simulação por Computador

19.

A Self-Supervised Few-Shot Semantic Segmentation Method Based on Multi-Task Learning and Dense Attention Computation.

Yi, Kai; Wang, Weihang; Zhang, Yi.

Sensors (Basel) ; 24(15)2024 Jul 31.

Artigo em Inglês | MEDLINE | ID: mdl-39124022

RESUMO

Nowadays, autonomous driving technology has become widely prevalent. The intelligent vehicles have been equipped with various sensors (e.g., vision sensors, LiDAR, depth cameras etc.). Among them, the vision systems with tailored semantic segmentation and perception algorithms play critical roles in scene understanding. However, the traditional supervised semantic segmentation needs a large number of pixel-level manual annotations to complete model training. Although few-shot methods reduce the annotation work to some extent, they are still labor intensive. In this paper, a self-supervised few-shot semantic segmentation method based on Multi-task Learning and Dense Attention Computation (dubbed MLDAC) is proposed. The salient part of an image is split into two parts; one of them serves as the support mask for few-shot segmentation, while cross-entropy losses are calculated between the other part and the entire region with the predicted results separately as multi-task learning so as to improve the model's generalization ability. Swin Transformer is used as our backbone to extract feature maps at different scales. These feature maps are then input to multiple levels of dense attention computation blocks to enhance pixel-level correspondence. The final prediction results are obtained through inter-scale mixing and feature skip connection. The experimental results indicate that MLDAC obtains 55.1% and 26.8% one-shot mIoU self-supervised few-shot segmentation on the PASCAL-5i and COCO-20i datasets, respectively. In addition, it achieves 78.1% on the FSS-1000 few-shot dataset, proving its efficacy.

20.

Bone age assessment by multi-granularity and multi-attention feature encoding.

Liu, Bowen; Huang, Yulin; Li, Shaowei; He, Jinshui; Zhang, Dongxu.

Quant Imaging Med Surg ; 14(8): 5902-5914, 2024 Aug 01.

Artigo em Inglês | MEDLINE | ID: mdl-39144019

RESUMO

Background: Bone age assessment (BAA) is crucial for the diagnosis of growth disorders and the optimization of treatments. However, the random error caused by different observers' experiences and the low consistency of repeated assessments harms the quality of such assessments. Thus, automated assessment methods are needed. Methods: Previous research has sought to design localization modules in a strongly or weakly supervised fashion to aggregate part regions to better recognize subtle differences. Conversely, we sought to efficiently deliver information between multi-granularity regions for fine-grained feature learning and to directly model long-distance relationships for global understanding. The proposed method has been named the "Multi-Granularity and Multi-Attention Net (2M-Net)". Specifically, we first applied the jigsaw method to generate related tasks emphasizing regions with different granularities, and we then trained the model on these tasks using a hierarchical sharing mechanism. In effect, the training signals from the extra tasks created as an inductive bias, enabling 2M-Net to discover task relatedness without the need for annotations. Next, the self-attention mechanism acted as a plug-and-play module to effectively enhance the feature representation capabilities. Finally, multi-scale features were applied for prediction. Results: A public data set of 14,236 hand radiographs, provided by the Radiological Society of North America (RSNA), was used to develop and validate 2M-Net. In the public benchmark testing, the mean absolute error (MAE) between the bone age estimates of the model and of the reviewer was 3.98 months (3.89 months for males and 4.07 months for females). Conclusions: By using the jigsaw method to construct a multi-task learning strategy and inserting the self-attention module for efficient global modeling, we established 2M-Net, which is comparable to the previous best method in terms of performance.

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA