Pesquisa | BVS CLAP/SMR-OPAS/OMS

1.

ULDNA: integrating unsupervised multi-source language models with LSTM-attention network for high-accuracy protein-DNA binding site prediction.

Zhu, Yi-Heng; Liu, Zi; Liu, Yan; Ji, Zhiwei; Yu, Dong-Jun.

Brief Bioinform ; 25(2)2024 Jan 22.

Artigo em Inglês | MEDLINE | ID: mdl-38349057

RESUMO

Efficient and accurate recognition of protein-DNA interactions is vital for understanding the molecular mechanisms of related biological processes and further guiding drug discovery. Although the current experimental protocols are the most precise way to determine protein-DNA binding sites, they tend to be labor-intensive and time-consuming. There is an immediate need to design efficient computational approaches for predicting DNA-binding sites. Here, we proposed ULDNA, a new deep-learning model, to deduce DNA-binding sites from protein sequences. This model leverages an LSTM-attention architecture, embedded with three unsupervised language models that are pre-trained on large-scale sequences from multiple database sources. To prove its effectiveness, ULDNA was tested on 229 protein chains with experimental annotation of DNA-binding sites. Results from computational experiments revealed that ULDNA significantly improves the accuracy of DNA-binding site prediction in comparison with 17 state-of-the-art methods. In-depth data analyses showed that the major strength of ULDNA stems from employing three transformer language models. Specifically, these language models capture complementary feature embeddings with evolution diversity, in which the complex DNA-binding patterns are buried. Meanwhile, the specially crafted LSTM-attention network effectively decodes evolution diversity-based embeddings as DNA-binding results at the residue level. Our findings demonstrated a new pipeline for predicting DNA-binding sites on a large scale with high accuracy from protein sequence alone.

Assuntos

Análise de Dados , Idioma , Sítios de Ligação , Sequência de Aminoácidos , Bases de Dados Factuais

2.

CRISPR-DIPOFF: an interpretable deep learning approach for CRISPR Cas-9 off-target prediction.

Toufikuzzaman, Md; Hassan Samee, Md Abul; Sohel Rahman, M.

Brief Bioinform ; 25(2)2024 Jan 22.

Artigo em Inglês | MEDLINE | ID: mdl-38388680

RESUMO

CRISPR Cas-9 is a groundbreaking genome-editing tool that harnesses bacterial defense systems to alter DNA sequences accurately. This innovative technology holds vast promise in multiple domains like biotechnology, agriculture and medicine. However, such power does not come without its own peril, and one such issue is the potential for unintended modifications (Off-Target), which highlights the need for accurate prediction and mitigation strategies. Though previous studies have demonstrated improvement in Off-Target prediction capability with the application of deep learning, they often struggle with the precision-recall trade-off, limiting their effectiveness and do not provide proper interpretation of the complex decision-making process of their models. To address these limitations, we have thoroughly explored deep learning networks, particularly the recurrent neural network based models, leveraging their established success in handling sequence data. Furthermore, we have employed genetic algorithm for hyperparameter tuning to optimize these models' performance. The results from our experiments demonstrate significant performance improvement compared with the current state-of-the-art in Off-Target prediction, highlighting the efficacy of our approach. Furthermore, leveraging the power of the integrated gradient method, we make an effort to interpret our models resulting in a detailed analysis and understanding of the underlying factors that contribute to Off-Target predictions, in particular the presence of two sub-regions in the seed region of single guide RNA which extends the established biological hypothesis of Off-Target effects. To the best of our knowledge, our model can be considered as the first model combining high efficacy, interpretability and a desirable balance between precision and recall.

Assuntos

Sistemas CRISPR-Cas , Aprendizado Profundo , Edição de Genes/métodos , RNA Guia de Sistemas CRISPR-Cas , Redes Neurais de Computação

3.

Improved prediction of DNA and RNA binding proteins with deep learning models.

Wu, Siwen; Guo, Jun-Tao.

Brief Bioinform ; 25(4)2024 May 23.

Artigo em Inglês | MEDLINE | ID: mdl-38856168

RESUMO

Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play important roles in essential biological processes. To facilitate functional annotation and accurate prediction of different types of NABPs, many machine learning-based computational approaches have been developed. However, the datasets used for training and testing as well as the prediction scopes in these studies have limited their applications. In this paper, we developed new strategies to overcome these limitations by generating more accurate and robust datasets and developing deep learning-based methods including both hierarchical and multi-class approaches to predict the types of NABPs for any given protein. The deep learning models employ two layers of convolutional neural network and one layer of long short-term memory. Our approaches outperform existing DBP and RBP predictors with a balanced prediction between DBPs and RBPs, and are more practically useful in identifying novel NABPs. The multi-class approach greatly improves the prediction accuracy of DBPs and RBPs, especially for the DBPs with ~12% improvement. Moreover, we explored the prediction accuracy of single-stranded DNA binding proteins and their effect on the overall prediction accuracy of NABP predictions.

Assuntos

Biologia Computacional , Proteínas de Ligação a DNA , Aprendizado Profundo , Proteínas de Ligação a RNA , Proteínas de Ligação a RNA/metabolismo , Proteínas de Ligação a DNA/metabolismo , Biologia Computacional/métodos , Redes Neurais de Computação , Humanos

4.

An efficient hybrid deep learning architecture for predicting short antimicrobial peptides.

Nguyen, Quang H; Nguyen-Vo, Thanh-Hoang; Do, Trang T T; Nguyen, Binh P.

Proteomics ; 24(14): e2300382, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38837544

RESUMO

Short-length antimicrobial peptides (AMPs) have been demonstrated to have intensified antimicrobial activities against a wide spectrum of microbes. Therefore, exploration of novel and promising short AMPs is highly essential in developing various types of antimicrobial drugs or treatments. In addition to experimental approaches, computational methods have been developed to improve screening efficiency. Although existing computational methods have achieved satisfactory performance, there is still much room for model improvement. In this study, we proposed iAMP-DL, an efficient hybrid deep learning architecture, for predicting short AMPs. The model was constructed using two well-known deep learning architectures: the long short-term memory architecture and convolutional neural networks. To fairly assess the performance of the model, we compared our model with existing state-of-the-art methods using the same independent test set. Our comparative analysis shows that iAMP-DL outperformed other methods. Furthermore, to assess the robustness and stability of our model, the experiments were repeated 10 times to observe the variation in prediction efficiency. The results demonstrate that iAMP-DL is an effective, robust, and stable framework for detecting promising short AMPs. Another comparative study of different negative data sampling methods also confirms the effectiveness of our method and demonstrates that it can also be used to develop a robust model for predicting AMPs in general. The proposed framework was also deployed as an online web server with a user-friendly interface to support the research community in identifying short AMPs.

Assuntos

Peptídeos Antimicrobianos , Aprendizado Profundo , Peptídeos Antimicrobianos/química , Peptídeos Antimicrobianos/farmacologia , Redes Neurais de Computação , Biologia Computacional/métodos , Peptídeos Catiônicos Antimicrobianos/química , Peptídeos Catiônicos Antimicrobianos/farmacologia

5.

Tpgen: a language model for stable protein design with a specific topology structure.

Min, Xiaoping; Yang, Chongzhou; Xie, Jun; Huang, Yang; Liu, Nan; Jin, Xiaocheng; Wang, Tianshu; Kong, Zhibo; Lu, Xiaoli; Ge, Shengxiang; Zhang, Jun; Xia, Ningshao.

BMC Bioinformatics ; 25(1): 35, 2024 Jan 23.

Artigo em Inglês | MEDLINE | ID: mdl-38254030

RESUMO

BACKGROUND: Natural proteins occupy a small portion of the protein sequence space, whereas artificial proteins can explore a wider range of possibilities within the sequence space. However, specific requirements may not be met when generating sequences blindly. Research indicates that small proteins have notable advantages, including high stability, accurate resolution prediction, and facile specificity modification. RESULTS: This study involves the construction of a neural network model named TopoProGenerator(TPGen) using a transformer decoder. The model is trained with sequences consisting of a maximum of 65 amino acids. The training process of TopoProGenerator incorporates reinforcement learning and adversarial learning, for fine-tuning. Additionally, it encompasses a stability predictive model trained with a dataset comprising over 200,000 sequences. The results demonstrate that TopoProGenerator is capable of designing stable small protein sequences with specified topology structures. CONCLUSION: TPGen has the ability to generate protein sequences that fold into the specified topology, and the pretraining and fine-tuning methods proposed in this study can serve as a framework for designing various types of proteins.

Assuntos

Aminoácidos , Fontes de Energia Elétrica , Sequência de Aminoácidos , Idioma , Aprendizagem

6.

Recurrent neural network-based simultaneous cardiac T1, T2, and T1ρ mapping.

Tao, Yiming; Lv, Zhenfeng; Liu, Wenjian; Qi, Haikun; Hu, Peng.

NMR Biomed ; 37(8): e5133, 2024 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-38520183

RESUMO

The purpose of the current study was to explore the feasibility of training a deep neural network to accelerate the process of generating T1, T2, and T1ρ maps for a recently proposed free-breathing cardiac multiparametric mapping technique, where a recurrent neural network (RNN) was utilized to exploit the temporal correlation among the multicontrast images. The RNN-based model was developed for rapid and accurate T1, T2, and T1ρ estimation. Bloch simulation was performed to simulate a dataset of more than 10 million signals and time correspondences with different noise levels for network training. The proposed RNN-based method was compared with a dictionary-matching method and a conventional mapping method to evaluate the model's effectiveness in phantom and in vivo studies at 3 T, respectively. In phantom studies, the RNN-based method and the dictionary-matching method achieved similar accuracy and precision in T1, T2, and T1ρ estimations. In in vivo studies, the estimated T1, T2, and T1ρ values obtained by the two methods achieved similar accuracy and precision for 10 healthy volunteers (T1: 1228.70 ± 53.80 vs. 1228.34 ± 52.91 ms, p > 0.1; T2: 40.70 ± 2.89 vs. 41.19 ± 2.91 ms, p > 0.1; T1ρ: 45.09 ± 4.47 vs. 45.23 ± 4.65 ms, p > 0.1). The RNN-based method can generate cardiac multiparameter quantitative maps simultaneously in just 2 s, achieving 60-fold acceleration compared with the dictionary-matching method. The RNN-accelerated method offers an almost instantaneous approach for reconstructing accurate T1, T2, and T1ρ maps, being much more efficient than the dictionary-matching method for the free-breathing multiparametric cardiac mapping technique, which may pave the way for inline mapping in clinical applications.

Assuntos

Coração , Redes Neurais de Computação , Imagens de Fantasmas , Humanos , Coração/diagnóstico por imagem , Masculino , Adulto , Imageamento por Ressonância Magnética/métodos , Feminino , Processamento de Imagem Assistida por Computador/métodos , Algoritmos

7.

Identification of patients' smoking status using an explainable AI approach: a Danish electronic health records case study.

Ebrahimi, Ali; Henriksen, Margrethe Bang Høstgaard; Brasen, Claus Lohman; Hilberg, Ole; Hansen, Torben Frøstrup; Jensen, Lars Henrik; Peimankar, Abdolrahman; Wiil, Uffe Kock.

BMC Med Res Methodol ; 24(1): 114, 2024 May 17.

Artigo em Inglês | MEDLINE | ID: mdl-38760718

RESUMO

BACKGROUND: Smoking is a critical risk factor responsible for over eight million annual deaths worldwide. It is essential to obtain information on smoking habits to advance research and implement preventive measures such as screening of high-risk individuals. In most countries, including Denmark, smoking habits are not systematically recorded and at best documented within unstructured free-text segments of electronic health records (EHRs). This would require researchers and clinicians to manually navigate through extensive amounts of unstructured data, which is one of the main reasons that smoking habits are rarely integrated into larger studies. Our aim is to develop machine learning models to classify patients' smoking status from their EHRs. METHODS: This study proposes an efficient natural language processing (NLP) pipeline capable of classifying patients' smoking status and providing explanations for the decisions. The proposed NLP pipeline comprises four distinct components, which are; (1) considering preprocessing techniques to address abbreviations, punctuation, and other textual irregularities, (2) four cutting-edge feature extraction techniques, i.e. Embedding, BERT, Word2Vec, and Count Vectorizer, employed to extract the optimal features, (3) utilization of a Stacking-based Ensemble (SE) model and a Convolutional Long Short-Term Memory Neural Network (CNN-LSTM) for the identification of smoking status, and (4) application of a local interpretable model-agnostic explanation to explain the decisions rendered by the detection models. The EHRs of 23,132 patients with suspected lung cancer were collected from the Region of Southern Denmark during the period 1/1/2009-31/12/2018. A medical professional annotated the data into 'Smoker' and 'Non-Smoker' with further classifications as 'Active-Smoker', 'Former-Smoker', and 'Never-Smoker'. Subsequently, the annotated dataset was used for the development of binary and multiclass classification models. An extensive comparison was conducted of the detection performance across various model architectures. RESULTS: The results of experimental validation confirm the consistency among the models. However, for binary classification, BERT method with CNN-LSTM architecture outperformed other models by achieving precision, recall, and F1-scores between 97% and 99% for both Never-Smokers and Active-Smokers. In multiclass classification, the Embedding technique with CNN-LSTM architecture yielded the most favorable results in class-specific evaluations, with equal performance measures of 97% for Never-Smoker and measures in the range of 86 to 89% for Active-Smoker and 91-92% for Never-Smoker. CONCLUSION: Our proposed NLP pipeline achieved a high level of classification performance. In addition, we presented the explanation of the decision made by the best performing detection model. Future work will expand the model's capabilities to analyze longer notes and a broader range of categories to maximize its utility in further research and screening applications.

Assuntos

Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Fumar , Humanos , Dinamarca/epidemiologia , Registros Eletrônicos de Saúde/estatística & dados numéricos , Fumar/epidemiologia , Aprendizado de Máquina , Feminino , Masculino , Pessoa de Meia-Idade , Redes Neurais de Computação

8.

Trend analysis and prediction of gonorrhea in mainland China based on a hybrid time series model.

Wang, Zhende; Wang, Yongbin; Zhang, Shengkui; Wang, Suzhen; Xu, Zhen; Feng, ZiJian.

BMC Infect Dis ; 24(1): 113, 2024 Jan 22.

Artigo em Inglês | MEDLINE | ID: mdl-38253998

RESUMO

BACKGROUND: Gonorrhea has long been a serious public health problem in mainland China that requires attention, modeling to describe and predict its prevalence patterns can help the government to develop more scientific interventions. METHODS: Time series (TS) data of the gonorrhea incidence in China from January 2004 to August 2022 were collected, with the incidence data from September 2021 to August 2022 as the validation. The seasonal autoregressive integrated moving average (SARIMA) model, long short-term memory network (LSTM) model, and hybrid SARIMA-LSTM model were used to simulate the data respectively, the model performance were evaluated by calculating the mean absolute percentage error (MAPE), root mean square error (RMSE), and mean absolute error (MAE) of the training and validation sets of the models. RESULTS: The Seasonal components after data decomposition showed an approximate bimodal distribution with a period of 12 months. The three models identified were SARIMA(1,1,1) (2,1,2)12, LSTM with 150 hidden units, and SARIMA-LSTM with 150 hidden units, the SARIMA-LSTM model fitted best in the training and validation sets, for the smallest MAPE, RMSE, and MPE. CONCLUSIONS: The overall incidence trend of gonorrhea in mainland China has been on the decline since 2004, with some periods exhibiting an upward trend. The incidence of gonorrhea displays a seasonal distribution, typically peaking in July and December each year. The SARIMA model, LSTM model, and SARIMA-LSTM model can all fit the monthly incidence time series data of gonorrhea in mainland China. However, in terms of predictive performance, the SARIMA-LSTM model outperforms the SARIMA and LSTM models, with the LSTM model surpassing the SARIMA model. This suggests that the SARIMA-LSTM model can serve as a preferred tool for time series analysis, providing evidence for the government to predict trends in gonorrhea incidence. The model's predictions indicate that the incidence of gonorrhea in mainland China will remain at a high level in 2024, necessitating that policymakers implement public health measures in advance to prevent the spread of the disease.

Assuntos

Gonorreia , Humanos , Fatores de Tempo , Gonorreia/epidemiologia , China/epidemiologia , Governo , Saúde Pública , Convulsões

9.

High-Precision Microscale Particulate Matter Prediction in Diverse Environments Using a Long Short-Term Memory Neural Network and Street View Imagery.

Liu, Xiansheng; Zhang, Xun; Wang, Rui; Liu, Ying; Hadiatullah, Hadiatullah; Xu, Yanning; Wang, Tao; Bendl, Jan; Adam, Thomas; Schnelle-Kreis, Jürgen; Querol, Xavier.

Environ Sci Technol ; 58(8): 3869-3882, 2024 Feb 27.

Artigo em Inglês | MEDLINE | ID: mdl-38355131

RESUMO

In this study, we propose a novel long short-term memory (LSTM) neural network model that leverages color features (HSV: hue, saturation, value) extracted from street images to estimate air quality with particulate matter (PM) in four typical European environments: urban, suburban, villages, and the harbor. To evaluate its performance, we utilize concentration data for eight parameters of ambient PM (PM1.0, PM2.5, and PM10, particle number concentration, lung-deposited surface area, equivalent mass concentrations of ultraviolet PM, black carbon, and brown carbon) collected from a mobile monitoring platform during the nonheating season in downtown Augsburg, Germany, along with synchronized street view images. Experimental comparisons were conducted between the LSTM model and other deep learning models (recurrent neural network and gated recurrent unit). The results clearly demonstrate a better performance of the LSTM model compared with other statistically based models. The LSTM-HSV model achieved impressive interpretability rates above 80%, for the eight PM metrics mentioned above, indicating the expected performance of the proposed model. Moreover, the successful application of the LSTM-HSV model in other seasons of Augsburg city and various environments (suburbs, villages, and harbor cities) demonstrates its satisfactory generalization capabilities in both temporal and spatial dimensions. The successful application of the LSTM-HSV model underscores its potential as a versatile tool for the estimation of air pollution after presampling of the studied area, with broad implications for urban planning and public health initiatives.

Assuntos

Poluentes Atmosféricos , Poluição do Ar , Material Particulado/análise , Poluentes Atmosféricos/análise , Monitoramento Ambiental/métodos , Memória de Curto Prazo , Poluição do Ar/análise , Redes Neurais de Computação , Carbono

10.

Forecasting acute kidney injury and resource utilization in ICU patients using longitudinal, multimodal models.

Tan, Yukun; Dede, Merve; Mohanty, Vakul; Dou, Jinzhuang; Hill, Holly; Bernstam, Elmer; Chen, Ken.

J Biomed Inform ; 154: 104648, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38692464

RESUMO

BACKGROUND: Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive. OBJECTIVE: This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database. METHODS: We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT. RESULTS: Our multimodal model achieved a lead time of at least 12 h ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT. CONCLUSION: Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.

Assuntos

Injúria Renal Aguda , Registros Eletrônicos de Saúde , Unidades de Terapia Intensiva , Injúria Renal Aguda/terapia , Humanos , Estudos Longitudinais , Terapia de Substituição Renal , Inteligência Artificial , Previsões , Tempo de Internação , Masculino , Bases de Dados Factuais , Feminino

11.

The effectiveness of simple heuristic features in sensor orientation and placement problems in human activity recognition using a single smartphone accelerometer.

Barua, Arnab; Jiang, Xianta; Fuller, Daniel.

Biomed Eng Online ; 23(1): 21, 2024 Feb 17.

Artigo em Inglês | MEDLINE | ID: mdl-38368358

RESUMO

BACKGROUND: Human activity Recognition (HAR) using smartphone sensors suffers from two major problems: sensor orientation and placement. Sensor orientation and sensor placement problems refer to the variation in sensor signal for a particular activity due to sensors' altering orientation and placement. Extracting orientation and position invariant features from raw sensor signals is a simple solution for tackling these problems. Using few heuristic features rather than numerous time-domain and frequency-domain features offers more simplicity in this approach. The heuristic features are features which have very minimal effects of sensor orientation and placement. In this study, we evaluated the effectiveness of four simple heuristic features in solving the sensor orientation and placement problems using a 1D-CNN-LSTM model for a data set consisting of over 12 million samples. METHODS: We accumulated data from 42 participants for six common daily activities: Lying, Sitting, Walking, and Running at 3-Metabolic Equivalent of Tasks (METs), 5-METs and 7-METs from a single accelerometer sensor of a smartphone. We conducted our study for three smartphone positions: Pocket, Backpack and Hand. We extracted simple heuristic features from the accelerometer data and used them to train and test a 1D-CNN-LSTM model to evaluate their effectiveness in solving sensor orientation and placement problems. RESULTS: We performed intra-position and inter-position evaluations. In intra-position evaluation, we trained and tested the model using data from the same smartphone position, whereas, in inter-position evaluation, the training and test data was from different smartphone positions. For intra-position evaluation, we acquired 70-73% accuracy; for inter-position cases, the accuracies ranged between 59 and 69%. Moreover, we performed participant-specific and activity-specific analyses. CONCLUSIONS: We found that the simple heuristic features are considerably effective in solving orientation problems. With further development, such as fusing the heuristic features with other methods that eliminate placement issues, we can also achieve a better result than the outcome we achieved using the heuristic features for the sensor placement problem. In addition, we found the heuristic features to be more effective in recognizing high-intensity activities.

Assuntos

Heurística , Smartphone , Humanos , Atividades Humanas , Caminhada , Acelerometria/métodos

12.

Artificial intelligence-based forecasting model for incinerator in sulfur recovery units to predict SO₂ emissions.

Thameem, Muhammed; Raj, Abhijeet; Berrouk, Abdallah; Jaoude, Maguy A; AlHammadi, Ali A.

Environ Res ; 249: 118329, 2024 May 15.

Artigo em Inglês | MEDLINE | ID: mdl-38325781

RESUMO

Pollutant emissions from chemical plants are a major concern in the context of environmental safety. A reliable emission forecasting model can provide important information for optimizing the process and improving the environmental performance. In this work, forecasting models are developed for the prediction of SO2 emission from a Sulfur Recovery Unit (SRU). Since SRUs incorporate complex chemical reactions, first-principle models are not suitable to predict emission levels based on a given feed condition. Accordingly, artificial intelligence-based models such as standard machine learning (ML) algorithms, multi-layer perceptron (MLP), long short-term memory (LSTM), one-dimensional convolution (1D-CNN), and CNN-LSTM models were tested, and their performance was evaluated. The input features and hyperparameters of the models were optimized to achieve maximum performance. The performance was evaluated in terms of mean squared error (MSE) and mean absolute percentage Error (MAPE) for 1 h, 3 h and 5 h ahead of forecasting. The reported results show that the CNN-LSTM encoder-decoder model outperforms other tested models, with its superiority becoming more pronounced as the forecasting horizon increased from 1 h to 5 h. For the 5-h ahead forecasting, the proposed model showed a MAPE advantage of 17.23%, 4.41%, and 2.83%, respectively over the 1D-CNN, Deep LSTM, and single-layer LSTM models in the larger dataset.

Assuntos

Poluentes Atmosféricos , Inteligência Artificial , Previsões , Incineração , Dióxido de Enxofre , Dióxido de Enxofre/análise , Previsões/métodos , Poluentes Atmosféricos/análise , Enxofre/análise , Modelos Teóricos , Monitoramento Ambiental/métodos , Redes Neurais de Computação , Aprendizado de Máquina

13.

A fourfold-objective-based cloud privacy preservation model with proposed association rule hiding and deep learning assisted optimal key generation.

Sharma, Smita; Tyagi, Sanjay.

Network ; : 1-36, 2024 Jul 26.

Artigo em Inglês | MEDLINE | ID: mdl-39054942

RESUMO

Numerous studies have been conducted in an attempt to preserve cloud privacy, yet the majority of cutting-edge solutions fall short when it comes to handling sensitive data. This research proposes a "privacy preservation model in the cloud environment". The four stages of recommended security preservation methodology are "identification of sensitive data, generation of an optimal tuned key, suggested data sanitization, and data restoration". Initially, owner's data enters the Sensitive data identification process. The sensitive information in the input (owner's data) is identified via Augmented Dynamic Itemset Counting (ADIC) based Associative Rule Mining Model. Subsequently, the identified sensitive data are sanitized via the newly created tuned key. The generated tuned key is formulated with new fourfold objective-hybrid optimization approach-based deep learning approach. The optimally tuned key is generated with LSTM on the basis of fourfold objectives and the new hybrid MUAOA. The created keys, as well as generated sensitive rules, are fed into the deep learning model. The MUAOA technique is a conceptual blend of standard AOA and CMBO, respectively. As a result, unauthorized people will be unable to access information. Finally, comparative evaluation is undergone and proposed LSTM+MUAOA has achieved higher values on privacy about 5.21 compared to other existing models.

14.

Support vector machine-based stock market prediction using long short-term memory and convolutional neural network with aquila circle inspired optimization.

Karthick Myilvahanan, J; Mohana Sundaram, N.

Network ; : 1-36, 2024 Jun 10.

Artigo em Inglês | MEDLINE | ID: mdl-38855971

RESUMO

Predicting the stock market is one of the significant chores and has a successful prediction of stock rates, and it helps in making correct decisions. The prediction of the stock market is the main challenge due to blaring, chaotic data as well as non-stationary data. In this research, the support vector machine (SVM) is devised for performing an effective stock market prediction. At first, the input time series data is considered and the pre-processing of data is done by employing a standard scalar. Then, the time intrinsic features are extracted and the suitable features are selected in the feature selection stage by eliminating other features using recursive feature elimination. Afterwards, the Long Short-Term Memory (LSTM) based prediction is done, wherein LSTM is trained to employ Aquila circle-inspired optimization (ACIO) that is newly introduced by merging Aquila optimizer (AO) with circle-inspired optimization algorithm (CIOA). On the other hand, delay-based matrix formation is conducted by considering input time series data. After that, convolutional neural network (CNN)-based prediction is performed, where CNN is tuned by the same ACIO. Finally, stock market prediction is executed utilizing SVM by fusing the predicted outputs attained from LSTM-based prediction and CNN-based prediction. Furthermore, the SVM attains better outcomes of minimum mean absolute percentage error; (MAPE) and normalized root-mean-square error (RMSE) with values about 0.378 and 0.294.

15.

Predicting concrete strength early age using a combination of machine learning and electromechanical impedance with nano-enhanced sensors.

Ju, Huang; Xing, Lin; Ali, Alaa Hussein; El-Arab, Islam Ezz; Elshekh, Ali E A; Abbas, Mohamed; Abdullah, Nermeen; Elattar, Samia; Hashmi, Ahmed; Ali, Elimam; Assilzadeh, Hamid.

Environ Res ; 258: 119248, 2024 Oct 01.

Artigo em Inglês | MEDLINE | ID: mdl-38823615

RESUMO

To ensure the structural integrity of concrete and prevent unanticipated fracturing, real-time monitoring of early-age concrete's strength development is essential, mainly through advanced techniques such as nano-enhanced sensors. The piezoelectric-based electro-mechanical impedance (EMI) method with nano-enhanced sensors is emerging as a practical solution for such monitoring requirements. This study presents a strength estimation method based on Non-Destructive Testing (NDT) Techniques and Long Short-Term Memory (LSTM) and artificial neural networks (ANNs) as hybrid (NDT-LSTMs-ANN), including several types of concrete strength-related agents. Input data includes water-to-cement rate, temperature, curing time, and maturity based on interior temperature, allowing experimentally monitoring the development of concrete strength from the early steps of hydration and casting to the last stages of hardening 28 days after the casting. The study investigated the impact of various factors on concrete strength development, utilizing a cutting-edge approach that combines traditional models with nano-enhanced piezoelectric sensors and NDT-LSTMs-ANN enhanced with nanotechnology. The results demonstrate that the hybrid provides highly accurate concrete strength estimation for construction safety and efficiency. Adopting the piezoelectric-based EMI technique with these advanced sensors offers a viable and effective monitoring solution, presenting a significant leap forward for the construction industry's structural health monitoring practices.

Assuntos

Materiais de Construção , Impedância Elétrica , Aprendizado de Máquina , Redes Neurais de Computação , Materiais de Construção/análise , Nanotecnologia/instrumentação , Nanotecnologia/métodos , Teste de Materiais/métodos

16.

The prediction of influenza-like illness using national influenza surveillance data and Baidu query data.

Wei, Su; Lin, Sun; Wenjing, Zhao; Shaoxia, Song; Yuejie, Yang; Yujie, He; Shu, Zhang; Zhong, Li; Ti, Liu.

BMC Public Health ; 24(1): 513, 2024 Feb 19.

Artigo em Inglês | MEDLINE | ID: mdl-38369456

RESUMO

BACKGROUND: Seasonal influenza and other respiratory tract infections are serious public health problems that need to be further addressed and investigated. Internet search data are recognized as a valuable source for forecasting influenza or other respiratory tract infection epidemics. However, the selection of internet search data and the application of forecasting methods are important for improving forecasting accuracy. The aim of the present study was to forecast influenza epidemics based on the long short-term memory neural network (LSTM) method, Baidu search index data, and the influenza-like-illness (ILI) rate. METHODS: The official weekly ILI% data for northern and southern mainland China were obtained from the Chinese Influenza Center from 2018 to 2021. Based on the Baidu Index, search indices related to influenza infection over the corresponding time period were obtained. Pearson correlation analysis was performed to explore the association between influenza-related search queries and the ILI% of southern and northern mainland China. The LSTM model was used to forecast the influenza epidemic within the same week and at lags of 1-4 weeks. The model performance was assessed by evaluation metrics, including the mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE). RESULTS: In total, 24 search queries in northern mainland China and 7 search queries in southern mainland China were found to be correlated and were used to construct the LSTM model, which included the same week and a lag of 1-4 weeks. The LSTM model showed that ILI% + mask with one lag week and ILI% + influenza name were good prediction modules, with reduced RMSE predictions of 16.75% and 4.20%, respectively, compared with the estimated ILI% for northern and southern mainland China. CONCLUSIONS: The results illuminate the feasibility of using an internet search index as a complementary data source for influenza forecasting and the efficiency of using the LSTM model to forecast influenza epidemics.

Assuntos

Epidemias , Influenza Humana , Humanos , Influenza Humana/epidemiologia , China/epidemiologia , Redes Neurais de Computação , Previsões

17.

CLINet: A novel deep learning network for ECG signal classification.

Mantravadi, Ananya; Saini, Siddharth; R, Sai Chandra Teja; Mittal, Sparsh; Shah, Shrimay; R, Sri Devi; Singhal, Rekha.

J Electrocardiol ; 83: 41-48, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38306814

RESUMO

Machine learning is poised to revolutionize medicine with algorithms that spot cardiac arrhythmia. An automated diagnostic approach can boost the efficacy of diagnosing life-threatening arrhythmia disorders in routine medical procedures. In this paper, we propose a deep learning network CLINet for ECG signal classification. Our network uses convolution, LSTM and involution layers to bring their unique advantages together. For both convolution and involution layers, we use multiple, large size kernels for multi-scale representation learning. CLINet does not require complicated pre-processing and can handle electrocardiograms of any length. Our network achieves 99.90% accuracy on the ICCAD dataset and 99.94% accuracy on the MIT-BIH dataset. With only 297 K parameters, our model can be easily embedded in smart wearable devices. The source code of CLINet is available at https://github.com/CandleLabAI/CLINet-ECG-Classification-2024.

Assuntos

Aprendizado Profundo , Humanos , Processamento de Sinais Assistido por Computador , Eletrocardiografia/métodos , Algoritmos , Software , Arritmias Cardíacas/diagnóstico

18.

Real-time IoT-powered AI system for monitoring and forecasting of air pollution in industrial environment.

Ramadan, Montaser N A; Ali, Mohammed A H; Khoo, Shin Yee; Alkhedher, Mohammad; Alherbawi, Mohammad.

Ecotoxicol Environ Saf ; 283: 116856, 2024 Aug 15.

Artigo em Inglês | MEDLINE | ID: mdl-39151373

RESUMO

Air pollution in industrial environments, particularly in the chrome plating process, poses significant health risks to workers due to high concentrations of hazardous pollutants. Exposure to substances like hexavalent chromium, volatile organic compounds (VOCs), and particulate matter can lead to severe health issues, including respiratory problems and lung cancer. Continuous monitoring and timely intervention are crucial to mitigate these risks. Traditional air quality monitoring methods often lack real-time data analysis and predictive capabilities, limiting their effectiveness in addressing pollution hazards proactively. This paper introduces a real-time air pollution monitoring and forecasting system specifically designed for the chrome plating industry. The system, supported by Internet of Things (IoT) sensors and AI approaches, detects a wide range of air pollutants, including NH3, CO, NO2, CH4, CO2, SO2, O3, PM2.5, and PM10, and provides real-time data on pollutant concentration levels. Data collected by the sensors are processed using LSTM, Random Forest, and Linear Regression models to predict pollution levels. The LSTM model achieved a coefficient of variation (R²) of 99â¯% and a mean absolute percentage error (MAE) of 0.33 for temperature and humidity forecasting. For PM2.5, the Random Forest model outperformed others, achieving an R² of 84â¯% and an MAE of 10.11. The system activates factory exhaust fans to circulate air when high pollution levels are predicted to occur in the next hours, allowing for proactive measures to improve air quality before issues arise. This innovative approach demonstrates significant advancements in industrial environmental monitoring, enabling dynamic responses to pollution and improving air quality in industrial settings.

19.

Long-term prediction of Iranian blood product supply using LSTM: a 5-year forecast.

Miri-Moghaddam, Ebrahim; Bizhaem, Saeede Khosravi; Moezzifar, Zohre; Salmani, Fatemeh.

BMC Med Inform Decis Mak ; 24(1): 213, 2024 Jul 29.

Artigo em Inglês | MEDLINE | ID: mdl-39075453

RESUMO

BACKGROUND: This study aims to predict the trend of procurement and storage of various blood products, as well as planning and monitoring the consumption of blood products in different centers across Iran based on artificial intelligence until the year 2027. METHODS: This research constitutes a time-series investigation within the realm of longitudinal studies. In this study, information on the number of packed red blood cells (RBC), leukoreduced red blood cells (LR-RBC), and platelets (PLT), PLT-Apheresis, and fresh frozen plasma (FFP) was requested from all blood transfusion centers in the country and extracted using a unified protocol. After the initial examination of the information and addressing data issues and inconsistencies, the corrected data were analyzed. Both conventional and artificial intelligence approaches were used to predict each product in this study. The best model was selected based on goodness-of-fit indicators RMSE and MAPE. RESULTS: Based on the obtained results, the FFP product will follow a relatively consistent process similar to previous years in the next five years. The PLT product is predicted to have a growing trend over the next 5 years, which applies to both the demand and supply of the product. The PLT-Apheresis product also shows a similar upward trend, albeit with a lower growth rate. The RBC product will have a constant trend over a 5-year period (long-term) according to both models, taking into account short-term changes. Similarly, there is a similar trend in LR-RBC, with the expectation that short-term pattern repetition will continue over a 5-year period (long-term). Comparing the goodness-of-fit results, the LSTM model proved to be better for predicting the dominant blood products. CONCLUSIONS: The growth of the elderly population and diseases related to old age, and on the other hand, the trend of increasing the consumption of the product with a short lifespan (PLT) requires the activation of the management of the patient's blood, especially in relation to this product in medical centers. The trend for other products in the next five years is similar to previous years, and no growth in demand is observed. The LSTM method, considering periodic and cyclical events, has performed the prediction.

Assuntos

Previsões , Irã (Geográfico) , Humanos , Redes Neurais de Computação , Transfusão de Sangue/estatística & dados numéricos , Bancos de Sangue , Estudos Longitudinais

20.

DEL-Thyroid: deep ensemble learning framework for detection of thyroid cancer progression through genomic mutation.

Shah, Asghar Ali; Daud, Ali; Bukhari, Amal; Alshemaimri, Bader; Ahsan, Muhammad; Younis, Rehmana.

BMC Med Inform Decis Mak ; 24(1): 198, 2024 Jul 22.

Artigo em Inglês | MEDLINE | ID: mdl-39039464

RESUMO

Genes, expressed as sequences of nucleotides, are susceptible to mutations, some of which can lead to cancer. Machine learning and deep learning methods have emerged as vital tools in identifying mutations associated with cancer. Thyroid cancer ranks as the 5th most prevalent cancer in the USA, with thousands diagnosed annually. This paper presents an ensemble learning model leveraging deep learning techniques such as Long Short-Term Memory (LSTM), Gated Recurrent Units (GRUs), and Bi-directional LSTM (Bi-LSTM) to detect thyroid cancer mutations early. The model is trained on a dataset sourced from asia.ensembl.org and IntOGen.org, consisting of 633 samples with 969 mutations across 41 genes, collected from individuals of various demographics. Feature extraction encompasses techniques including Hahn moments, central moments, raw moments, and various matrix-based methods. Evaluation employs three testing methods: self-consistency test (SCT), independent set test (IST), and 10-fold cross-validation test (10-FCVT). The proposed ensemble learning model demonstrates promising performance, achieving 96% accuracy in the independent set test (IST). Statistical measures such as training accuracy, testing accuracy, recall, sensitivity, specificity, Mathew's Correlation Coefficient (MCC), loss, training accuracy, F1 Score, and Cohen's kappa are utilized for comprehensive evaluation.

Assuntos

Aprendizado Profundo , Mutação , Neoplasias da Glândula Tireoide , Humanos , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/diagnóstico , Progressão da Doença

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA