Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Más filtros











Intervalo de año de publicación
1.
J Comput Biol ; 31(9): 797-814, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39069885

RESUMEN

The physiological activities within cells are mainly regulated through protein-protein interactions (PPI). Therefore, studying protein interactions has become an essential part of researching protein function and mechanisms. Traditional biological experiments required for PPI prediction are expensive and time consuming. For this reason, many methods based on predicting PPI from protein sequences have been proposed in recent years. However, existing computational methods usually require the combination of evolutionary feature information of proteins to predict PPI docking situations. Because different relevant features of selected proteins are chosen, there may be differences in the predicted results for PPI. This article proposes a PPI prediction method based on the pretrained protein sequence model ProtBert, combined with the Bidirectional Gated Recurrent Unit (BiGRU) and attention mechanism. Only using protein sequence information and leveraging ProtBert's powerful ability to capture amino acid feature information, BiGRU is used for further feature extraction of the amino acid vectors output by ProtBert. The attention mechanism is then applied to enhance the focus on different amino acid features and improve the expression ability of protein sequence features, ultimately obtaining binary classification results for protein interactions. Experimental results show that our proposed ProtBert-BiGRU-Attention model has good predictive performance for PPI. Through relevant comparative experiments, it has been proven that our model performs well in protein binary prediction. Furthermore, through the ablation experiment of the model, different deep learning modules' contributions to the prediction have been demonstrated.


Asunto(s)
Algoritmos , Biología Computacional , Mapeo de Interacción de Proteínas , Proteínas , Biología Computacional/métodos , Proteínas/metabolismo , Proteínas/química , Mapeo de Interacción de Proteínas/métodos , Humanos , Secuencia de Aminoácidos , Bases de Datos de Proteínas
2.
PeerJ Comput Sci ; 10: e2166, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38983236

RESUMEN

Amid the wave of globalization, the phenomenon of cultural amalgamation has surged in frequency, bringing to the fore the heightened prominence of challenges inherent in cross-cultural communication. To address these challenges, contemporary research has shifted its focus to human-computer dialogue. Especially in the educational paradigm of human-computer dialogue, analysing emotion recognition in user dialogues is particularly important. Accurately identify and understand users' emotional tendencies and the efficiency and experience of human-computer interaction and play. This study aims to improve the capability of language emotion recognition in human-computer dialogue. It proposes a hybrid model (BCBA) based on bidirectional encoder representations from transformers (BERT), convolutional neural networks (CNN), bidirectional gated recurrent units (BiGRU), and the attention mechanism. This model leverages the BERT model to extract semantic and syntactic features from the text. Simultaneously, it integrates CNN and BiGRU networks to delve deeper into textual features, enhancing the model's proficiency in nuanced sentiment recognition. Furthermore, by introducing the attention mechanism, the model can assign different weights to words based on their emotional tendencies. This enables it to prioritize words with discernible emotional inclinations for more precise sentiment analysis. The BCBA model has achieved remarkable results in emotion recognition and classification tasks through experimental validation on two datasets. The model has significantly improved both accuracy and F1 scores, with an average accuracy of 0.84 and an average F1 score of 0.8. The confusion matrix analysis reveals a minimal classification error rate for this model. Additionally, as the number of iterations increases, the model's recall rate stabilizes at approximately 0.7. This accomplishment demonstrates the model's robust capabilities in semantic understanding and sentiment analysis and showcases its advantages in handling emotional characteristics in language expressions within a cross-cultural context. The BCBA model proposed in this study provides effective technical support for emotion recognition in human-computer dialogue, which is of great significance for building more intelligent and user-friendly human-computer interaction systems. In the future, we will continue to optimize the model's structure, improve its capability in handling complex emotions and cross-lingual emotion recognition, and explore applying the model to more practical scenarios to further promote the development and application of human-computer dialogue technology.

3.
Int J Mol Sci ; 25(13)2024 Jun 30.
Artículo en Inglés | MEDLINE | ID: mdl-39000335

RESUMEN

In various domains, including everyday activities, agricultural practices, and medical treatments, the escalating challenge of antibiotic resistance poses a significant concern. Traditional approaches to studying antibiotic resistance genes (ARGs) often require substantial time and effort and are limited in accuracy. Moreover, the decentralized nature of existing data repositories complicates comprehensive analysis of antibiotic resistance gene sequences. In this study, we introduce a novel computational framework named TGC-ARG designed to predict potential ARGs. This framework takes protein sequences as input, utilizes SCRATCH-1D for protein secondary structure prediction, and employs feature extraction techniques to derive distinctive features from both sequence and structural data. Subsequently, a Siamese network is employed to foster a contrastive learning environment, enhancing the model's ability to effectively represent the data. Finally, a multi-layer perceptron (MLP) integrates and processes sequence embeddings alongside predicted secondary structure embeddings to forecast ARG presence. To evaluate our approach, we curated a pioneering open dataset termed ARSS (Antibiotic Resistance Sequence Statistics). Comprehensive comparative experiments demonstrate that our method surpasses current state-of-the-art methodologies. Additionally, through detailed case studies, we illustrate the efficacy of our approach in predicting potential ARGs.


Asunto(s)
Farmacorresistencia Microbiana , Farmacorresistencia Microbiana/genética , Biología Computacional/métodos , Estructura Secundaria de Proteína , Aprendizaje Automático , Antibacterianos/farmacología , Farmacorresistencia Bacteriana/genética , Redes Neurales de la Computación
4.
Sensors (Basel) ; 24(11)2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38894157

RESUMEN

With the development of deep learning, several graph neural network (GNN)-based approaches have been utilized for text classification. However, GNNs encounter challenges when capturing contextual text information within a document sequence. To address this, a novel text classification model, RB-GAT, is proposed by combining RoBERTa-BiGRU embedding and a multi-head Graph ATtention Network (GAT). First, the pre-trained RoBERTa model is exploited to learn word and text embeddings in different contexts. Second, the Bidirectional Gated Recurrent Unit (BiGRU) is employed to capture long-term dependencies and bidirectional sentence information from the text context. Next, the multi-head graph attention network is applied to analyze this information, which serves as a node feature for the document. Finally, the classification results are generated through a Softmax layer. Experimental results on five benchmark datasets demonstrate that our method can achieve an accuracy of 71.48%, 98.45%, 80.32%, 90.84%, and 95.67% on Ohsumed, R8, MR, 20NG and R52, respectively, which is superior to the existing nine text classification approaches.

5.
Int J Mol Sci ; 25(10)2024 May 08.
Artículo en Inglés | MEDLINE | ID: mdl-38791165

RESUMEN

Studying drug-target interactions (DTIs) is the foundational and crucial phase in drug discovery. Biochemical experiments, while being the most reliable method for determining drug-target affinity (DTA), are time-consuming and costly, making it challenging to meet the current demands for swift and efficient drug development. Consequently, computational DTA prediction methods have emerged as indispensable tools for this research. In this article, we propose a novel deep learning algorithm named GRA-DTA, for DTA prediction. Specifically, we introduce Bidirectional Gated Recurrent Unit (BiGRU) combined with a soft attention mechanism to learn target representations. We employ Graph Sample and Aggregate (GraphSAGE) to learn drug representation, especially to distinguish the different features of drug and target representations and their dimensional contributions. We merge drug and target representations by an attention neural network (ANN) to learn drug-target pair representations, which are fed into fully connected layers to yield predictive DTA. The experimental results showed that GRA-DTA achieved mean squared error of 0.142 and 0.225 and concordance index reached 0.897 and 0.890 on the benchmark datasets KIBA and Davis, respectively, surpassing the most state-of-the-art DTA prediction algorithms.


Asunto(s)
Algoritmos , Aprendizaje Profundo , Redes Neurales de la Computación , Descubrimiento de Drogas/métodos , Humanos , Preparaciones Farmacéuticas/química
6.
Med Biol Eng Comput ; 62(9): 2911-2938, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-38713340

RESUMEN

Most diabetes patients are liable to have diabetic retinopathy (DR); however, the majority of them might not be even aware of the ailment. Therefore, early detection and treatment of DR are necessary to prevent vision loss. But, avoiding DR is not a simple process. An ophthalmologist can typically identify DR through an optical evaluation of the fundus and through the evaluation of color pictures. However, due to the increased count of DR patients, this could not be possible as it consumes more time. To rectify this problem, a novel deep ensemble-based DR classification technique is developed in this work. Initially, a Wiener filter (WF) is applied for preprocessing the image. Then, the enhanced U-Net-based segmentation process is done. Subsequent to the segmentation process, features are extracted that include statistical features, inferior superior nasal temporal (ISNT), cup to disc ratio (CDR), and improved LGBP as well. Further, deep ensemble classifiers (DEC) like CNN, Bi-GRU, and DMN are used to recognize the disease. The outcomes from DMN, CNN, and Bi-GRU are then subjected to improved SLF. Additionally, the weights of DMN, CNN, and Bi-GRU are adjusted via pelican updated Tasmanian devil optimization (PU-TDO). Finally, outputs on DR (microaneurysms, hemorrhages, hard exudates, and soft exudates) are obtained. The performance of DEC + PU-TDO for diabetic retinopathy is computed over extant models with regard to different measures for four datasets. The results on accuracy using the DEC + PU-TDO scheme for the IDRID dataset is maximum around 0.975 at 90th LP while other models have less accuracy. The FPR of DEC + PU-TDO is less around 0.039 at the 90th LP for the SUSTech-SYSU dataset, while other extant models have maximum FPR.


Asunto(s)
Retinopatía Diabética , Retinopatía Diabética/diagnóstico , Retinopatía Diabética/clasificación , Humanos , Algoritmos , Redes Neurales de la Computación , Interpretación de Imagen Asistida por Computador/métodos , Procesamiento de Imagen Asistido por Computador/métodos , Aprendizaje Profundo
7.
Front Neurosci ; 18: 1369832, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38741790

RESUMEN

Introduction: In urban traffic management, the timely detection of road faults plays a crucial role in improving traffic efficiency and safety. However, conventional methods often fail to fully leverage the information from road topology and traffic data. Methods: To address this issue, we propose an innovative detection system that combines Artificial Neural Networks (ANNs), specifically Graph Convolutional Networks (GCN), Bidirectional Gated Recurrent Units (BiGRU), and self-attention mechanisms. Our approach begins by representing the road topology as a graph and utilizing GCN to model it. This allows us to learn the relationships between roads and capture their structural dependencies. By doing so, we can effectively incorporate the spatial information provided by the road network. Next, we employ BiGRU to model the historical traffic data, enabling us to capture the temporal dynamics and patterns in the traffic flow. The BiGRU architecture allows for bidirectional processing, which aids in understanding the traffic conditions based on both past and future information. This temporal modeling enhances our system's ability to handle time-varying traffic patterns. To further enhance the feature representations, we leverage self-attention mechanisms. By combining the hidden states of the BiGRU with self-attention, we can assign importance weights to different temporal features, focusing on the most relevant information. This attention mechanism helps to extract salient features from the traffic data. Subsequently, we merge the features learned by GCN from the road topology and BiGRU from the traffic data. This fusion of spatial and temporal information provides a comprehensive representation of the road status. Results and discussions: By employing a Multilayer Perceptron (MLP) as a classifier, we can effectively determine whether a road is experiencing a fault. The MLP model is trained using labeled road fault data through supervised learning, optimizing its performance for fault detection. Experimental evaluations of our system demonstrate excellent performance in road fault detection. Compared to traditional methods, our system achieves more accurate fault detection, thereby improving the efficiency of urban traffic management. This is of significant importance for city administrators, as they can promptly identify road faults and take appropriate measures for repair and traffic diversion.

8.
Sensors (Basel) ; 24(7)2024 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-38610289

RESUMEN

Phishing is one of the most dangerous attacks targeting individuals, organizations, and nations. Although many traditional methods for email phishing detection exist, there is a need to improve accuracy and reduce false-positive rates. Our work investigates one-dimensional CNN-based models (1D-CNNPD) to detect phishing emails in order to address these challenges. Additionally, further improvement is achieved with the augmentation of the base 1D-CNNPD model with recurrent layers, namely, LSTM, Bi-LSTM, GRU, and Bi-GRU, and experimented with the four resulting models. Two benchmark datasets were used to evaluate the performance of our models: Phishing Corpus and Spam Assassin. Our results indicate that, in general, the augmentations improve the performance of the 1D-CNNPD base model. Specifically, the 1D-CNNPD with Bi-GRU yields the best results. Overall, the performance of our models is comparable to the state of the art of CNN-based phishing email detection. The Advanced 1D-CNNPD with Leaky ReLU and Bi-GRU achieved 100% precision, 99.68% accuracy, an F1 score of 99.66%, and a recall of 99.32%. We observe that increasing model depth typically leads to an initial performance improvement, succeeded by a decline. In conclusion, this study highlights the effectiveness of augmented 1D-CNNPD models in detecting phishing emails with improved accuracy. The reported performance measure values indicate the potential of these models in advancing the implementation of cybersecurity solutions to combat email phishing attacks.

9.
BMC Genomics ; 25(1): 242, 2024 Mar 05.
Artículo en Inglés | MEDLINE | ID: mdl-38443802

RESUMEN

BACKGROUND: 5-Methylcytosine (5mC) plays a very important role in gene stability, transcription, and development. Therefore, accurate identification of the 5mC site is of key importance in genetic and pathological studies. However, traditional experimental methods for identifying 5mC sites are time-consuming and costly, so there is an urgent need to develop computational methods to automatically detect and identify these 5mC sites. RESULTS: Deep learning methods have shown great potential in the field of 5mC sites, so we developed a deep learning combinatorial model called i5mC-DCGA. The model innovatively uses the Convolutional Block Attention Module (CBAM) to improve the Dense Convolutional Network (DenseNet), which is improved to extract advanced local feature information. Subsequently, we combined a Bidirectional Gated Recurrent Unit (BiGRU) and a Self-Attention mechanism to extract global feature information. Our model can learn feature representations of abstract and complex from simple sequence coding, while having the ability to solve the sample imbalance problem in benchmark datasets. The experimental results show that the i5mC-DCGA model achieves 97.02%, 96.52%, 96.58% and 85.58% in sensitivity (Sn), specificity (Sp), accuracy (Acc) and matthews correlation coefficient (MCC), respectively. CONCLUSIONS: The i5mC-DCGA model outperforms other existing prediction tools in predicting 5mC sites, and it is currently the most representative promoter 5mC site prediction tool. The benchmark dataset and source code for the i5mC-DCGA model can be found in https://github.com/leirufeng/i5mC-DCGA .


Asunto(s)
5-Metilcitosina , Benchmarking , Regiones Promotoras Genéticas , Proyectos de Investigación , Programas Informáticos
10.
Comput Methods Programs Biomed ; 248: 108123, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38471292

RESUMEN

BACKGROUND AND OBJECTIVE: Early diagnosis of mild cognitive impairment (MCI) is one of the essential measures to prevent its further development into Alzheimer's disease (AD). In this paper, we propose a hybrid deep learning model for early diagnosis of MCI, called spatio-temporal convolutional gated recurrent unit network (STCGRU). METHODS: The STCGRU comprises three bespoke convolutional neural network (CNN) modules and a bi-directional gated recurrent unit (BiGRU) module, which can effectively extract the spatial and temporal features of EEG and obtain excellent diagnostic results. We use a publicly available EEG dataset that has not undergone pre-processing to verify the robustness and accuracy of the model. Ablation experiments on STCGRU are conducted to showcase the individual performance improvement of each module. RESULTS: Compared with other state-of-the-art approaches using the same publicly available EEG dataset, the results show that STCGRU is more suitable for early diagnosis of MCI. After 10-fold cross-validation, the average classification accuracy of the hybrid model reached 99.95 %, while the average kappa value reached 0.9989. CONCLUSIONS: The experimental results show that the hybrid model proposed in this paper can directly extract compelling spatio-temporal features from the raw EEG data for classification. The STCGRU allows for accurate diagnosis of patients with MCI and has a high practical value.


Asunto(s)
Enfermedad de Alzheimer , Disfunción Cognitiva , Humanos , Redes Neurales de la Computación , Disfunción Cognitiva/diagnóstico por imagen , Enfermedad de Alzheimer/diagnóstico por imagen , Diagnóstico Precoz , Proyectos de Investigación , Electroencefalografía/métodos
11.
Digit Health ; 10: 20552076241234624, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38449680

RESUMEN

Objectives: Cardiac arrhythmia is one of the most severe cardiovascular diseases that can be fatal. Therefore, its early detection is critical. However, detecting types of arrhythmia by physicians based on visual identification is time-consuming and subjective. Deep learning can develop effective approaches to classify arrhythmias accurately and quickly. This study proposed a deep learning approach developed based on a Chapman-Shaoxing electrocardiogram (ECG) dataset signal to detect seven types of arrhythmias. Method: Our DNN model is a hybrid CNN-BILSTM-BiGRU algorithm assisted by a multi-head self-attention mechanism regarding the challenging problem of classifying various arrhythmias of ECG signals. Additionally, the synthetic minority oversampling technique (SMOTE)-Tomek technique was utilized to address the data imbalance problem to detect and classify cardiac arrhythmias. Result: The proposed model, trained with a single lead, was tested using a dataset containing 10,466 participants. The performance of the algorithm was evaluated using a random split validation approach. The proposed algorithm achieved an accuracy of 98.57% by lead II and 98.34% by lead aVF for the classification of arrhythmias. Conclusion: We conducted an analysis of single-lead ECG signals to evaluate the effectiveness of our proposed hybrid model in diagnosing and classifying different types of arrhythmias. We trained separate classification models using each individual signal lead. Additionally, we implemented the SMOTE-Tomek technique along with cross-entropy loss as a cost function to address the class imbalance problem. Furthermore, we utilized a multi-headed self-attention mechanism to adjust the network structure and classify the seven arrhythmia classes. Our model achieved high accuracy and demonstrated good generalization ability in detecting ECG arrhythmias. However, further testing of the model with diverse datasets is crucial to validate its performance.

12.
Sensors (Basel) ; 24(5)2024 Feb 22.
Artículo en Inglés | MEDLINE | ID: mdl-38474965

RESUMEN

Deep learning promotes the breakthrough of emotion recognition in many fields, especially speech emotion recognition (SER). As an important part of speech emotion recognition, the most relevant acoustic feature extraction has always attracted the attention of existing researchers. Aiming at the problem that the emotional information contained in the current speech signals is distributed dispersedly and cannot comprehensively integrate local and global information, this paper presents a network model based on a gated recurrent unit (GRU) and multi-head attention. We evaluate our proposed emotion model on the IEMOCAP and Emo-DB corpora. The experimental results show that the network model based on Bi-GRU and multi-head attention is significantly better than the traditional network model at detecting multiple evaluation indicators. At the same time, we also apply the model to a speech sentiment analysis task. On the CH-SIMS and MOSI datasets, the model shows excellent generalization performance.


Asunto(s)
Percepción , Habla , Acústica , Emociones , Reconocimiento en Psicología
13.
Sensors (Basel) ; 24(5)2024 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-38475151

RESUMEN

An equalizer based on a recurrent neural network (RNN), especially with a bidirectional gated recurrent unit (biGRU) structure, is a good choice to deal with nonlinear damage and inter-symbol interference (ISI) in optical communication systems because of its excellent performance in processing time series information. However, its recursive structure prevents the parallelization of the computation, resulting in a low equalization rate. In order to improve the speed without compromising the equalization performance, we propose a minimalist 1D convolutional neural network (CNN) equalizer, which is reconverted from a biGRU with knowledge distillation (KD). In this work, we applied KD to regression problems and explain how KD helps students learn from teachers in solving regression problems. In addition, we compared the biGRU, 1D-CNN after KD and 1D-CNN without KD in terms of Q-factor and equalization velocity. The experimental data showed that the Q-factor of the 1D-CNN increased by 1 dB after KD learning from the biGRU, and KD increased the RoP sensitivity of the 1D-CNN by 0.89 dB with the HD-FEC threshold of 1 × 10-3. At the same time, compared with the biGRU, the proposed 1D-CNN equalizer reduced the computational time consumption by 97% and the number of trainable parameters by 99.3%, with only a 0.5 dB Q-factor penalty. The results demonstrate that the proposed minimalist 1D-CNN equalizer holds significant promise for future practical deployments in optical wireless communication systems.

14.
Methods ; 226: 1-8, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38485031

RESUMEN

N6-methyladenosine (m6A) is the most prevalent, abundant, and conserved internal modification in the eukaryotic messenger RNA (mRNAs) and plays a crucial role in the cellular process. Although more than ten methods were developed for m6A detection over the past decades, there were rooms left to improve the predictive accuracy and the efficiency. In this paper, we proposed an improved method for predicting m6A modification sites, which was based on bi-directional gated recurrent unit (Bi-GRU) and convolutional neural networks (CNN), called Deepm6A-MT. The Deepm6A-MT has two input channels. One is to use an embedding layer followed by the Bi-GRU and then by the CNN, and another is to use one-hot encoding, dinucleotide one-hot encoding, and nucleotide chemical property codes. We trained and evaluated the Deepm6A-MT both by the 5-fold cross-validation and the independent test. The empirical tests showed that the Deepm6A-MT achieved the state of the art performance. In addition, we also conducted the cross-species and the cross-tissues tests to further verify the Deepm6A-MT for effectiveness and efficiency. Finally, for the convenience of academic research, we deployed the Deepm6A-MT to the web server, which is accessed at the URL http://www.biolscience.cn/Deepm6A-MT/.


Asunto(s)
Adenosina , Aprendizaje Profundo , Adenosina/análogos & derivados , Adenosina/metabolismo , Adenosina/genética , Adenosina/química , Humanos , Animales , Redes Neurales de la Computación , ARN Mensajero/genética , ARN Mensajero/metabolismo , Biología Computacional/métodos
15.
Environ Geochem Health ; 46(4): 127, 2024 Mar 14.
Artículo en Inglés | MEDLINE | ID: mdl-38483668

RESUMEN

Dissolved oxygen is one of the important comprehensive indicators of river water quality, which reflects the degree of pollution in the water body. Monitoring and predicting dissolved oxygen are an important tool for water quality management, which helps to effectively maintain water ecological balance and prevent environmental problems. A single model cannot describe the dynamic characteristics of dissolved oxygen sequence, which affects the prediction accuracy. In order to obtain more accurate dissolved oxygen prediction results, decomposition techniques are commonly used to extract the main fluctuations and trends of water quality sequences. However, the high-frequency modes obtained from decomposition are still unstable. To solve this problem, this paper proposed a hybrid prediction model of dissolved oxygen concentration based on secondary decomposition and bidirectional gate recurrent unit. Firstly, dissolved oxygen sequence is preliminarily decomposed by complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and obtain several intrinsic mode functions (IMF). The fuzzy entropy (FE) is calculated to quantify the complexity of the IMF. Then, variational mode decomposition improved by northern goshawk optimization is used to decompose the IMF with higher entropy. The nonlinearity and instability of the sequence are further weakened. Finally, the bidirectional gate recurrent unit (BiGRU) neural network is used to predict each IMF component, and the final prediction result is obtained by reconstructing the prediction results of each component. In order to verify the effectiveness of the proposed model, this paper selects the dissolved oxygen data of Xin'anjiang Reservoir as the research object. The experimental results show that the RMSE, MAE, MAPE, and R2 of the proposed model are 0.1164, 0.0894, 1.0403%, and 0.9939, respectively, which is best among other comparative prediction models (BP, LSTM, GRU, BiGRU, EMD-BiGRU, CEEMDAN-BiGRU, VMD-BiGRU, and GNO-VMD-BiGRU). Therefore, this model effectively deals with high volatility and nonlinear dissolved oxygen data and provides reference for water environment management and ecological protection.


Asunto(s)
Agua Dulce , Redes Neurales de la Computación , Entropía , Oxígeno , Calidad del Agua
16.
Interdiscip Sci ; 16(3): 635-648, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-38381315

RESUMEN

Circular RNAs (circRNAs) are non-coding RNAs generated by reverse splicing. They are involved in biological process and human diseases by interacting with specific RNA-binding proteins (RBPs). Due to traditional biological experiments being costly, computational methods have been proposed to predict the circRNA-RBP interaction. However, these methods have problems of single feature extraction. Therefore, we propose a novel model called circ-FHN, which utilizes only circRNA sequences to predict circRNA-RBP interactions. The circ-FHN approach involves feature coding and a hybrid deep learning model. Feature coding takes into account the physicochemical properties of circRNA sequences and employs four coding methods to extract sequence features. The hybrid deep structure comprises a convolutional neural network (CNN) and a bidirectional gated recurrent unit (BiGRU). The CNN learns high-level abstract features, while the BiGRU captures long-term dependencies in the sequence. To assess the effectiveness of circ-FHN, we compared it to other computational methods on 16 datasets and conducted ablation experiments. Additionally, we conducted motif analysis. The results demonstrate that circ-FHN exhibits exceptional performance and surpasses other methods. circ-FHN is freely available at https://github.com/zhaoqi106/circ-FHN .


Asunto(s)
Redes Neurales de la Computación , ARN Circular , Proteínas de Unión al ARN , ARN Circular/genética , ARN Circular/metabolismo , Sitios de Unión , Humanos , Proteínas de Unión al ARN/metabolismo , Proteínas de Unión al ARN/genética , Biología Computacional/métodos , Aprendizaje Profundo
17.
Heliyon ; 10(4): e25957, 2024 Feb 29.
Artículo en Inglés | MEDLINE | ID: mdl-38380007

RESUMEN

Predicting the duration of traffic accidents is a critical component of traffic management and emergency response on expressways. Traffic accident information is inherently multi-mode data in terms of data types. However, most existing studies focus on single-mode data, and the influence of multi-mode data on the prediction performances of models has been the subject of only very limited quantitative analysis. The present work addresses these issues by proposing a heterogeneous deep learning architecture employing multi-modal features to improve the accuracy of predictions for traffic accident durations on expressways. Firstly, six unique data modes are obtained based on the structured data and the text data. Secondly, a hybrid deep learning approach is applied to build classification models with reduced prediction error. Finally, a rigorous analysis of the influence for multi-mode data on the accident duration prediction performances is conducted using a variety of deep learning models. The proposed method is evaluated using survey data collected from an expressway monitoring system in Shaanxi Province, China. The experimental results show that Word2Vec-BiGRU-CNN is a suitable and better model using text features for traffic accident duration prediction, as the F1-score is 0.3648. This study confirms that the newly established structured features extracted from text data substantially enhance the prediction effects of deep learning algorithms. However, these new features were a detriment to the prediction effects of conventional machine learning algorithms. Accordingly, these results demonstrate that the processing and extraction of text features is a complex issue in the field of traffic accident duration prediction.

18.
Comput Biol Chem ; 108: 107977, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37995493

RESUMEN

Named Entity Recognition (NER) is a fundamental but crucial task in natural language processing (NLP) and big data analysis, with wide application range. NER for rice genes and phenotypes is a technique to identify genes and phenotypes from a large amount of text. NER for rice genes and phenotypes can facilitate the acquisition of information in the field of crops and provide references for our research on higher quality crops. At the same time, named entity recognition still faces many challenges. In this paper, we propose an improved bidirectional gated recurrent unit neural network (BI-GRU) method, which is used to automatically identify the required entities (i.e. gene names, rice phenotypes) from relevant rice literature and patents. The neural network model is combined with the Softmax function to directly output the probabilities of labels, forming the BI-GRU-SF model. With the ability of deep learning methods, the semantic information in the context can be learned without the need for feature engineering. Finally, we conducted experiments, and the results showed that our proposed model provided better performance compared to other models. All datasets and resource codes of BI-GRU-SF are available at https://github.com/qqeeqq/NER for academic use.


Asunto(s)
Oryza , Oryza/genética , Redes Neurales de la Computación , Macrodatos , Procesamiento de Lenguaje Natural , Productos Agrícolas
19.
Comput Biol Med ; 168: 107795, 2024 01.
Artículo en Inglés | MEDLINE | ID: mdl-38056206

RESUMEN

BACKGROUND: Physiological parameter monitoring based on photoplethysmography (PPG) detection has the advantage of fast, portable, and non-invasive. Changes in the morphology of the PPG waveform can reflect the effect of arterial elasticity changes on blood pressure (BP). However, machine learning models and non-recurrent neural network models typically ignore the time-dependency of continuous PPG data, leading to the decrease of accuracy or the increased calibration frequency. OBJECTIVE: This paper proposes a BiGRU model with attention to estimate BP trends, which uses a single-channel PPG signal combined with demographic information to estimate continuous BP trends point-by-point and to discuss the impact of calibration cycle. METHODS: This paper selects 15 typical subjects from two groups with/without cardiovascular disease (CVD) and evaluates the model performance. Regarding the calibration frequency problem, we set two modes of non-calibration and calibration to validate the results of blood pressure trends estimation. RESULTS: In the calibration mode, the estimation errors (ME ± STD) of SBP for CVD/non-CVD groups are 0.91 ± 10.58 mmHg/0.17 ± 10.06 mmHg respectively, and DBP are 0.34 ± 5.28 mmHg/-0.19 ± 5.36 mmHg; in the non-calibration mode, the estimation errors of SBP for CVD/non-CVD groups are 0.27 ± 9.87 mmHg/-0.82 ± 9.92 mmHg respectively, and DBP are -0.63 ± 3.28 mmHg/0.80 ± 4.93 mmHg. CONCLUSIONS: The results show that the proposed model has high accuracy in estimating BP levels, which is expected to achieve real-time, long-term continuous BP trends monitoring in wearable devices.


Asunto(s)
Determinación de la Presión Sanguínea , Enfermedades Cardiovasculares , Humanos , Presión Sanguínea/fisiología , Determinación de la Presión Sanguínea/métodos , Fotopletismografía/métodos , Análisis de la Onda del Pulso
20.
Artículo en Chino | WPRIM (Pacífico Occidental) | ID: wpr-1031682

RESUMEN

@#Objective To propose a heart sound segmentation method based on multi-feature fusion network. Methods Data were obtained from the CinC/PhysioNet 2016 Challenge dataset (a total of 3 153 recordings from 764 patients, about 91.93% of whom were male, with an average age of 30.36 years). Firstly the features were extracted in time domain and time-frequency domain respectively, and reduced redundant features by feature dimensionality reduction. Then, we selected optimal features separately from the two feature spaces that performed best through feature selection. Next, the multi-feature fusion was completed through multi-scale dilated convolution, cooperative fusion, and channel attention mechanism. Finally, the fused features were fed into a bidirectional gated recurrent unit (BiGRU) network to heart sound segmentation results. Results The proposed method achieved precision, recall and F1 score of 96.70%, 96.99%, and 96.84% respectively. Conclusion The multi-feature fusion network proposed in this study has better heart sound segmentation performance, which can provide high-accuracy heart sound segmentation technology support for the design of automatic analysis of heart diseases based on heart sounds.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA