Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 41
Filter
1.
Protein Sci ; 33(6): e5006, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38723168

ABSTRACT

The emergence and spread of antibiotic-resistant bacteria pose a significant public health threat, necessitating the exploration of alternative antibacterial strategies. Antibacterial peptide (ABP) is a kind of antimicrobial peptide (AMP) that has the potential ability to fight against bacteria infection, offering a promising avenue for developing novel therapeutic interventions. This study introduces AMPActiPred, a three-stage computational framework designed to identify ABPs, characterize their activity against diverse bacterial species, and predict their activity levels. AMPActiPred employed multiple effective peptide descriptors to effectively capture the compositional features and physicochemical properties of peptides. AMPActiPred utilized deep forest architecture, a cascading architecture similar to deep neural networks, capable of effectively processing and exploring original features to enhance predictive performance. In the first stage, AMPActiPred focuses on ABP identification, achieving an Accuracy of 87.6% and an MCC of 0.742 on an elaborate dataset, demonstrating state-of-the-art performance. In the second stage, AMPActiPred achieved an average GMean at 82.8% in identifying ABPs targeting 10 bacterial species, indicating AMPActiPred can achieve balanced predictions regarding the functional activity of ABP across this set of species. In the third stage, AMPActiPred demonstrates robust predictive capabilities for ABP activity levels with an average PCC of 0.722. Furthermore, AMPActiPred exhibits excellent interpretability, elucidating crucial features associated with antibacterial activity. AMPActiPred is the first computational framework capable of predicting targets and activity levels of ABPs. Finally, to facilitate the utilization of AMPActiPred, we have established a user-friendly web interface deployed at https://awi.cuhk.edu.cn/∼AMPActiPred/.


Subject(s)
Anti-Bacterial Agents , Anti-Bacterial Agents/pharmacology , Anti-Bacterial Agents/chemistry , Antimicrobial Peptides/chemistry , Antimicrobial Peptides/pharmacology , Bacteria/drug effects , Computational Biology/methods , Neural Networks, Computer , Microbial Sensitivity Tests
2.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38706321

ABSTRACT

Antiviral peptides (AVPs) have shown potential in inhibiting viral attachment, preventing viral fusion with host cells and disrupting viral replication due to their unique action mechanisms. They have now become a broad-spectrum, promising antiviral therapy. However, identifying effective AVPs is traditionally slow and costly. This study proposed a new two-stage computational framework for AVP identification. The first stage identifies AVPs from a wide range of peptides, and the second stage recognizes AVPs targeting specific families or viruses. This method integrates contrastive learning and multi-feature fusion strategy, focusing on sequence information and peptide characteristics, significantly enhancing predictive ability and interpretability. The evaluation results of the model show excellent performance, with accuracy of 0.9240 and Matthews correlation coefficient (MCC) score of 0.8482 on the non-AVP independent dataset, and accuracy of 0.9934 and MCC score of 0.9869 on the non-AMP independent dataset. Furthermore, our model can predict antiviral activities of AVPs against six key viral families (Coronaviridae, Retroviridae, Herpesviridae, Paramyxoviridae, Orthomyxoviridae, Flaviviridae) and eight viruses (FIV, HCV, HIV, HPIV3, HSV1, INFVA, RSV, SARS-CoV). Finally, to facilitate user accessibility, we built a user-friendly web interface deployed at https://awi.cuhk.edu.cn/∼dbAMP/AVP/.


Subject(s)
Antiviral Agents , Computational Biology , Peptides , Antiviral Agents/pharmacology , Peptides/chemistry , Computational Biology/methods , Humans , Viruses , Machine Learning , Algorithms
3.
iScience ; 27(4): 109333, 2024 Apr 19.
Article in English | MEDLINE | ID: mdl-38523792

ABSTRACT

Kinases as important enzymes can transfer phosphate groups from high-energy and phosphate-donating molecules to specific substrates and play essential roles in various cellular processes. Existing algorithms for kinase activity from phosphorylated proteomics data are often costly, requiring valuable samples. Moreover, methods to extract kinase activities from bulk RNA sequencing data remain undeveloped. In this study, we propose a computational framework KinPred-RNA to derive kinase activities from bulk RNA-sequencing data in cancer samples. KinPred-RNA framework, using the extreme gradient boosting (XGBoost) regression model, outperforms random forest regression, multiple linear regression, and support vector machine regression models in predicting kinase activities from cancer-related RNA sequencing data. Efficient gene signatures from the LINCS-L1000 dataset were used as inputs for KinPred-RNA. The results highlight its potential to be related to biological function. In conclusion, KinPred RNA constitutes a significant advance in cancer research by potentially facilitating the identification of cancer.

4.
Int J Mol Sci ; 25(5)2024 Mar 01.
Article in English | MEDLINE | ID: mdl-38474116

ABSTRACT

RNA modification plays a crucial role in cellular regulation. However, traditional high-throughput sequencing methods for elucidating their functional mechanisms are time-consuming and labor-intensive, despite extensive research. Moreover, existing methods often limit their focus to specific species, neglecting the simultaneous exploration of RNA modifications across diverse species. Therefore, a versatile computational approach is necessary for interpretable analysis of RNA modifications across species. A multi-scale biological language-based deep learning model is proposed for interpretable, sequential-level prediction of diverse RNA modifications. Benchmark comparisons across species demonstrate the model's superiority in predicting various RNA methylation types over current state-of-the-art methods. The cross-species validation and attention weight visualization also highlight the model's capability to capture sequential and functional semantics from genomic backgrounds. Our analysis of RNA modifications helps us find the potential existence of "biological grammars" in each modification type, which could be effective for mapping methylation-related sequential patterns and understanding the underlying biological mechanisms of RNA modifications.


Subject(s)
Deep Learning , RNA , RNA/genetics , RNA Methylation , Methylation , Protein Processing, Post-Translational
5.
Anal Chem ; 96(4): 1538-1546, 2024 01 30.
Article in English | MEDLINE | ID: mdl-38226973

ABSTRACT

Tuberculosis (TB) is a severe disease caused by Mycobacterium tuberculosis that poses a significant threat to human health. The emergence of drug-resistant strains has made the global fight against TB even more challenging. Antituberculosis peptides (ATPs) have shown promising results as a potential treatment for TB. However, conventional wet lab-based approaches to ATP discovery are time-consuming and costly and often fail to discover peptides with desired properties. To address these challenges, we propose a novel machine learning-based framework called ATPfinder that can significantly accelerate the discovery of ATP. Our approach integrates various efficient peptide descriptors and utilizes the deep forest algorithm to construct the model. This neural network-like cascading structure can effectively process and mine features without complex hyperparameter tuning. Our experimental results show that ATPfinder outperforms existing ATP prediction tools, achieving state-of-the-art performance with an accuracy of 89.3% and an MCC of 0.70. Moreover, our framework exhibits better robustness than baseline algorithms commonly used for other sequence analysis tasks. Additionally, the excellent interpretability of our model can assist researchers in understanding the critical features of ATP. Finally, we developed a downloadable desktop application to simplify the use of our framework for researchers. Therefore, ATPfinder can facilitate the discovery of peptide drugs and provide potential solutions for TB treatment. Our framework is freely available at https://github.com/lantianyao/ATPfinder/ (data sets and code) and https://awi.cuhk.edu.cn/dbAMP/ATPfinder.html (software).


Subject(s)
Mycobacterium tuberculosis , Tuberculosis , Humans , Peptides/pharmacology , Antitubercular Agents/pharmacology , Algorithms , Tuberculosis/drug therapy , Forests , Adenosine Triphosphate
6.
Article in English | MEDLINE | ID: mdl-38059128

ABSTRACT

Methamphetamine use disorder (MUD) is an illness associated with severe health consequences. Virtual reality (VR) is used to induce the drug-cue reactivity and significant EEG and ECG abnormalities were found in MUD patients. However, whether a link exists between EEG and ECG abnormalities in patients with MUD during exposure to drug cues remains unknown. This is important from the therapeutic viewpoint because different treatment strategies may be applied when EEG abnormalities and ECG irregularities are complications of MUD. We designed a VR system with drug cues and EEG and ECG were recorded during VR exposure. Sixteen patients with MUD and sixteen healthy subjects were recruited. Statistical tests and Pearson correlation were employed to analyze the EEG and ECG. The results showed that, during VR induction, the patients with MUD but not healthy controls showed significant [Formula: see text] and [Formula: see text] power increases when the stimulus materials were most intense. This finding indicated that the stimuli are indiscriminate to healthy controls but meaningful to patients with MUD. Five heart rate variability (HRV) indexes significantly differed between patients and controls, suggesting abnormalities in the reaction of patient's autonomic nervous system. Importantly, significant relations between EEG and HRV indexes changes were only identified in the controls, but not in MUD patients, signifying a disruption of brain-heart relations in patients. Our findings of stimulus-specific EEG changes and the impaired brain-heart relations in patients with MUD shed light on the understanding of drug-cue reactivity and may be used to design diagnostic and/or therapeutic strategies for MUD.


Subject(s)
Methamphetamine , Virtual Reality , Humans , Methamphetamine/adverse effects , Cues , Brain , Heart Rate/physiology
7.
J Chem Inf Model ; 63(24): 7886-7898, 2023 Dec 25.
Article in English | MEDLINE | ID: mdl-38054927

ABSTRACT

Inflammation is a biological response to harmful stimuli, aiding in the maintenance of tissue homeostasis. However, excessive or persistent inflammation can precipitate a myriad of pathological conditions. Although current treatments such as NSAIDs, corticosteroids, and immunosuppressants are effective, they can have side effects and resistance issues. In this backdrop, anti-inflammatory peptides (AIPs) have emerged as a promising therapeutic approach against inflammation. Leveraging machine learning methods, we have the opportunity to accelerate the discovery and investigation of these AIPs more effectively. In this study, we proposed an advanced framework by ensemble machine learning and deep learning for AIP prediction. Initially, we constructed three individual models with extremely randomized trees (ET), gated recurrent unit (GRU), and convolutional neural networks (CNNs) with attention mechanism and then used stacking architecture to build the final predictor. By utilizing various sequence encodings and combining the strengths of different algorithms, our predictor demonstrated exemplary performance. On our independent test set, our model achieved an accuracy, MCC, and F1-score of 0.757, 0.500, and 0.707, respectively, clearly outperforming other contemporary AIP prediction methods. Additionally, our model offers profound insights into the feature interpretation of AIPs, establishing a valuable knowledge foundation for the design and development of future anti-inflammatory strategies.


Subject(s)
Deep Learning , Humans , Anti-Inflammatory Agents/pharmacology , Anti-Inflammatory Agents/therapeutic use , Peptides/pharmacology , Inflammation/drug therapy , Algorithms , Machine Learning
8.
iScience ; 26(12): 108250, 2023 Dec 15.
Article in English | MEDLINE | ID: mdl-38025779

ABSTRACT

The challenge of drug-resistant bacteria to global public health has led to increased attention on antimicrobial peptides (AMPs) as a targeted therapeutic alternative with a lower risk of resistance. However, high production costs and limitations in functional class prediction have hindered progress in this field. In this study, we used multi-label classifiers with binary relevance and algorithm adaptation techniques to predict different functions of AMPs across a wide range of pathogen categories, including bacteria, mammalian cells, fungi, viruses, and cancer cells. Our classifiers attained promising AUC scores varying from 0.8492 to 0.9126 on independent testing data. Forward feature selection identified sequence order and charge as critical, with specific amino acids (C and E) as discriminative. These findings provide valuable insights for the design of antimicrobial peptides (AMPs) with multiple functionalities, thus contributing to the broader effort to combat drug-resistant pathogens.

9.
Viruses ; 15(10)2023 09 27.
Article in English | MEDLINE | ID: mdl-37896791

ABSTRACT

Cervical cancer, a major health concern among women worldwide, is closely linked to human papillomavirus (HPV) infection. This study explores the evolving landscape of HPV molecular epidemiology in Taiwan over a decade (2010-2020), where prophylactic HPV vaccination has been implemented since 2007. Analyzing data from 40,561 vaginal swab samples, with 42.0% testing positive for HPV, we reveal shifting trends in HPV genotype distribution and infection patterns. The 12 high-risk genotypes, in order of decreasing percentage, were HPV 52, 58, 16, 18, 51, 56, 39, 59, 33, 31, 45, and 35. The predominant genotypes were HPV 52, 58, and 16, accounting for over 70% of cases annually. The proportions of high-risk and non-high-risk HPV infections varied across age groups. High-risk infections predominated in sexually active individuals aged 30-50 and were mixed-type infections. The composition of high-risk HPV genotypes was generally stable over time; however, HPV31, 33, 39, and 51 significantly decreased over the decade. Of the strains, HPV31 and 33 are shielded by the nonavalent HPV vaccine. However, no reduction was noted for the other seven genotypes. This study offers valuable insights into the post-vaccine HPV epidemiology. Future investigations should delve into HPV vaccines' effects and their implications for cervical cancer prevention strategies. These findings underscore the need for continued surveillance and research to guide effective public health interventions targeting HPV-associated diseases.


Subject(s)
Papillomavirus Infections , Papillomavirus Vaccines , Uterine Cervical Neoplasms , Humans , Female , Uterine Cervical Neoplasms/epidemiology , Uterine Cervical Neoplasms/prevention & control , Human Papillomavirus Viruses , Papillomavirus Infections/epidemiology , Papillomavirus Infections/prevention & control , Molecular Epidemiology , Papillomaviridae/genetics , Genotype , Human papillomavirus 31/genetics , Prevalence
10.
Brief Bioinform ; 24(6)2023 09 22.
Article in English | MEDLINE | ID: mdl-37742050

ABSTRACT

The emergence of multidrug-resistant bacteria is a critical global crisis that poses a serious threat to public health, particularly with the rise of multidrug-resistant Staphylococcus aureus. Accurate assessment of drug resistance is essential for appropriate treatment and prevention of transmission of these deadly pathogens. Early detection of drug resistance in patients is critical for providing timely treatment and reducing the spread of multidrug-resistant bacteria. This study aims to develop a novel risk assessment framework for S. aureus that can accurately determine the resistance to multiple antibiotics. The comprehensive 7-year study involved ˃20 000 isolates with susceptibility testing profiles of six antibiotics. By incorporating mass spectrometry and machine learning, the study was able to predict the susceptibility to four different antibiotics with high accuracy. To validate the accuracy of our models, we externally tested on an independent cohort and achieved impressive results with an area under the receiver operating characteristic curve of 0. 94, 0.90, 0.86 and 0.91, and an area under the precision-recall curve of 0.93, 0.87, 0.87 and 0.81, respectively, for oxacillin, clindamycin, erythromycin and trimethoprim-sulfamethoxazole. In addition, the framework evaluated the level of multidrug resistance of the isolates by using the predicted drug resistance probabilities, interpreting them in the context of a multidrug resistance risk score and analyzing the performance contribution of different sample groups. The results of this study provide an efficient method for early antibiotic decision-making and a better understanding of the multidrug resistance risk of S. aureus.


Subject(s)
Methicillin-Resistant Staphylococcus aureus , Staphylococcal Infections , Humans , Staphylococcus aureus , Staphylococcal Infections/drug therapy , Staphylococcal Infections/microbiology , Anti-Bacterial Agents/pharmacology , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Machine Learning , Risk Assessment
11.
Protein Sci ; 32(10): e4758, 2023 10.
Article in English | MEDLINE | ID: mdl-37595093

ABSTRACT

Fungal infections have become a significant global health issue, affecting millions worldwide. Antifungal peptides (AFPs) have emerged as a promising alternative to conventional antifungal drugs due to their low toxicity and low propensity for inducing resistance. In this study, we developed a deep learning-based framework called DeepAFP to efficiently identify AFPs. DeepAFP fully leverages and mines composition information, evolutionary information, and physicochemical properties of peptides by employing combined kernels from multiple branches of convolutional neural network with bi-directional long short-term memory layers. In addition, DeepAFP integrates a transfer learning strategy to obtain efficient representations of peptides for improving model performance. DeepAFP demonstrates strong predictive ability on carefully curated datasets, yielding an accuracy of 93.29% and an F1-score of 93.45% on the DeepAFP-Main dataset. The experimental results show that DeepAFP outperforms existing AFP prediction tools, achieving state-of-the-art performance. Finally, we provide a downloadable AFP prediction tool to meet the demands of large-scale prediction and facilitate the usage of our framework by the public or other researchers. Our framework can accurately identify AFPs in a short time without requiring significant human and material resources, and hence can accelerate the development of AFPs as well as contribute to the treatment of fungal infections. Furthermore, our method can provide new perspectives for other biological sequence analysis tasks.


Subject(s)
Deep Learning , Mycoses , Humans , Algorithms , Antifungal Agents/pharmacology , alpha-Fetoproteins , Peptides/pharmacology , Peptides/chemistry
12.
Int J Mol Sci ; 24(12)2023 Jun 19.
Article in English | MEDLINE | ID: mdl-37373494

ABSTRACT

One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies.


Subject(s)
Neoplasms , Peptides , Humans , Peptides/metabolism , Oligopeptides , Algorithms , Machine Learning
13.
Microbiol Spectr ; 11(3): e0347922, 2023 06 15.
Article in English | MEDLINE | ID: mdl-37042778

ABSTRACT

In clinical microbiology, matrix-assisted laser desorption ionization-time-of-flight mass spectrometry (MALDI-TOF MS) is frequently employed for rapid microbial identification. However, rapid identification of antimicrobial resistance (AMR) in Escherichia coli based on a large amount of MALDI-TOF MS data has not yet been reported. This may be because building a prediction model to cover all E. coli isolates would be challenging given the high diversity of the E. coli population. This study aimed to develop a MALDI-TOF MS-based, data-driven, two-stage framework for characterizing different AMRs in E. coli. Specifically, amoxicillin (AMC), ceftazidime (CAZ), ciprofloxacin (CIP), ceftriaxone (CRO), and cefuroxime (CXM) were used. In the first stage, we split the data into two groups based on informative peaks according to the importance of the random forest. In the second stage, prediction models were constructed using four different machine learning algorithms-logistic regression, support vector machine, random forest, and extreme gradient boosting (XGBoost). The findings demonstrate that XGBoost outperformed the other four machine learning models. The values of the area under the receiver operating characteristic curve were 0.62, 0.72, 0.87, 0.72, and 0.72 for AMC, CAZ, CIP, CRO, and CXM, respectively. This implies that a data-driven, two-stage framework could improve accuracy by approximately 2.8%. As a result, we developed AMR prediction models for E. coli using a data-driven two-stage framework, which is promising for assisting physicians in making decisions. Further, the analysis of informative peaks in future studies could potentially reveal new insights. IMPORTANCE Based on a large amount of matrix-assisted laser desorption ionization-time-of-flight mass spectrometry (MALDI-TOF MS) clinical data, comprising 37,918 Escherichia coli isolates, a data-driven two-stage framework was established to evaluate the antimicrobial resistance of E. coli. Five antibiotics, including amoxicillin (AMC), ceftazidime (CAZ), ciprofloxacin (CIP), ceftriaxone (CRO), and cefuroxime (CXM), were considered for the two-stage model training, and the values of the area under the receiver operating characteristic curve (AUC) were 0.62 for AMC, 0.72 for CAZ, 0.87 for CIP, 0.72 for CRO, and 0.72 for CXM. Further investigations revealed that the informative peak m/z 9714 appeared with some important peaks at m/z 6809, m/z 7650, m/z 10534, and m/z 11783 for CIP and at m/z 6809, m/z 10475, and m/z 8447 for CAZ, CRO, and CXM. This framework has the potential to improve the accuracy by approximately 2.8%, indicating a promising potential for further research.


Subject(s)
Anti-Bacterial Agents , Escherichia coli , Anti-Bacterial Agents/pharmacology , Ceftriaxone/pharmacology , Ceftazidime , Cefuroxime , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Ciprofloxacin , Amoxicillin
14.
Article in English | MEDLINE | ID: mdl-37022368

ABSTRACT

Early diagnosis and treatment can reduce the symptoms of Attention Deficit/Hyperactivity Disorder (ADHD) in children, but medical diagnosis is usually delayed. Hence, it is important to increase the efficiency of early diagnosis. Previous studies used behavioral and neuronal data during GO/NOGO task to help detect ADHD and the accuracy differed considerably from 53% to 92%, depending on the employed methods and the number of electroencephalogram (EEG) channels. It remains unclear whether data from a few EEG channels can still lead to a good accuracy of detecting ADHD. Here, we hypothesize that introducing distractions into a VR-based GO/NOGO task can augment the detection of ADHD using 6-channel EEG because children with ADHD are easily distracted. Forty-nine ADHD children and 32 typically developing children were recruited. We use a clinically applicable system with EEG to record data. Statistical analysis and machine learning methods were employed to analyze the data. The behavioral results revealed significant differences in task performance when there are distractions. The presence of distractions leads to EEG changes in both groups, indicating immaturity in inhibitory control. Importantly, the distractions additionally enhanced the between-group differences in NOGO α and γ power, reflecting insufficient inhibition in different neural networks for distraction suppression in the ADHD group. Machine learning methods further confirmed that distractions enhance the detection of ADHD with an accuracy of 85.45%. In conclusion, this system can assist in fast screenings for ADHD and the findings of neuronal correlates of distractions can help design therapeutic strategies.

15.
Int J Mol Sci ; 24(5)2023 Feb 21.
Article in English | MEDLINE | ID: mdl-36901759

ABSTRACT

Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments.


Subject(s)
Neoplasms , Peptides , Humans , Peptides/chemistry , Algorithms , Amino Acid Sequence , Neural Networks, Computer
16.
Int J Mol Sci ; 24(2)2023 Jan 05.
Article in English | MEDLINE | ID: mdl-36674514

ABSTRACT

Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Different preprocessing methods yield different data, and the optimal approach is unclear. In this study, we adopted an ensemble of multiple preprocessing methods--FlexAnalysis, MALDIquant, and continuous wavelet transform-based methods--to detect peaks and build machine learning classifiers, including logistic regressions, naïve Bayes classifiers, random forests, and a support vector machine. The aim was to identify antibiotic resistance in Acinetobacter baumannii, Acinetobacter nosocomialis, Enterococcus faecium, and Group B Streptococci (GBS) based on MALDI-TOF MS spectra collected from two branches of a referral tertiary medical center. The ensemble method was compared with the individual methods. Random forest models built with the data preprocessed by the ensemble method outperformed individual preprocessing methods and achieved the highest accuracy, with values of 84.37% (A. baumannii), 90.96% (A. nosocomialis), 78.54% (E. faecium), and 70.12% (GBS) on independent testing datasets. Through feature selection, important peaks related to antibiotic resistance could be detected from integrated information. The prediction model can provide an opinion for clinicians. The discriminative peaks enabling better prediction performance can provide a reference for further investigation of the resistance mechanism.


Subject(s)
Acinetobacter Infections , Acinetobacter baumannii , Humans , Anti-Bacterial Agents/pharmacology , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Bayes Theorem , Acinetobacter baumannii/chemistry
17.
Brief Bioinform ; 24(2)2023 03 19.
Article in English | MEDLINE | ID: mdl-36715277

ABSTRACT

N6-methyladinosine (m6A) modification is the most abundant co-transcriptional modification in eukaryotic RNA and plays important roles in cellular regulation. Traditional high-throughput sequencing experiments used to explore functional mechanisms are time-consuming and labor-intensive, and most of the proposed methods focused on limited species types. To further understand the relevant biological mechanisms among different species with the same RNA modification, it is necessary to develop a computational scheme that can be applied to different species. To achieve this, we proposed an attention-based deep learning method, adaptive-m6A, which consists of convolutional neural network, bi-directional long short-term memory and an attention mechanism, to identify m6A sites in multiple species. In addition, three conventional machine learning (ML) methods, including support vector machine, random forest and logistic regression classifiers, were considered in this work. In addition to the performance of ML methods for multi-species prediction, the optimal performance of adaptive-m6A yielded an accuracy of 0.9832 and the area under the receiver operating characteristic curve of 0.98. Moreover, the motif analysis and cross-validation among different species were conducted to test the robustness of one model towards multiple species, which helped improve our understanding about the sequence characteristics and biological functions of RNA modifications in different species.


Subject(s)
Machine Learning , RNA , Base Sequence , RNA/genetics , Neural Networks, Computer
18.
IEEE J Transl Eng Health Med ; 10: 2100811, 2022.
Article in English | MEDLINE | ID: mdl-36457894

ABSTRACT

Virtual reality (VR) has been widely adopted by therapists to provide rich motor training tasks. Time series data of motion trajectory accompanied with the interaction of VR system may contain important clues in regard to the assessment of motor function, however, clinical evaluation scales such as Fugl-Meyer Assessment (FMA), Wolf Motor Function Test (WMFT), and Test D'évaluation Des Membres Supérieurs Des Personnes Âgées (TEMPA) are highly depended in clinic. Further, there is not yet an assessment method that simultaneously consider motion trajectory and clinical evaluation scales. The objective of this study is to establish an evidence-based assessment model by machine-learning method that integrated motion trajectory of a VR task with clinical evaluation scales. In this study, a VR system for upper-limb motor training was proposed for stroke rehabilitation. Clinical trials with 20 stroke patients were performed. A variety of motor indicators that derived via motion trajectory were proposed. The correlations between motor indicators and clinical evaluation scales were examined. Further, motor indicators were integrated with evaluation scales to develop a machine-learning based model that represents an evidence-based motor assessment approach. Clinical evaluation scales, FMA, TEMPA and WMFT, were significantly progressed. A few motor indicators were found significantly correlated with clinical evaluation scales. The accuracy of machine-learning based assessment model was up to 86%. The proposed VR system is validated to be effective in motor rehabilitation. Motor indicators derived from motor trajectory were with potential for clinical motor assessment. Machine learning could be a promising tool to perform automatic assessment. Clinical and Translational Impact Statement-A VR task for motor rehabilitation was exanimated via clinical trials. Integrating motor indices with clinical assessment, a machine-learning model with accuracy of 86% was developed to evaluate motor function.


Subject(s)
Stroke Rehabilitation , Stroke , Virtual Reality , Humans , Upper Extremity , User-Computer Interface
19.
Front Microbiol ; 13: 821233, 2022.
Article in English | MEDLINE | ID: mdl-35756017

ABSTRACT

Matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has recently become a useful analytical approach for microbial identification. The presence and absence of specific peaks on MS spectra are commonly used to identify the bacterial species and predict antibiotic-resistant strains. However, the conventional approach using few single peaks would result in insufficient prediction power without using complete information of whole MS spectra. In the past few years, machine learning algorithms have been successfully applied to analyze the MALDI-TOF MS peaks pattern for rapid strain typing. In this study, we developed a convolutional neural network (CNN) method to deal with the complete information of MALDI-TOF MS spectra for detecting Enterococcus faecium, which is one of the leading pathogens in the world. We developed a CNN model to rapidly and accurately predict vancomycin-resistant Enterococcus faecium (VREfm) samples from the whole mass spectra profiles of clinical samples. The CNN models demonstrated good classification performances with the average area under the receiver operating characteristic curve (AUROC) of 0.887 when using external validation data independently. Additionally, we employed the score-class activation mapping (CAM) method to identify the important features of our CNN models and found some discriminative signals that can substantially contribute to detecting the ion of resistance. This study not only utilized the complete information of MALTI-TOF MS data directly but also provided a practical means for rapid detection of VREfm using a deep learning algorithm.

20.
Front Microbiol ; 13: 853775, 2022.
Article in English | MEDLINE | ID: mdl-35495667

ABSTRACT

Multidrug resistance has become a phenotype that commonly exists among Staphylococcus aureus and is a serious concern for infection treatment. Nowadays, to detect the antibiotic susceptibility, antibiotic testing is generated based on the level of genomic for cure decision consuming huge of time and labor, while matrix-assisted laser desorption-ionization (MALDI) time-of-flight mass spectrometry (TOF/MS) shows its possibility in high-speed and effective detection on the level of proteomic. In this study, on the basis of MALDI-TOF spectra data of discovery cohort with 26,852 samples and replication cohort with 4,963 samples from Taiwan area and their corresponding susceptibilities to oxacillin and clindamycin, a multi-label prediction model against double resistance using Lowest Power set ensemble with XGBoost is constructed for rapid susceptibility prediction. With the output of serial susceptibility prediction, the model performance can realize 77% of accuracy for the serial prediction, the area under the receiver characteristic curve of 0.93 for oxacillin susceptibility prediction, and the area under the receiver characteristic curve of 0.89 for clindamycin susceptibility prediction. The generated multi-label prediction model provides serial antibiotic resistance, such as the susceptibilities of oxacillin and clindamycin in this study, for S. aureus-infected patients based on MALDI-TOF, which will provide guidance in antibiotic usage during the treatment taking the advantage of speed and efficiency.

SELECTION OF CITATIONS
SEARCH DETAIL
...