Search | VHL Regional Portal

1.

SEnSCA: Identifying possible ligand-receptor interactions and its application in cell-cell communication inference.

Zhou, Liqian; Wang, Xiwen; Peng, Lihong; Chen, Min; Wen, Hong.

J Cell Mol Med ; 28(9): e18372, 2024 May.

Article in English | MEDLINE | ID: mdl-38747737

ABSTRACT

Multicellular organisms have dense affinity with the coordination of cellular activities, which severely depend on communication across diverse cell types. Cell-cell communication (CCC) is often mediated via ligand-receptor interactions (LRIs). Existing CCC inference methods are limited to known LRIs. To address this problem, we developed a comprehensive CCC analysis tool SEnSCA by integrating single cell RNA sequencing and proteome data. SEnSCA mainly contains potential LRI acquisition and CCC strength evaluation. For acquiring potential LRIs, it first extracts LRI features and reduces the feature dimension, subsequently constructs negative LRI samples through K-means clustering, finally acquires potential LRIs based on Stacking ensemble comprising support vector machine, 1D-convolutional neural networks and multi-head attention mechanism. During CCC strength evaluation, SEnSCA conducts LRI filtering and then infers CCC by combining the three-point estimation approach and single cell RNA sequencing data. SEnSCA computed better precision, recall, accuracy, F1 score, AUC and AUPR under most of conditions when predicting possible LRIs. To better illustrate the inferred CCC network, SEnSCA provided three visualization options: heatmap, bubble diagram and network diagram. Its application on human melanoma tissue demonstrated its reliability in CCC detection. In summary, SEnSCA offers a useful CCC inference tool and is freely available at https://github.com/plhhnu/SEnSCA.

Subject(s)

Cell Communication , Single-Cell Analysis , Humans , Ligands , Single-Cell Analysis/methods , Software , Computational Biology/methods , Algorithms , Support Vector Machine , Sequence Analysis, RNA/methods , Melanoma/metabolism , Melanoma/pathology , Melanoma/genetics , Proteome/metabolism , Neural Networks, Computer

2.

Finding potential lncRNA-disease associations using a boosting-based ensemble learning model.

Zhou, Liqian; Peng, Xinhuai; Zeng, Lijun; Peng, Lihong.

Front Genet ; 15: 1356205, 2024.

Article in English | MEDLINE | ID: mdl-38495672

ABSTRACT

Introduction: Long non-coding RNAs (lncRNAs) have been in the clinical use as potential prognostic biomarkers of various types of cancer. Identifying associations between lncRNAs and diseases helps capture the potential biomarkers and design efficient therapeutic options for diseases. Wet experiments for identifying these associations are costly and laborious. Methods: We developed LDA-SABC, a novel boosting-based framework for lncRNA-disease association (LDA) prediction. LDA-SABC extracts LDA features based on singular value decomposition (SVD) and classifies lncRNA-disease pairs (LDPs) by incorporating LightGBM and AdaBoost into the convolutional neural network. Results: The LDA-SABC performance was evaluated under five-fold cross validations (CVs) on lncRNAs, diseases, and LDPs. It obviously outperformed four other classical LDA inference methods (SDLDA, LDNFSGB, LDASR, and IPCAF) through precision, recall, accuracy, F1 score, AUC, and AUPR. Based on the accurate LDA prediction performance of LDA-SABC, we used it to find potential lncRNA biomarkers for lung cancer. The results elucidated that 7SK and HULC could have a relationship with non-small-cell lung cancer (NSCLC) and lung adenocarcinoma (LUAD), respectively. Conclusion: We hope that our proposed LDA-SABC method can help improve the LDA identification.

3.

Role of SARSCoV2 nucleocapsid protein in affecting immune cells and insights on its molecular mechanisms.

Lu, Yan; Ye, Ziyu; Liu, Xinlan; Zhou, Liqian; Ding, Xiang; Hou, Yiling.

Exp Ther Med ; 26(5): 504, 2023 Nov.

Article in English | MEDLINE | ID: mdl-37822585

ABSTRACT

The present study aimed to explore the immune regulatory function of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) nucleocapsid (N) protein and related mechanisms. In a series of protein activity experiments, SARS-CoV-2 N protein promoted proliferation of three immune cell lines: mouse Raw264.7, human Jurkat and human Raji in a dose-dependent manner. A total of 10 µg/ml N protein could significantly change cell cycle progression of the aforementioned three immune cell lines and could promote quick entry of Raw264.7 cells into G2/M phase from S phase to achieve rapid growth. Additionally, the N protein could also stimulate Raw264.7 cells to secrete a number of proinflammatory factors such as TNF-α, IL-6 and IL-10. RNA sequencing analysis indicated that the N protein changed the expression of certain genes involved in immune-related functions and four important signaling pathways, including JAK-STAT, TNF, NF-κB and MAPK signaling pathways, which suggested that the N protein may not only regulate the expression of genes involved in the process of resisting viral infection in macrophages of the immune system, but also change cellular signal processing.

4.

Identifying potential drug-target interactions based on ensemble deep learning.

Zhou, Liqian; Wang, Yuzhuang; Peng, Lihong; Li, Zejun; Luo, Xueming.

Front Aging Neurosci ; 15: 1176400, 2023.

Article in English | MEDLINE | ID: mdl-37396659

ABSTRACT

Introduction: Drug-target interaction prediction is one important step in drug research and development. Experimental methods are time consuming and laborious. Methods: In this study, we developed a novel DTI prediction method called EnGDD by combining initial feature acquisition, dimensional reduction, and DTI classification based on Gradient boosting neural network, Deep neural network, and Deep Forest. Results: EnGDD was compared with seven stat-of-the-art DTI prediction methods (BLM-NII, NRLMF, WNNGIP, NEDTP, DTi2Vec, RoFDT, and MolTrans) on the nuclear receptor, GPCR, ion channel, and enzyme datasets under cross validations on drugs, targets, and drug-target pairs, respectively. EnGDD computed the best recall, accuracy, F1-score, AUC, and AUPR under the majority of conditions, demonstrating its powerful DTI identification performance. EnGDD predicted that D00182 and hsa2099, D07871 and hsa1813, DB00599 and hsa2562, D00002 and hsa10935 have a higher interaction probabilities among unknown drug-target pairs and may be potential DTIs on the four datasets, respectively. In particular, D00002 (Nadide) was identified to interact with hsa10935 (Mitochondrial peroxiredoxin3) whose up-regulation might be used to treat neurodegenerative diseases. Finally, EnGDD was used to find possible drug targets for Parkinson's disease and Alzheimer's disease after confirming its DTI identification performance. The results show that D01277, D04641, and D08969 may be applied to the treatment of Parkinson's disease through targeting hsa1813 (dopamine receptor D2) and D02173, D02558, and D03822 may be the clues of treatment for patients with Alzheimer's disease through targeting hsa5743 (prostaglandinendoperoxide synthase 2). The above prediction results need further biomedical validation. Discussion: We anticipate that our proposed EnGDD model can help discover potential therapeutic clues for various diseases including neurodegenerative diseases.

5.

Editorial: Machine learning-based methods for RNA data analysis-Volume II.

Peng, Lihong; Yang, Jialiang; Wang, Minxian; Zhou, Liqian.

Front Genet ; 13: 1010089, 2022.

Article in English | MEDLINE | ID: mdl-36523762

6.

Editorial: Machine Learning-Based Methods for RNA Data Analysis.

Peng, Lihong; Yang, Jialiang; Wang, Minxian; Zhou, Liqian.

Front Genet ; 13: 828575, 2022.

Article in English | MEDLINE | ID: mdl-35692815

7.

Cell-cell communication inference and analysis in the tumour microenvironments from single-cell transcriptomics: data resources and computational strategies.

Peng, Lihong; Wang, Feixiang; Wang, Zhao; Tan, Jingwei; Huang, Li; Tian, Xiongfei; Liu, Guangyi; Zhou, Liqian.

Brief Bioinform ; 23(4)2022 07 18.

Article in English | MEDLINE | ID: mdl-35753695

ABSTRACT

Carcinomas are complex ecosystems composed of cancer, stromal and immune cells. Communication between these cells and their microenvironments induces cancer progression and causes therapy resistance. In order to improve the treatment of cancers, it is essential to quantify crosstalk between and within various cell types in a tumour microenvironment. Focusing on the coordinated expression patterns of ligands and cognate receptors, cell-cell communication can be inferred through ligand-receptor interactions (LRIs). In this manuscript, we carry out the following work: (i) introduce pipeline for ligand-receptor-mediated intercellular communication estimation from single-cell transcriptomics and list a few available LRI-related databases and visualization tools; (ii) demonstrate seven classical intercellular communication scoring strategies, highlight four types of representative intercellular communication inference methods, including network-based approaches, machine learning-based approaches, spatial information-based approaches and other approaches; (iii) summarize the evaluation and validation avenues for intercellular communication inference and analyze the advantages and limitations for the above four types of cell-cell communication methods; (iv) comment several major challenges while provide further research directions for intercellular communication analysis in the tumour microenvironments. We anticipate that this work helps to better understand intercellular crosstalk and to further develop powerful cell-cell communication estimation tools for tumor-targeted therapy.

Subject(s)

Neoplasms , Tumor Microenvironment , Cell Communication , Ecosystem , Humans , Ligands , Neoplasms/metabolism , Transcriptome

8.

Discovery of Potential Therapeutic Drugs for COVID-19 Through Logistic Matrix Factorization With Kernel Diffusion.

Tian, Xiongfei; Shen, Ling; Gao, Pengfei; Huang, Li; Liu, Guangyi; Zhou, Liqian; Peng, Lihong.

Front Microbiol ; 13: 740382, 2022.

Article in English | MEDLINE | ID: mdl-35295301

ABSTRACT

Coronavirus disease 2019 (COVID-19) is rapidly spreading. Researchers around the world are dedicated to finding the treatment clues for COVID-19. Drug repositioning, as a rapid and cost-effective way for finding therapeutic options from available FDA-approved drugs, has been applied to drug discovery for COVID-19. In this study, we develop a novel drug repositioning method (VDA-KLMF) to prioritize possible anti-SARS-CoV-2 drugs integrating virus sequences, drug chemical structures, known Virus-Drug Associations, and Logistic Matrix Factorization with Kernel diffusion. First, Gaussian kernels of viruses and drugs are built based on known VDAs and nearest neighbors. Second, sequence similarity kernel of viruses and chemical structure similarity kernel of drugs are constructed based on biological features and an identity matrix. Third, Gaussian kernel and similarity kernel are diffused. Forth, a logistic matrix factorization model with kernel diffusion is proposed to identify potential anti-SARS-CoV-2 drugs. Finally, molecular dockings between the inferred antiviral drugs and the junction of SARS-CoV-2 spike protein-ACE2 interface are implemented to investigate the binding abilities between them. VDA-KLMF is compared with two state-of-the-art VDA prediction models (VDA-KATZ and VDA-RWR) and three classical association prediction methods (NGRHMDA, LRLSHMDA, and NRLMF) based on 5-fold cross validations on viruses, drugs, and VDAs on three datasets. It obtains the best recalls, AUCs, and AUPRs, significantly outperforming other five methods under the three different cross validations. We observe that four chemical agents coming together on any two datasets, that is, remdesivir, ribavirin, nitazoxanide, and emetine, may be the clues of treatment for COVID-19. The docking results suggest that the key residues K353 and G496 may affect the binding energies and dynamics between the inferred anti-SARS-CoV-2 chemical agents and the junction of the spike protein-ACE2 interface. Integrating various biological data, Gaussian kernel, similarity kernel, and logistic matrix factorization with kernel diffusion, this work demonstrates that a few chemical agents may assist in drug discovery for COVID-19.

9.

EnANNDeep: An Ensemble-based lncRNA-protein Interaction Prediction Framework with Adaptive k-Nearest Neighbor Classifier and Deep Models.

Peng, Lihong; Tan, Jingwei; Tian, Xiongfei; Zhou, Liqian.

Interdiscip Sci ; 14(1): 209-232, 2022 Mar.

Article in English | MEDLINE | ID: mdl-35006529

ABSTRACT

lncRNA-protein interactions (LPIs) prediction can deepen the understanding of many important biological processes. Artificial intelligence methods have reported many possible LPIs. However, most computational techniques were evaluated mainly on one dataset, which may produce prediction bias. More importantly, they were validated only under cross validation on lncRNA-protein pairs, and did not consider the performance under cross validations on lncRNAs and proteins, thus fail to search related proteins/lncRNAs for a new lncRNA/protein. Under an ensemble learning framework (EnANNDeep) composed of adaptive k-nearest neighbor classifier and Deep models, this study focuses on systematically finding underlying linkages between lncRNAs and proteins. First, five LPI-related datasets are arranged. Second, multiple source features are integrated to depict an lncRNA-protein pair. Third, adaptive k-nearest neighbor classifier, deep neural network, and deep forest are designed to score unknown lncRNA-protein pairs, respectively. Finally, interaction probabilities from the three predictors are integrated based on a soft voting technique. In comparing to five classical LPI identification models (SFPEL, PMDKN, CatBoost, PLIPCOM, and LPI-SKF) under fivefold cross validations on lncRNAs, proteins, and LPIs, EnANNDeep computes the best average AUCs of 0.8660, 0.8775, and 0.9166, respectively, and the best average AUPRs of 0.8545, 0.8595, and 0.9054, respectively, indicating its superior LPI prediction ability. Case study analyses indicate that SNHG10 may have dense linkage with Q15717. In the ensemble framework, adaptive k-nearest neighbor classifier can separately pick the most appropriate k for each query lncRNA-protein pair. More importantly, deep models including deep neural network and deep forest can effectively learn the representative features of lncRNAs and proteins.

Subject(s)

RNA, Long Noncoding , Artificial Intelligence , Computational Biology/methods , Machine Learning , Neural Networks, Computer , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism

10.

Finding lncRNA-Protein Interactions Based on Deep Learning With Dual-Net Neural Architecture.

Peng, Lihong; Wang, Chang; Tian, Xiongfei; Zhou, Liqian; Li, Keqin.

IEEE/ACM Trans Comput Biol Bioinform ; 19(6): 3456-3468, 2022.

Article in English | MEDLINE | ID: mdl-34587091

ABSTRACT

The identification of lncRNA-protein interactions (LPIs) is important to understand the biological functions and molecular mechanisms of lncRNAs. However, most computational models are evaluated on a unique dataset, thereby resulting in prediction bias. Furthermore, previous models have not uncovered potential proteins (or lncRNAs) interacting with a new lncRNA (or protein). Finally, the performance of these models can be improved. In this study, we develop a Deep Learning framework with Dual-net Neural architecture to find potential LPIs (LPI-DLDN). First, five LPI datasets are collected. Second, the features of lncRNAs and proteins are extracted by Pyfeat and BioTriangle, respectively. Third, these features are concatenated as a vector after dimension reduction. Finally, a deep learning model with dual-net neural architecture is designed to classify lncRNA-protein pairs. LPI-DLDN is compared with six state-of-the-art LPI prediction methods (LPI-XGBoost, LPI-HeteSim, LPI-NRLMF, PLIPCOM, LPI-CNNCP, and Capsule-LPI) under four cross validations. The results demonstrate the powerful LPI classification performance of LPI-DLDN. Case study analyses show that there may be interactions between RP11-439E19.10 and Q15717, and between RP11-196G18.22 and Q9NUL5. The novelty of LPI-DLDN remains, integrating various biological features, designing a novel deep learning-based LPI identification framework, and selecting the optimal LPI feature subset based on feature importance ranking.

Subject(s)

Deep Learning , RNA, Long Noncoding , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , Computational Biology/methods

11.

VDA-RWLRLS: An anti-SARS-CoV-2 drug prioritizing framework combining an unbalanced bi-random walk and Laplacian regularized least squares.

Shen, Ling; Liu, Fuxing; Huang, Li; Liu, Guangyi; Zhou, Liqian; Peng, Lihong.

Comput Biol Med ; 140: 105119, 2021 Dec 07.

Article in English | MEDLINE | ID: mdl-34902608

ABSTRACT

BACKGROUND: A new coronavirus disease named COVID-19, caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), is rapidly spreading worldwide. However, there is currently no effective drug to fight COVID-19. METHODS: In this study, we developed a Virus-Drug Association (VDA) identification framework (VDA-RWLRLS) combining unbalanced bi-Random Walk, Laplacian Regularized Least Squares, molecular docking, and molecular dynamics simulation to find clues for the treatment of COVID-19. First, virus similarity and drug similarity are computed based on genomic sequences, chemical structures, and Gaussian association profiles. Second, an unbalanced bi-random walk is implemented on the virus network and the drug network, respectively. Third, the results of the random walks are taken as the input of Laplacian regularized least squares to compute the association score for each virus-drug pair. Fourth, the final associations are characterized by integrating the predictions from the virus network and the drug network. Finally, molecular docking and molecular dynamics simulation are implemented to measure the potential of screened anti-COVID-19 drugs and further validate the predicted results. RESULTS: In comparison with six state-of-the-art association prediction models (NGRHMDA, SMiR-NBI, LRLSHMDA, VDA-KATZ, VDA-RWR, and VDA-BiRW), VDA-RWLRLS demonstrates superior VDA prediction performance. It obtains the best AUCs of 0.885 8, 0.835 5, and 0.862 5 on the three VDA datasets. Molecular docking and dynamics simulations demonstrated that remdesivir and ribavirin may be potential anti-COVID-19 drugs. CONCLUSIONS: Integrating unbalanced bi-random walks, Laplacian regularized least squares, molecular docking, and molecular dynamics simulation, this work initially screened a few anti-SARS-CoV-2 drugs and may contribute to preventing COVID-19 transmission.

12.

LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification.

Peng, Lihong; Yuan, Ruya; Shen, Ling; Gao, Pengfei; Zhou, Liqian.

BioData Min ; 14(1): 50, 2021 Dec 03.

Article in English | MEDLINE | ID: mdl-34861891

ABSTRACT

BACKGROUND: Long noncoding RNAs (lncRNAs) have dense linkages with various biological processes. Identifying interacting lncRNA-protein pairs contributes to understand the functions and mechanisms of lncRNAs. Wet experiments are costly and time-consuming. Most computational methods failed to observe the imbalanced characterize of lncRNA-protein interaction (LPI) data. More importantly, they were measured based on a unique dataset, which produced the prediction bias. RESULTS: In this study, we develop an Ensemble framework (LPI-EnEDT) with Extra tree and Decision Tree classifiers to implement imbalanced LPI data classification. First, five LPI datasets are arranged. Second, lncRNAs and proteins are separately characterized based on Pyfeat and BioTriangle and concatenated as a vector to represent each lncRNA-protein pair. Finally, an ensemble framework with Extra tree and decision tree classifiers is developed to classify unlabeled lncRNA-protein pairs. The comparative experiments demonstrate that LPI-EnEDT outperforms four classical LPI prediction methods (LPI-BLS, LPI-CatBoost, LPI-SKF, and PLIPCOM) under cross validations on lncRNAs, proteins, and LPIs. The average AUC values on the five datasets are 0.8480, 0,7078, and 0.9066 under the three cross validations, respectively. The average AUPRs are 0.8175, 0.7265, and 0.8882, respectively. Case analyses suggest that there are underlying associations between HOTTIP and Q9Y6M1, NRON and Q15717. CONCLUSIONS: Fusing diverse biological features of lncRNAs and proteins and exploiting an ensemble learning model with Extra tree and decision tree classifiers, this work focus on imbalanced LPI data classification as well as interaction information inference for a new lncRNA (or protein).

13.

LPI-HyADBS: a hybrid framework for lncRNA-protein interaction prediction integrating feature selection and classification.

Zhou, Liqian; Duan, Qi; Tian, Xiongfei; Xu, He; Tang, Jianxin; Peng, Lihong.

BMC Bioinformatics ; 22(1): 568, 2021 Nov 26.

Article in English | MEDLINE | ID: mdl-34836494

ABSTRACT

BACKGROUND: Long noncoding RNAs (lncRNAs) have dense linkages with a plethora of important cellular activities. lncRNAs exert functions by linking with corresponding RNA-binding proteins. Since experimental techniques to detect lncRNA-protein interactions (LPIs) are laborious and time-consuming, a few computational methods have been reported for LPI prediction. However, computation-based LPI identification methods have the following limitations: (1) Most methods were evaluated on a single dataset, and researchers may thus fail to measure their generalization ability. (2) The majority of methods were validated under cross validation on lncRNA-protein pairs, did not investigate the performance under other cross validations, especially for cross validation on independent lncRNAs and independent proteins. (3) lncRNAs and proteins have abundant biological information, how to select informative features need to further investigate. RESULTS: Under a hybrid framework (LPI-HyADBS) integrating feature selection based on AdaBoost, and classification models including deep neural network (DNN), extreme gradient Boost (XGBoost), and SVM with a penalty Coefficient of misclassification (C-SVM), this work focuses on finding new LPIs. First, five datasets are arranged. Each dataset contains lncRNA sequences, protein sequences, and an LPI network. Second, biological features of lncRNAs and proteins are acquired based on Pyfeat. Third, the obtained features of lncRNAs and proteins are selected based on AdaBoost and concatenated to depict each LPI sample. Fourth, DNN, XGBoost, and C-SVM are used to classify lncRNA-protein pairs based on the concatenated features. Finally, a hybrid framework is developed to integrate the classification results from the above three classifiers. LPI-HyADBS is compared to six classical LPI prediction approaches (LPI-SKF, LPI-NRLMF, Capsule-LPI, LPI-CNNCP, LPLNP, and LPBNI) on five datasets under 5-fold cross validations on lncRNAs, proteins, lncRNA-protein pairs, and independent lncRNAs and independent proteins. The results show LPI-HyADBS has the best LPI prediction performance under four different cross validations. In particular, LPI-HyADBS obtains better classification ability than other six approaches under the constructed independent dataset. Case analyses suggest that there is relevance between ZNF667-AS1 and Q15717. CONCLUSIONS: Integrating feature selection approach based on AdaBoost, three classification techniques including DNN, XGBoost, and C-SVM, this work develops a hybrid framework to identify new linkages between lncRNAs and proteins.

Subject(s)

RNA, Long Noncoding , Amino Acid Sequence , Computational Biology , Neural Networks, Computer , RNA, Long Noncoding/genetics , RNA-Binding Proteins/metabolism

14.

LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification.

Zhou, Liqian; Wang, Zhao; Tian, Xiongfei; Peng, Lihong.

BMC Bioinformatics ; 22(1): 479, 2021 Oct 04.

Article in English | MEDLINE | ID: mdl-34607567

ABSTRACT

BACKGROUND: Long noncoding RNAs (lncRNAs) play important roles in various biological and pathological processes. Discovery of lncRNA-protein interactions (LPIs) contributes to understand the biological functions and mechanisms of lncRNAs. Although wet experiments find a few interactions between lncRNAs and proteins, experimental techniques are costly and time-consuming. Therefore, computational methods are increasingly exploited to uncover the possible associations. However, existing computational methods have several limitations. First, majority of them were measured based on one simple dataset, which may result in the prediction bias. Second, few of them are applied to identify relevant data for new lncRNAs (or proteins). Finally, they failed to utilize diverse biological information of lncRNAs and proteins. RESULTS: Under the feed-forward deep architecture based on gradient boosting decision trees (LPI-deepGBDT), this work focuses on classify unobserved LPIs. First, three human LPI datasets and two plant LPI datasets are arranged. Second, the biological features of lncRNAs and proteins are extracted by Pyfeat and BioProt, respectively. Thirdly, the features are dimensionally reduced and concatenated as a vector to represent an lncRNA-protein pair. Finally, a deep architecture composed of forward mappings and inverse mappings is developed to predict underlying linkages between lncRNAs and proteins. LPI-deepGBDT is compared with five classical LPI prediction models (LPI-BLS, LPI-CatBoost, PLIPCOM, LPI-SKF, and LPI-HNM) under three cross validations on lncRNAs, proteins, lncRNA-protein pairs, respectively. It obtains the best average AUC and AUPR values under the majority of situations, significantly outperforming other five LPI identification methods. That is, AUCs computed by LPI-deepGBDT are 0.8321, 0.6815, and 0.9073, respectively and AUPRs are 0.8095, 0.6771, and 0.8849, respectively. The results demonstrate the powerful classification ability of LPI-deepGBDT. Case study analyses show that there may be interactions between GAS5 and Q15717, RAB30-AS1 and O00425, and LINC-01572 and P35637. CONCLUSIONS: Integrating ensemble learning and hierarchical distributed representations and building a multiple-layered deep architecture, this work improves LPI prediction performance as well as effectively probes interaction data for new lncRNAs/proteins.

Subject(s)

RNA, Long Noncoding , Computational Biology , Decision Trees , Humans , Plants , RNA, Long Noncoding/genetics

15.

Screening Potential Drugs for COVID-19 Based on Bound Nuclear Norm Regularization.

Wang, Juanjuan; Wang, Chang; Shen, Ling; Zhou, Liqian; Peng, Lihong.

Front Genet ; 12: 749256, 2021.

Article in English | MEDLINE | ID: mdl-34691157

ABSTRACT

The novel coronavirus pneumonia COVID-19 infected by SARS-CoV-2 has attracted worldwide attention. It is urgent to find effective therapeutic strategies for stopping COVID-19. In this study, a Bounded Nuclear Norm Regularization (BNNR) method is developed to predict anti-SARS-CoV-2 drug candidates. First, three virus-drug association datasets are compiled. Second, a heterogeneous virus-drug network is constructed. Third, complete genomic sequences and Gaussian association profiles are integrated to compute virus similarities; chemical structures and Gaussian association profiles are integrated to calculate drug similarities. Fourth, a BNNR model based on kernel similarity (VDA-GBNNR) is proposed to predict possible anti-SARS-CoV-2 drugs. VDA-GBNNR is compared with four existing advanced methods under fivefold cross-validation. The results show that VDA-GBNNR computes better AUCs of 0.8965, 0.8562, and 0.8803 on the three datasets, respectively. There are 6 anti-SARS-CoV-2 drugs overlapping in any two datasets, that is, remdesivir, favipiravir, ribavirin, mycophenolic acid, niclosamide, and mizoribine. Molecular dockings are conducted for the 6 small molecules and the junction of SARS-CoV-2 spike protein and human angiotensin-converting enzyme 2. In particular, niclosamide and mizoribine show higher binding energy of -8.06 and -7.06 kcal/mol with the junction, respectively. G496 and K353 may be potential key residues between anti-SARS-CoV-2 drugs and the interface junction. We hope that the predicted results can contribute to the treatment of COVID-19.

16.

A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.

Tian, Xiongfei; Shen, Ling; Wang, Zhenwu; Zhou, Liqian; Peng, Lihong.

Sci Rep ; 11(1): 18881, 2021 09 23.

Article in English | MEDLINE | ID: mdl-34556758

ABSTRACT

Long noncoding RNAs (lncRNAs) regulate many biological processes by interacting with corresponding RNA-binding proteins. The identification of lncRNA-protein Interactions (LPIs) is significantly important to well characterize the biological functions and mechanisms of lncRNAs. Existing computational methods have been effectively applied to LPI prediction. However, the majority of them were evaluated only on one LPI dataset, thereby resulting in prediction bias. More importantly, part of models did not discover possible LPIs for new lncRNAs (or proteins). In addition, the prediction performance remains limited. To solve with the above problems, in this study, we develop a Deep Forest-based LPI prediction method (LPIDF). First, five LPI datasets are obtained and the corresponding sequence information of lncRNAs and proteins are collected. Second, features of lncRNAs and proteins are constructed based on four-nucleotide composition and BioSeq2vec with encoder-decoder structure, respectively. Finally, a deep forest model with cascade forest structure is developed to find new LPIs. We compare LPIDF with four classical association prediction models based on three fivefold cross validations on lncRNAs, proteins, and LPIs. LPIDF obtains better average AUCs of 0.9012, 0.6937 and 0.9457, and the best average AUPRs of 0.9022, 0.6860, and 0.9382, respectively, for the three CVs, significantly outperforming other methods. The results show that the lncRNA FTX may interact with the protein P35637 and needs further validation.

Subject(s)

Machine Learning , RNA, Long Noncoding/metabolism , RNA-Binding Proteins/metabolism , Arabidopsis , Computational Biology/methods , Datasets as Topic , Humans , Zea mays

17.

Prioritizing antiviral drugs against SARS-CoV-2 by integrating viral complete genome sequences and drug chemical structures.

Peng, Lihong; Shen, Ling; Xu, Junlin; Tian, Xiongfei; Liu, Fuxing; Wang, Juanjuan; Tian, Geng; Yang, Jialiang; Zhou, Liqian.

Sci Rep ; 11(1): 6248, 2021 03 18.

Article in English | MEDLINE | ID: mdl-33737523

ABSTRACT

The outbreak of a novel febrile respiratory disease called COVID-19, caused by a newfound coronavirus SARS-CoV-2, has brought a worldwide attention. Prioritizing approved drugs is critical for quick clinical trials against COVID-19. In this study, we first manually curated three Virus-Drug Association (VDA) datasets. By incorporating VDAs with the similarity between drugs and that between viruses, we constructed a heterogeneous Virus-Drug network. A novel Random Walk with Restart method (VDA-RWR) was then developed to identify possible VDAs related to SARS-CoV-2. We compared VDA-RWR with three state-of-the-art association prediction models based on fivefold cross-validations (CVs) on viruses, drugs and virus-drug associations on three datasets. VDA-RWR obtained the best AUCs for the three fivefold CVs, significantly outperforming other methods. We found two small molecules coming together on the three datasets, that is, remdesivir and ribavirin. These two chemical agents have higher molecular binding energies of - 7.0 kcal/mol and - 6.59 kcal/mol with the domain bound structure of the human receptor angiotensin converting enzyme 2 (ACE2) and the SARS-CoV-2 spike protein, respectively. Interestingly, for the first time, experimental results suggested that navitoclax could be potentially applied to stop SARS-CoV-2 and remains to further validation.

Subject(s)

Adenosine Monophosphate/analogs & derivatives , Alanine/analogs & derivatives , Angiotensin-Converting Enzyme 2/chemistry , Antiviral Agents/chemistry , Ribavirin/chemistry , Spike Glycoprotein, Coronavirus/chemistry , Adenosine Monophosphate/chemistry , Alanine/chemistry , Aniline Compounds/chemistry , Drug Evaluation, Preclinical , Genome, Viral , Molecular Docking Simulation , SARS-CoV-2/genetics , Sulfonamides/chemistry

18.

Self-assembling a natural small molecular inhibitor that shows aggregation-induced emission and potentiates antitumor efficacy.

Chen, Xiaona; Hu, Zhen; Zhou, Liqian; Zhang, Fu; Wan, Jianqin; Wang, Hangxiang.

Nanoscale Horiz ; 6(1): 33-42, 2021 01 05.

Article in English | MEDLINE | ID: mdl-33210687

ABSTRACT

Targeted therapy using small molecular inhibitors has been developed to rewire key signaling pathways in tumor cells, but these inhibitors have had mixed success in the clinic due to their poor pharmaceutical properties and suboptimal intratumoral concentrations. Here, we developed a "self-assembling natural molecular inhibitor" strategy to test the efficacy and feasibility of the water-insoluble agent dasatinib (DAS), a tyrosine kinase inhibitor, for cancer therapy. By exploiting a facile reprecipitation protocol, the DAS inhibitor self-assembled into soluble supramolecular nanoparticles (termed sDNPs) in aqueous solution, without an exogenous excipient. This strategy is applicable for generating systemically injectable and colloid-stable therapeutic nanoparticles of hydrophobic small-molecule inhibitors. Concurrently, during this process, we observed aggregation-induced emission (AIE) of fluorescence for this self-assembled DAS, which makes sDNPs suitable for bioimaging and tracing of cellular trafficking. Notably, in an orthotopic model of breast cancer, administration of sDNPs induced a durable inhibition of primary tumors and reduced the metastatic tumor burden, significantly surpassing the effects of the free DAS inhibitor after oral delivery. In addition, low toxicity was observed for this platform, with effective avoidance of immunotoxicity. To the best of our knowledge, our studies provide the first successful demonstration of self-assembling natural molecular inhibitors with AIE and highlight the feasibility of this approach for the preparation of therapeutic nanoparticles for highly lethal human cancers and many other diseases.

Subject(s)

Breast Neoplasms/drug therapy , Dasatinib/administration & dosage , Drug Delivery Systems , Nanoparticles/chemistry , Antineoplastic Agents/administration & dosage , Antineoplastic Agents/therapeutic use , Cell Line, Tumor , Dasatinib/therapeutic use , Female , Humans , Protein Kinase Inhibitors/administration & dosage , Protein Kinase Inhibitors/therapeutic use

19.

RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization.

Peng, Lihong; Shen, Ling; Liao, Longjie; Liu, Guangyi; Zhou, Liqian.

Front Microbiol ; 11: 592430, 2020.

Article in English | MEDLINE | ID: mdl-33193260

ABSTRACT

Microbes with abnormal levels have important impacts on the formation and development of various complex diseases. Identifying possible Microbe-Disease Associations (MDAs) helps to understand the mechanisms of complex diseases. However, experimental methods for MDA identification are costly and time-consuming. In this study, a new computational model, RNMFMDA, was developed to find possible MDAs. RNMFMDA contains two main processes. First, Reliable Negative MDA samples were selected based on Positive-Unlabeled (PU) learning and random walk with restart on the heterogeneous microbe-disease network. Second, Logistic Matrix Factorization with Neighborhood Regularization (LMFNR) was developed to compute the association probabilities for all microbe-disease pairs. To evaluate the performance of the proposed RNMFMDA method, we compared RNMFMDA with five state-of-the-art MDA prediction methods based on five-fold cross-validations on microbes, diseases, and MDAs. As a result, RNMFMDA obtained the best AUCs of 0.6332, 0.8669, and 0.9081, respectively for the three five-fold cross validations, significantly outperforming other models. The promising prediction performance may be attributed to the following three features: highly quality negative MDA sample selection, LMFNR-based MDA prediction model, and various biological information integration. In addition, a few predicted microbe-disease pairs with high association scores are worthy of further experimental validation.

20.

Identifying Effective Antiviral Drugs Against SARS-CoV-2 by Drug Repositioning Through Virus-Drug Association Prediction.

Peng, Lihong; Tian, Xiongfei; Shen, Ling; Kuang, Ming; Li, Tianbao; Tian, Geng; Yang, Jialiang; Zhou, Liqian.

Front Genet ; 11: 577387, 2020.

Article in English | MEDLINE | ID: mdl-33193695

ABSTRACT

A new coronavirus called SARS-CoV-2 is rapidly spreading around the world. Over 16,558,289 infected cases with 656,093 deaths have been reported by July 29th, 2020, and it is urgent to identify effective antiviral treatment. In this study, potential antiviral drugs against SARS-CoV-2 were identified by drug repositioning through Virus-Drug Association (VDA) prediction. 96 VDAs between 11 types of viruses similar to SARS-CoV-2 and 78 small molecular drugs were extracted and a novel VDA identification model (VDA-RLSBN) was developed to find potential VDAs related to SARS-CoV-2. The model integrated the complete genome sequences of the viruses, the chemical structures of drugs, a regularized least squared classifier (RLS), a bipartite local model, and the neighbor association information. Compared with five state-of-the-art association prediction methods, VDA-RLSBN obtained the best AUC of 0.9085 and AUPR of 0.6630. Ribavirin was predicted to be the best small molecular drug, with a higher molecular binding energy of -6.39 kcal/mol with human angiotensin-converting enzyme 2 (ACE2), followed by remdesivir (-7.4 kcal/mol), mycophenolic acid (-5.35 kcal/mol), and chloroquine (-6.29 kcal/mol). Ribavirin, remdesivir, and chloroquine have been under clinical trials or supported by recent works. In addition, for the first time, our results suggested several antiviral drugs, such as FK506, with molecular binding energies of -11.06 and -10.1 kcal/mol with ACE2 and the spike protein, respectively, could be potentially used to prevent SARS-CoV-2 and remains to further validation. Drug repositioning through virus-drug association prediction can effectively find potential antiviral drugs against SARS-CoV-2.

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL