Search | VHL Regional Portal

KS-CMI: A circRNA-miRNA interaction prediction method based on the signed graph neural network and denoising autoencoder.

Wang, Xin-Fei; Yu, Chang-Qing; You, Zhu-Hong; Qiao, Yan; Li, Zheng-Wei; Huang, Wen-Zhun; Zhou, Ji-Ren; Jin, Hai-Yan.

iScience ; 26(8): 107478, 2023 Aug 18.

Article in English | MEDLINE | ID: mdl-37583550

ABSTRACT

Circular RNA (circRNA) plays an important role in the diagnosis, treatment, and prognosis of human diseases. The discovery of potential circRNA-miRNA interactions (CMI) is of guiding significance for subsequent biological experiments. Limited by the small amount of experimentally supported data and high randomness, existing models are difficult to accomplish the CMI prediction task based on real cases. In this paper, we propose KS-CMI, a novel method for effectively accomplishing CMI prediction in real cases. KS-CMI enriches the 'behavior relationships' of molecules by constructing circRNA-miRNA-cancer (CMCI) networks and extracts the behavior relationship attribute of molecules based on balance theory. Next, the denoising autoencoder (DAE) is used to enhance the feature representation of molecules. Finally, the CatBoost classifier was used for prediction. KS-CMI achieved the most reliable prediction results in real cases and achieved competitive performance in all datasets in the CMI prediction.

SMMDA: Predicting miRNA-Disease Associations by Incorporating Multiple Similarity Profiles and a Novel Disease Representation.

Ji, Bo-Ya; Pan, Liang-Rui; Zhou, Ji-Ren; You, Zhu-Hong; Peng, Shao-Liang.

Biology (Basel) ; 11(5)2022 May 20.

Article in English | MEDLINE | ID: mdl-35625505

ABSTRACT

Increasing evidence has suggested that microRNAs (miRNAs) are significant in research on human diseases. Predicting possible associations between miRNAs and diseases would provide new perspectives on disease diagnosis, pathogenesis, and gene therapy. However, considering the intrinsic time-consuming and expensive cost of traditional Vitro studies, there is an urgent need for a computational approach that would allow researchers to identify potential associations between miRNAs and diseases for further research. In this paper, we presented a novel computational method called SMMDA to predict potential miRNA-disease associations. In particular, SMMDA first utilized a new disease representation method (MeSHHeading2vec) based on the network embedding algorithm and then fused it with Gaussian interaction profile kernel similarity information of miRNAs and diseases, disease semantic similarity, and miRNA functional similarity. Secondly, SMMDA utilized a deep auto-coder network to transform the original features further to achieve a better feature representation. Finally, the ensemble learning model, XGBoost, was used as the underlying training and prediction method for SMMDA. In the results, SMMDA acquired a mean accuracy of 86.68% with a standard deviation of 0.42% and a mean AUC of 94.07% with a standard deviation of 0.23%, outperforming many previous works. Moreover, we also compared the predictive ability of SMMDA with different classifiers and different feature descriptors. In the case studies of three common Human diseases, the top 50 candidate miRNAs have 47 (esophageal neoplasms), 48 (breast neoplasms), and 48 (colon neoplasms) are successfully verified by two other databases. The experimental results proved that SMMDA has a reliable prediction ability in predicting potential miRNA-disease associations. Therefore, it is anticipated that SMMDA could be an effective tool for biomedical researchers.

Prediction of lncRNA-disease associations via an embedding learning HOPE in heterogeneous information networks.

Zhou, Ji-Ren; You, Zhu-Hong; Cheng, Li; Ji, Bo-Ya.

Mol Ther Nucleic Acids ; 23: 277-285, 2021 Mar 05.

Article in English | MEDLINE | ID: mdl-33425486

ABSTRACT

Uncovering additional long non-coding RNA (lncRNA)-disease associations has become increasingly important for developing treatments for complex human diseases. Identification of lncRNA biomarkers and lncRNA-disease associations is central to diagnoses and treatment. However, traditional experimental methods are expensive and time-consuming. Enormous amounts of data present in public biological databases are available for computational methods used to predict lncRNA-disease associations. In this study, we propose a novel computational method to predict lncRNA-disease associations. More specifically, a heterogeneous network is first constructed by integrating the associations among microRNA (miRNA), lncRNA, protein, drug, and disease, Second, high-order proximity preserved embedding (HOPE) was used to embed nodes into a network. Finally, the rotation forest classifier was adopted to train the prediction model. In the 5-fold cross-validation experiment, the area under the curve (AUC) of our method achieved 0.8328 ± 0.0236. We compare it with the other four classifiers, in which the proposed method remarkably outperformed other comparison methods. Otherwise, we constructed three case studies for three excess death rate cancers, respectively. The results show that 9 (lung cancer, gastric cancer, and hepatocellular carcinomas) out of the top 15 predicted disease-related lncRNAs were confirmed by our method. In conclusion, our method could predict the unknown lncRNA-disease associations effectively.

MISSIM: An Incremental Learning-Based Model With Applications to the Prediction of miRNA-Disease Association.

Zheng, Kai; You, Zhu-Hong; Wang, Lei; Li, Yi-Ran; Zhou, Ji-Ren; Zeng, Hai-Tao.

IEEE/ACM Trans Comput Biol Bioinform ; 18(5): 1733-1742, 2021.

Article in English | MEDLINE | ID: mdl-32749964

ABSTRACT

In the past few years, the prediction models have shown remarkable performance in most biological correlation prediction tasks. These tasks traditionally use a fixed dataset, and the model, once trained, is deployed as is. These models often encounter training issues such as sensitivity to hyperparameter tuning and "catastrophic forgetting" when adding new data. However, with the development of biomedicine and the accumulation of biological data, new predictive models are required to face the challenge of adapting to change. To this end, we propose a computational approach based on Broad learning system (BLS) to predict potential disease-associated miRNAs that retain the ability to distinguish prior training associations when new data need to be adapted. In particular, we are introducing incremental learning to the field of biological association prediction for the first time and proposed a new method for quantifying sequence similarity. In the performance evaluation, the AUC in the 5-fold cross-validation was 0.9400 +/- 0.0041. To better assess the effectiveness of MISSIM, we compared it with various classifiers and former prediction models. Its performance is superior to the previous method. Besides, the case study on identifying miRNAs associated with breast neoplasms, lung neoplasms and esophageal neoplasms show that 34, 36 and 35 out of the top 40 associations predicted by MISSIM are confirmed by recent biomedical resources. These results provide ample convincing evidence of this approach have potential value and prospect in promoting biomedical research productivity.

Subject(s)

Computational Biology/methods , Genetic Predisposition to Disease/genetics , Machine Learning , MicroRNAs/genetics , Algorithms , Humans , Neoplasms/genetics , Neoplasms/metabolism , Transcriptome/genetics

Predicting miRNA-disease association from heterogeneous information network with GraRep embedding model.

Ji, Bo-Ya; You, Zhu-Hong; Cheng, Li; Zhou, Ji-Ren; Alghazzawi, Daniyal; Li, Li-Ping.

Sci Rep ; 10(1): 6658, 2020 04 20.

Article in English | MEDLINE | ID: mdl-32313121

ABSTRACT

In recent years, accumulating evidences have shown that microRNA (miRNA) plays an important role in the exploration and treatment of diseases, so detection of the associations between miRNA and disease has been drawn more and more attentions. However, traditional experimental methods have the limitations of high cost and time- consuming, a computational method can help us more systematically and effectively predict the potential miRNA-disease associations. In this work, we proposed a novel network embedding-based heterogeneous information integration method to predict miRNA-disease associations. More specifically, a heterogeneous information network is constructed by combining the known associations among lncRNA, drug, protein, disease, and miRNA. After that, the network embedding method Learning Graph Representations with Global Structural Information (GraRep) is employed to learn embeddings of nodes in heterogeneous information network. In this way, the embedding representations of miRNA and disease are integrated with the attribute information of miRNA and disease (e.g. miRNA sequence information and disease semantic similarity) to represent miRNA-disease association pairs. Finally, the Random Forest (RF) classifier is used for predicting potential miRNA-disease associations. Under the 5-fold cross validation, our method obtained 85.11% prediction accuracy with 80.41% sensitivity at the AUC of 91.25%. In addition, in case studies of three major Human diseases, 45 (Colon Neoplasms), 42 (Breast Neoplasms) and 44 (Esophageal Neoplasms) of top-50 predicted miRNAs are respectively verified by other miRNA-disease association databases. In conclusion, the experimental results suggest that our method can be a powerful and useful tool for predicting potential miRNA-disease associations.

Subject(s)

Breast Neoplasms/genetics , Colonic Neoplasms/genetics , Esophageal Neoplasms/genetics , MicroRNAs/genetics , RNA, Circular/genetics , RNA, Long Noncoding/genetics , RNA, Messenger/genetics , RNA, Neoplasm/genetics , Algorithms , Antineoplastic Agents/metabolism , Antineoplastic Agents/pharmacokinetics , Breast Neoplasms/diagnosis , Breast Neoplasms/drug therapy , Breast Neoplasms/pathology , Colonic Neoplasms/diagnosis , Colonic Neoplasms/drug therapy , Colonic Neoplasms/pathology , Computational Biology/methods , Databases, Genetic , Decision Trees , Esophageal Neoplasms/diagnosis , Esophageal Neoplasms/drug therapy , Esophageal Neoplasms/pathology , Female , Humans , Male , MicroRNAs/classification , MicroRNAs/metabolism , Models, Genetic , RNA, Circular/classification , RNA, Circular/metabolism , RNA, Long Noncoding/classification , RNA, Long Noncoding/metabolism , RNA, Messenger/classification , RNA, Messenger/metabolism , RNA, Neoplasm/classification , RNA, Neoplasm/metabolism

RPI-SE: a stacking ensemble learning framework for ncRNA-protein interactions prediction using sequence information.

Yi, Hai-Cheng; You, Zhu-Hong; Wang, Mei-Neng; Guo, Zhen-Hao; Wang, Yan-Bin; Zhou, Ji-Ren.

BMC Bioinformatics ; 21(1): 60, 2020 Feb 18.

Article in English | MEDLINE | ID: mdl-32070279

ABSTRACT

BACKGROUND: The interactions between non-coding RNAs (ncRNA) and proteins play an essential role in many biological processes. Several high-throughput experimental methods have been applied to detect ncRNA-protein interactions. However, these methods are time-consuming and expensive. Accurate and efficient computational methods can assist and accelerate the study of ncRNA-protein interactions. RESULTS: In this work, we develop a stacking ensemble computational framework, RPI-SE, for effectively predicting ncRNA-protein interactions. More specifically, to fully exploit protein and RNA sequence feature, Position Weight Matrix combined with Legendre Moments is applied to obtain protein evolutionary information. Meanwhile, k-mer sparse matrix is employed to extract efficient feature of ncRNA sequences. Finally, an ensemble learning framework integrated different types of base classifier is developed to predict ncRNA-protein interactions using these discriminative features. The accuracy and robustness of RPI-SE was evaluated on three benchmark data sets under five-fold cross-validation and compared with other state-of-the-art methods. CONCLUSIONS: The results demonstrate that RPI-SE is competent for ncRNA-protein interactions prediction task with high accuracy and robustness. It's anticipated that this work can provide a computational prediction tool to advance ncRNA-protein interactions related biomedical research.

Subject(s)

RNA, Untranslated/metabolism , RNA-Binding Proteins/metabolism , Sequence Analysis, Protein/methods , Sequence Analysis, RNA/methods , Position-Specific Scoring Matrices , RNA, Untranslated/chemistry , RNA-Binding Proteins/chemistry

An Improved Deep Forest Model for Predicting Self-Interacting Proteins From Protein Sequence Using Wavelet Transformation.

Chen, Zhan-Heng; Li, Li-Ping; He, Zhou; Zhou, Ji-Ren; Li, Yangming; Wong, Leon.

Front Genet ; 10: 90, 2019.

Article in English | MEDLINE | ID: mdl-30881376

ABSTRACT

Self-interacting proteins (SIPs), whose more than two identities can interact with each other, play significant roles in the understanding of cellular process and cell functions. Although a number of experimental methods have been designed to detect the SIPs, they remain to be extremely time-consuming, expensive, and challenging even nowadays. Therefore, there is an urgent need to develop the computational methods for predicting SIPs. In this study, we propose a deep forest based predictor for accurate prediction of SIPs using protein sequence information. More specifically, a novel feature representation method, which integrate position-specific scoring matrix (PSSM) with wavelet transform, is introduced. To evaluate the performance of the proposed method, cross-validation tests are performed on two widely used benchmark datasets. The experimental results show that the proposed model achieved high accuracies of 95.43 and 93.65% on human and yeast datasets, respectively. The AUC value for evaluating the performance of the proposed method was also reported. The AUC value for yeast and human datasets are 0.9203 and 0.9586, respectively. To further show the advantage of the proposed method, it is compared with several existing methods. The results demonstrate that the proposed model is better than other SIPs prediction methods. This work can offer an effective architecture to biologists in detecting new SIPs.

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL