Your browser doesn't support javascript.
loading
An efficient circRNA-miRNA interaction prediction model by combining biological text mining and wavelet diffusion-based sparse network structure embedding.
Wang, Xin-Fei; Yu, Chang-Qing; You, Zhu-Hong; Qiao, Yan; Li, Zheng-Wei; Huang, Wen-Zhun.
Affiliation
  • Wang XF; School of Information Engineering, Xijing University, Xi'an, China.
  • Yu CQ; School of Information Engineering, Xijing University, Xi'an, China. Electronic address: xaycq@163.com.
  • You ZH; School of Computer Science, Northwestern Polytechnical University, Xi'an, China. Electronic address: zhuhongyou@nwpu.edu.cn.
  • Qiao Y; College of Agriculture and Forestry, Longdong University, Qingyang, China.
  • Li ZW; School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China.
  • Huang WZ; School of Information Engineering, Xijing University, Xi'an, China.
Comput Biol Med ; 165: 107421, 2023 Oct.
Article in En | MEDLINE | ID: mdl-37672925
ABSTRACT
MOTIVATION Accumulating clinical evidence shows that circular RNA (circRNA) plays an important regulatory role in the occurrence and development of human diseases, which is expected to provide a new perspective for the diagnosis and treatment of related diseases. Using computational methods can provide high probability preselection for wet experiments to save resources. However, due to the lack of neighborhood structure in sparse biological networks, the model based on network embedding and graph embedding is difficult to achieve ideal results.

RESULTS:

In this paper, we propose BioDGW-CMI, which combines biological text mining and wavelet diffusion-based sparse network structure embedding to predict circRNA-miRNA interaction (CMI). In detail, BioDGW-CMI first uses the Bidirectional Encoder Representations from Transformers (BERT) for biological text mining to mine hidden features in RNA sequences, then constructs a CMI network, obtains the topological structure embedding of nodes in the network through heat wavelet diffusion patterns. Next, the Denoising autoencoder organically combines the structural features and Gaussian kernel similarity, finally, the feature is sent to lightGBM for training and prediction. BioDGW-CMI achieves the highest prediction performance in all three datasets in the field of CMI prediction. In the case study, all the 8 pairs of CMI based on circ-ITCH were successfully predicted.

AVAILABILITY:

The data and source code can be found at https//github.com/1axin/BioDGW-CMI-model.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Comput Biol Med Year: 2023 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Comput Biol Med Year: 2023 Document type: Article Affiliation country: