Your browser doesn't support javascript.
loading
Weighted matrix factorization on multi-relational data for LncRNA-disease association prediction.
Wang, Yuehui; Yu, Guoxian; Wang, Jun; Fu, Guangyuan; Guo, Maozu; Domeniconi, Carlotta.
Afiliação
  • Wang Y; College of Computer and Information Sciences, Southwest University, Chongqing, China. Electronic address: yhwang@email.swu.edu.cn.
  • Yu G; College of Computer and Information Sciences, Southwest University, Chongqing, China; Hubei Key Laboratory of Intelligent Geo-Information Processing, China University of Geosciences, Wuhan, China; Chongqing Key Laboratory of Plant Resource Conservation and Germplasm Innovation, Southwest University,
  • Wang J; College of Computer and Information Sciences, Southwest University, Chongqing, China. Electronic address: kingjun@swu.edu.cn.
  • Fu G; College of Computer and Information Sciences, Southwest University, Chongqing, China. Electronic address: fugy@email.swu.edu.cn.
  • Guo M; College of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing, China. Electronic address: guomaozu@bucea.edu.cn.
  • Domeniconi C; Department of Computer Science, George Mason University, Fairfax, VA, USA. Electronic address: carlotta@cs.gmu.edu.
Methods ; 173: 32-43, 2020 02 15.
Article em En | MEDLINE | ID: mdl-31226302
ABSTRACT
Influx evidences show that red long non-coding RNAs (lncRNAs) play important roles in various critical biological processes, and they afffect the development and progression of various human diseases. Therefore, it is necessary to precisely identify the lncRNA-disease associations. The identification precision can be improved by developing data integrative models. However, current models mainly need to project heterogeneous data onto the homologous networks, and then merge these networks into a composite one for integrative prediction. We recognize that this projection overrides the individual structure of the heterogeneous data, and the combination is impacted by noisy networks. As a result, the performance is compromised. Given that, we introduce a weighted matrix factorization model on multi-relational data to predict LncRNA-disease associations (WMFLDA). WMFLDA firstly uses a heterogeneous network to capture the inter(intra)-associations between different types of nodes (including genes, lncRNAs, and Disease Ontology terms). Then, it presets weights to these inter-association and intra-association matrices of the network, and cooperatively decomposes these matrices into low-rank ones to explore the underlying relationships between nodes. Next, it jointly optimizes the low-rank matrices and the weights. After that, WMFLDA approximates the lncRNA-disease association matrix using the optimized matrices and weights, and thus to achieve the prediction. WMFLDA obtains a much better performance than related data integrative solutions across different experiment settings and evaluation metrics. It can not only respect the intrinsic structures of individual data sources, but can also fuse them with selection.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biologia Computacional / Predisposição Genética para Doença / RNA Longo não Codificante Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Methods Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biologia Computacional / Predisposição Genética para Doença / RNA Longo não Codificante Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Methods Ano de publicação: 2020 Tipo de documento: Article