Your browser doesn't support javascript.
loading
SG-ATT: A Sequence Graph Cross-Attention Representation Architecture for Molecular Property Prediction.
Hao, Yajie; Chen, Xing; Fei, Ailu; Jia, Qifeng; Chen, Yu; Shao, Jinsong; Pandiyan, Sanjeevi; Wang, Li.
Afiliação
  • Hao Y; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Chen X; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Fei A; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Jia Q; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Chen Y; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Shao J; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Pandiyan S; School of Information Science and Technology, Nantong University, Nantong 226001, China.
  • Wang L; School of Information Science and Technology, Nantong University, Nantong 226001, China.
Molecules ; 29(2)2024 Jan 19.
Article em En | MEDLINE | ID: mdl-38276570
ABSTRACT
Existing formats based on the simplified molecular input line entry system (SMILES) encoding and molecular graph structure are designed to encode the complete semantic and structural information of molecules. However, the physicochemical properties of molecules are complex, and a single encoding of molecular features from SMILES sequences or molecular graph structures cannot adequately represent molecular information. Aiming to address this problem, this study proposes a sequence graph cross-attention (SG-ATT) representation architecture for a molecular property prediction model to efficiently use domain knowledge to enhance molecular graph feature encoding and combine the features of molecular SMILES sequences. The SG-ATT fuses the two-dimensional molecular features so that the current model input molecular information contains molecular structure information and semantic information. The SG-ATT was tested on nine molecular property prediction tasks. Among them, the biggest SG-ATT model performance improvement was 4.5% on the BACE dataset, and the average model performance improvement was 1.83% on the full dataset. Additionally, specific model interpretability studies were conducted to showcase the performance of the SG-ATT model on different datasets. In-depth analysis was provided through case studies of in vitro validation. Finally, network tools for molecular property prediction were developed for the use of researchers.
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Molecules Assunto da revista: BIOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Molecules Assunto da revista: BIOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China