Your browser doesn't support javascript.
loading
Identifying the kind behind SMILES-anatomical therapeutic chemical classification using structure-only representations.
Cao, Yi; Yang, Zhen-Qun; Zhang, Xu-Lu; Fan, Wenqi; Wang, Yaowei; Shen, Jiajun; Wei, Dong-Qing; Li, Qing; Wei, Xiao-Yong.
Afiliação
  • Cao Y; Department of Computer Science, Sichuan University, 610065, Chengdu, China.
  • Yang ZQ; Department of Biomedical Engineering, Chinese University of Hong Kong, Street, Shatin, Hong Kong.
  • Zhang XL; Department of Computer Science, Sichuan University, 610065, Chengdu, China.
  • Fan W; Department of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong.
  • Wang Y; Peng Cheng Laboratory, 518000, Shenzhen, China.
  • Shen J; TCL AI Research Institute, Hong Kong.
  • Wei DQ; School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
  • Li Q; Department of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong.
  • Wei XY; Department of Computer Science, Sichuan University, 610065, Chengdu, China.
Brief Bioinform ; 23(5)2022 09 20.
Article em En | MEDLINE | ID: mdl-36027578
ABSTRACT
Anatomical Therapeutic Chemical (ATC) classification for compounds/drugs plays an important role in drug development and basic research. However, previous methods depend on interactions extracted from STITCH dataset which may make it depend on lab experiments. We present a pilot study to explore the possibility of conducting the ATC prediction solely based on the molecular structures. The motivation is to eliminate the reliance on the costly lab experiments so that the characteristics of a drug can be pre-assessed for better decision-making and effort-saving before the actual development. To this end, we construct a new benchmark consisting of 4545 compounds which is with larger scale than the one used in previous study. A light-weight prediction model is proposed. The model is with better explainability in the sense that it is consists of a straightforward tokenization that extracts and embeds statistically and physicochemically meaningful tokens, and a deep network backed by a set of pyramid kernels to capture multi-resolution chemical structural characteristics. Its efficacy has been validated in the experiments where it outperforms the state-of-the-art methods by 15.53% in accuracy and by 69.66% in terms of efficiency. We make the benchmark dataset, source code and web server open to ease the reproduction of this study.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Benchmarking Tipo de estudo: Prognostic_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / Benchmarking Tipo de estudo: Prognostic_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China