Your browser doesn't support javascript.
loading
Data-balanced transformer for accelerated ionizable lipid nanoparticles screening in mRNA delivery.
Wu, Kun; Yang, Xiulong; Wang, Zixu; Li, Na; Zhang, Jialu; Liu, Lizhuang.
Afiliação
  • Wu K; Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China.
  • Yang X; University of Chinese Academy of Sciences, Beijing 100049, China.
  • Wang Z; Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China.
  • Li N; University of Chinese Academy of Sciences, Beijing 100049, China.
  • Zhang J; Department of Computer Science, University of Tsukuba, Tsukuba 3058577, Japan.
  • Liu L; National Facility for Protein Science in Shanghai, Zhangjiang Laboratory, Shanghai Advanced Research Institute, Chinese Academy of Sciences.
Brief Bioinform ; 25(3)2024 Mar 27.
Article em En | MEDLINE | ID: mdl-38670158
ABSTRACT
Despite the widespread use of ionizable lipid nanoparticles (LNPs) in clinical applications for messenger RNA (mRNA) delivery, the mRNA drug delivery system faces an efficient challenge in the screening of LNPs. Traditional screening methods often require a substantial amount of experimental time and incur high research and development costs. To accelerate the early development stage of LNPs, we propose TransLNP, a transformer-based transfection prediction model designed to aid in the selection of LNPs for mRNA drug delivery systems. TransLNP uses two types of molecular information to perceive the relationship between structure and transfection efficiency coarse-grained atomic sequence information and fine-grained atomic spatial relationship information. Due to the scarcity of existing LNPs experimental data, we find that pretraining the molecular model is crucial for better understanding the task of predicting LNPs properties, which is achieved through reconstructing atomic 3D coordinates and masking atom predictions. In addition, the issue of data imbalance is particularly prominent in the real-world exploration of LNPs. We introduce the BalMol block to solve this problem by smoothing the distribution of labels and molecular features. Our approach outperforms state-of-the-art works in transfection property prediction under both random and scaffold data splitting. Additionally, we establish a relationship between molecular structural similarity and transfection differences, selecting 4267 pairs of molecular transfection cliffs, which are pairs of molecules that exhibit high structural similarity but significant differences in transfection efficiency, thereby revealing the primary source of prediction errors. The code, model and data are made publicly available at https//github.com/wklix/TransLNP.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: RNA Mensageiro / Nanopartículas / Lipídeos / Lipossomos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: RNA Mensageiro / Nanopartículas / Lipídeos / Lipossomos Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article