Your browser doesn't support javascript.
loading
Cross-Modal Retrieval Between 13C NMR Spectra and Structures Based on Focused Libraries.
Sun, Hanyu; Xue, Xi; Liu, Xue; Hu, Hai-Yu; Deng, Yafeng; Wang, Xiaojian.
  • Sun H; State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, PR China.
  • Xue X; Beijing Key Laboratory of Active Substances Discovery and Druggability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, PR China.
  • Liu X; State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, PR China.
  • Hu HY; State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, PR China.
  • Deng Y; State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, PR China.
  • Wang X; CarbonSilicon AI Technology Co., Ltd., Beijing 100080, China.
Anal Chem ; 96(15): 5763-5770, 2024 04 16.
Article en En | MEDLINE | ID: mdl-38564366
ABSTRACT
Library matching by comparing carbon-13 nuclear magnetic resonance (13C NMR) spectra with spectral data in the library is a crucial method for compound identification. In our previous paper, we introduced a deep contrastive learning system called CReSS, which used a library that contained more structures. However, CReSS has two

limitations:

there were no unknown structures in the library, and a redundant library reduces the structure-elucidation accuracy. Herein, we replaced the oversize traditional libraries with focused libraries containing a small number of molecules. A previously generative model, CMGNet, was used to generate focused libraries for CReSS. The combined model achieved a Top-10 accuracy of 54.03% when tested on 6,471 13C NMR spectra. In comparison, CReSS with a random reference structure library achieved an accuracy of only 9.17%. Furthermore, to expand the advantages of the focused libraries, we proposed SAmpRNN, which is a recurrent neural network (RNN). With the large focused library amplified by SAmpRNN, the structure-identification accuracy of the model increased in 70.0% of the 30 random example cases. In general, cross-modal retrieval between 13C NMR spectra and structures based on focused libraries (CFLS) achieved high accuracy and provided more accurate candidate structures than traditional libraries for compound identification.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Imagen por Resonancia Magnética Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Imagen por Resonancia Magnética Idioma: En Año: 2024 Tipo del documento: Article