GIT-Mol: A multi-modal large language model for molecular science with graph, image, and text.

Liu, Pengfei; Ren, Yiming; Tao, Jun; Ren, Zhixiang

Affiliation

Liu P; Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China; School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou, 510006, Guangdong Province, China.
Ren Y; Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China.
Tao J; School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou, 510006, Guangdong Province, China.
Ren Z; Peng Cheng Laboratory, Shenzhen, 518055, Guangdong Province, China. Electronic address: renzhx@pcl.ac.cn.

Comput Biol Med; 171: 108073, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38359660

ABSTRACT

Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules. However, most existing language models cannot capture the rich information in complex molecular structures or images. In this paper, we introduce GIT-Mol, a multi-modal large language model that integrates graph, image, and text information. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture capable of aligning all modalities into a unified latent space. Compared to the baselines, we achieve a 5%-10% accuracy increase in property prediction and a 20.2% boost in molecule generation validity. With the any-to-language molecular translation strategy, our model has the potential to perform more downstream tasks, such as compound name recognition and chemical reaction prediction.

Subjects

Language; Natural Language Processing

Keywords

Large language model; Molecular representation; Molecule generation; Multi-modality

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Natural Language Processing / Language | Study type: Prognostic_studies | Language: En | Journal: Comput Biol Med | Publication year: 2024 | Document type: Article | Country of affiliation: China
