Your browser doesn't support javascript.
loading
Research on image content description in Chinese based on fusion of image global and local features.
Kong, Dongyi; Zhao, Hong; Zeng, Xiangyan.
Afiliação
  • Kong D; School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China.
  • Zhao H; School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China.
  • Zeng X; Department of Mathematics and Computer Science, Fort Valley State University, Fort Valley, GA, United States of America.
PLoS One ; 17(8): e0271322, 2022.
Article em En | MEDLINE | ID: mdl-36037226
ABSTRACT
Most image content modelling methods are designed for English description which is different form Chinese in syntax structure. The few existing Chinese image description models do not fully integrate the global features and the local features of an image, limiting the capability of the models to represent the details of the image. In this paper, an encoder-decoder architecture based on the fusion of global and local features is used to describe the Chinese image content. In the encoding stage, the global and local features of the image are extracted by the Convolutional Neural Network (CNN) and the target detection network, and fed to the feature fusion module. In the decoding stage, an image feature attention mechanism is used to calculate the weights of word vectors, and a new gating mechanism is added to the traditional Long Short-Term Memory (LSTM) network to emphasize the fused image features, and the corresponding word vectors. In the description generation stage, the beam search algorithm is used to optimize the word vector generation process. The integration of global and local features of the image is strengthened to allow the model to fully understand the details of the image through the above three stages. The experimental results show that the model improves the quality of Chinese description of image content. Compared with the baseline model, the score of CIDEr evaluation index improves by 20.07%, and other evaluation indices also improve significantly.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Redes Neurais de Computação Tipo de estudo: Prognostic_studies País/Região como assunto: Asia Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Redes Neurais de Computação Tipo de estudo: Prognostic_studies País/Região como assunto: Asia Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China