Your browser doesn't support javascript.
loading
A Multi-Group Multi-Stream attribute Attention network for fine-grained zero-shot learning.
Song, Lingyun; Shang, Xuequn; Zhou, Ruizhi; Liu, Jun; Ma, Jie; Li, Zhanhuai; Sun, Mingxuan.
Afiliação
  • Song L; School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710129, China. Electronic address: lysong@nwpu.edu.cn.
  • Shang X; School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710129, China. Electronic address: shang@nwpu.edu.cn.
  • Zhou R; School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710129, China. Electronic address: zhouruizhi@mail.nwpu.edu.cn.
  • Liu J; SPKLSTN Lab, Department of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: liukeen@mail.xjtu.edu.cn.
  • Ma J; SPKLSTN Lab, Department of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: jiema@xjtu.edu.cn.
  • Li Z; School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710129, China. Electronic address: lizhh@mail.nwpu.edu.cn.
  • Sun M; Division of Computer Science and Engineering, School of Electrical Engineering and Computer Science, Louisiana State University, 70803, USA. Electronic address: msun@csc.lsu.edu.
Neural Netw ; 179: 106558, 2024 Jul 20.
Article em En | MEDLINE | ID: mdl-39089147
ABSTRACT
Fine-grained visual categorization in zero-shot setting is a challenging problem in the computer vision community. It requires algorithms to accurately identify fine-grained categories that do not appear during the training phase and have high visual similarity to each other. Existing methods usually address this problem by using attribute information as intermediate knowledge, which provides sufficient fine-grained characteristics of categories and can be transferred from seen categories to unseen categories. However, the learning of attribute visual features is not trivial due to the following two reasons (i) The visual information about attributes of different types may interfere with the visual feature learning of each other. (ii) The visual characteristics of the same attribute may vary in different categories. To solve these issues, we propose a Multi-Group Multi-Stream attribute Attention network (MGMSA), which not only separates the feature learning of attributes of different types, but also isolates the learning of attribute visual features for categories with big differences in attribute appearance. This avoids the interference between uncorrelated attributes and helps to learn category-specific attribute-related visual features. This is beneficial for distinguishing fine-grained categories with subtle visual differences. Extensive experiments on benchmark datasets show that MGMSA achieves state-of-the-art performance on attribute prediction and fine-grained zero-shot learning.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article