Your browser doesn't support javascript.
loading
Multi-label zero-shot learning with graph convolutional networks.
Ou, Guangjin; Yu, Guoxian; Domeniconi, Carlotta; Lu, Xuequan; Zhang, Xiangliang.
Afiliação
  • Ou G; School of Software, Shandong University, Jinan, China; College of Computer and Information Sciences, Southwest University, Chongqing, China. Electronic address: gjou@swu.edu.cn.
  • Yu G; School of Software, Shandong University, Jinan, China; College of Computer and Information Sciences, Southwest University, Chongqing, China; CEMSE, King Abdullah University of Science and Technology, Thuwal, SA, Saudi Arabia. Electronic address: guoxian85@gmail.com.
  • Domeniconi C; Department of Computer Science, George Mason University, Fairfax, VA, USA. Electronic address: carlotta@cs.gmu.edu.
  • Lu X; School of Information Technology, Deakin University, Australia. Electronic address: xuequan.lu@deakin.edu.au.
  • Zhang X; CEMSE, King Abdullah University of Science and Technology, Thuwal, SA, Saudi Arabia. Electronic address: xiangliang.zhang@kaust.edu.sa.
Neural Netw ; 132: 333-341, 2020 Dec.
Article em En | MEDLINE | ID: mdl-32977278
ABSTRACT
The goal of zero-shot learning (ZSL) is to build a classifier that recognizes novel categories with no corresponding annotated training data. The typical routine is to transfer knowledge from seen classes to unseen ones by learning a visual-semantic embedding. Existing multi-label zero-shot learning approaches either ignore correlations among labels, suffer from large label combinations, or learn the embedding using only local or global visual features. In this paper, we propose a Graph Convolution Networks based Multi-label Zero-Shot Learning model, abbreviated as MZSL-GCN. Our model first constructs a label relation graph using label co-occurrences and compensates the absence of unseen labels in the training phase by semantic similarity. It then takes the graph and the word embedding of each seen (unseen) label as inputs to the GCN to learn the label semantic embedding, and to obtain a set of inter-dependent object classifiers. MZSL-GCN simultaneously trains another attention network to learn compatible local and global visual features of objects with respect to the classifiers, and thus makes the whole network end-to-end trainable. In addition, the use of unlabeled training data can reduce the bias toward seen labels and boost the generalization ability. Experimental results on benchmark datasets show that our MZSL-GCN competes with state-of-the-art approaches.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Reconhecimento Automatizado de Padrão / Redes Neurais de Computação / Aprendizado de Máquina Limite: Humans Idioma: En Revista: Neural Netw Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Reconhecimento Automatizado de Padrão / Redes Neurais de Computação / Aprendizado de Máquina Limite: Humans Idioma: En Revista: Neural Netw Ano de publicação: 2020 Tipo de documento: Article