Generalizability and robustness evaluation of attribute-based zero-shot learning.

Rossi, Luca; Fiorentino, Maria Chiara; Mancini, Adriano; Paolanti, Marina; Rosati, Riccardo; Zingaretti, Primo

Rossi, Luca; Fiorentino, Maria Chiara; Mancini, Adriano; Paolanti, Marina; Rosati, Riccardo; Zingaretti, Primo.

Afiliação

Rossi L; Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: l.rossi@pm.univpm.it.
Fiorentino MC; Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: m.c.fiorentino@pm.univpm.it.
Mancini A; Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: a.mancini@univpm.it.
Paolanti M; Dipartimento di Scienze politiche, della Comunicazione e delle Relazioni Internazionali, Università di Macerata, 62100, Macerata, Italy. Electronic address: marina.paolanti@unimc.it.
Rosati R; Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: r.rosati@pm.univpm.it.
Zingaretti P; Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy. Electronic address: p.zingaretti@univpm.it.

Neural Netw ; 175: 106278, 2024 Jul.

Article em En | MEDLINE | ID: mdl-38581809

ABSTRACT

ABSTRACT

In the field of deep learning, large quantities of data are typically required to effectively train models. This challenge has given rise to techniques like zero-shot learning (ZSL), which trains models on a set of "seen" classes and evaluates them on a set of "unseen" classes. Although ZSL has shown considerable potential, particularly with the employment of generative methods, its generalizability to real-world scenarios remains uncertain. The hypothesis of this work is that the performance of ZSL models is systematically influenced by the chosen "splits"; in particular, the statistical properties of the classes and attributes used in training. In this paper, we test this hypothesis by introducing the concepts of generalizability and robustness in attribute-based ZSL and carry out a variety of experiments to stress-test ZSL models against different splits. Our aim is to lay the groundwork for future research on ZSL models' generalizability, robustness, and practical applications. We evaluate the accuracy of state-of-the-art models on benchmark datasets and identify consistent trends in generalizability and robustness. We analyze how these properties vary based on the dataset type, differentiating between coarse- and fine-grained datasets, and our findings indicate significant room for improvement in both generalizability and robustness. Furthermore, our results demonstrate the effectiveness of dimensionality reduction techniques in improving the performance of state-of-the-art models in fine-grained datasets.

Assuntos

Aprendizado Profundo; Redes Neurais de Computação; Humanos; Algoritmos; Aprendizado de Máquina

Palavras-chave

Evaluation metrics; Generalizability; Robustness; Zero-shot learning

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Aprendizado Profundo Idioma: En Ano de publicação: 2024 Tipo de documento: Article