Revisiting the fragility of influence functions.

Epifano, Jacob R; Ramachandran, Ravi P; Masino, Aaron J; Rasool, Ghulam

Epifano, Jacob R; Ramachandran, Ravi P; Masino, Aaron J; Rasool, Ghulam.

Afiliación

Epifano JR; Rowan University, Department of Electrical and Computer Engineering, 201 Mullica Hill Rd, Glassboro, 08028, NJ, USA. Electronic address: jrepifano@gmail.com.
Ramachandran RP; Rowan University, Department of Electrical and Computer Engineering, 201 Mullica Hill Rd, Glassboro, 08028, NJ, USA.
Masino AJ; University of Pennsylvania Perelman School of Medicine, Department of Biostatistics, Epidemiology, Informatics, 423 Guardian Drive, Philadelphia, 19104, PA, USA.
Rasool G; Moffitt Cancer Center, Department of Machine Learning, 12902 USF Magnolia Drive, Tampa, 33612, FL, USA.

Neural Netw ; 162: 581-588, 2023 May.

Article en En | MEDLINE | ID: mdl-37011460

RESUMEN

In the last few years, many works have tried to explain the predictions of deep learning models. Few methods, however, have been proposed to verify the accuracy or faithfulness of these explanations. Recently, influence functions, which is a method that approximates the effect that leave-one-out training has on the loss function, has been shown to be fragile. The proposed reason for their fragility remains unclear. Although previous work suggests the use of regularization to increase robustness, this does not hold in all cases. In this work, we seek to investigate the experiments performed in the prior work in an effort to understand the underlying mechanisms of influence function fragility. First, we verify influence functions using procedures from the literature under conditions where the convexity assumptions of influence functions are met. Then, we relax these assumptions and study the effects of non-convexity by using deeper models and more complex datasets. Here, we analyze the key metrics and procedures that are used to validate influence functions. Our results indicate that the validation procedures may cause the observed fragility.

Palabras clave

Bayesian neural networks; Deep learning; Explainable AI; Influence functions; Machine learning; Supervised learning

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Neural Netw Asunto de la revista: NEUROLOGIA Año: 2023 Tipo del documento: Article

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google