Búsqueda | Portal de Búsqueda de la BVS Ecuador

1.

Toward Learning Trustworthily from Data Combining Privacy, Fairness, and Explainability: An Application to Face Recognition.

Franco, Danilo; Oneto, Luca; Navarin, Nicolò; Anguita, Davide.

Entropy (Basel) ; 23(8)2021 Aug 14.

Artículo en Inglés | MEDLINE | ID: mdl-34441187

RESUMEN

In many decision-making scenarios, ranging from recreational activities to healthcare and policing, the use of artificial intelligence coupled with the ability to learn from historical data is becoming ubiquitous. This widespread adoption of automated systems is accompanied by the increasing concerns regarding their ethical implications. Fundamental rights, such as the ones that require the preservation of privacy, do not discriminate based on sensible attributes (e.g., gender, ethnicity, political/sexual orientation), or require one to provide an explanation for a decision, are daily undermined by the use of increasingly complex and less understandable yet more accurate learning algorithms. For this purpose, in this work, we work toward the development of systems able to ensure trustworthiness by delivering privacy, fairness, and explainability by design. In particular, we show that it is possible to simultaneously learn from data while preserving the privacy of the individuals thanks to the use of Homomorphic Encryption, ensuring fairness by learning a fair representation from the data, and ensuring explainable decisions with local and global explanations without compromising the accuracy of the final models. We test our approach on a widespread but still controversial application, namely face recognition, using the recent FairFace dataset to prove the validity of our approach.

2.

Spectral Analysis of Electricity Demand Using Hilbert-Huang Transform.

Luque, Joaquin; Anguita, Davide; Pérez, Francisco; Denda, Robert.

Sensors (Basel) ; 20(10)2020 May 21.

Artículo en Inglés | MEDLINE | ID: mdl-32455613

RESUMEN

The large amount of sensors in modern electrical networks poses a serious challenge in the data processing side. For many years, spectral analysis has been one of the most used approaches to extract physically meaningful information from a sea of data. Fourier Transform (FT) and Wavelet Transform (WT) are by far the most employed tools in this analysis. In this paper we explore the alternative use of Hilbert-Huang Transform (HHT) for electricity demand spectral representation. A sequence of hourly consumptions, spanning 40 months of electrical demand in Spain, has been used as dataset. First, by Empirical Mode Decomposition (EMD), the sequence has been time-represented as an ensemble of 13 Intrinsic Mode Functions (IMFs). Later on, by applying Hilbert Transform (HT) to every IMF, an HHT spectrum has been obtained. Results show smoother spectra with more defined shapes and an excellent frequency resolution. EMD also fosters a deeper analysis of abnormal electricity demand at different timescales. Additionally, EMD permits information compression, which becomes very significant for lossless sequence representation. A 35% reduction has been obtained for the electricity demand sequence. On the negative side, HHT demands more computer resources than conventional spectral analysis techniques.

3.

Learning With Kernels: A Local Rademacher Complexity-Based Analysis With Application to Graph Kernels.

Oneto, Luca; Navarin, Nicolo; Donini, Michele; Ridella, Sandro; Sperduti, Alessandro; Aiolli, Fabio; Anguita, Davide.

IEEE Trans Neural Netw Learn Syst ; 29(10): 4660-4671, 2018 10.

Artículo en Inglés | MEDLINE | ID: mdl-29990207

RESUMEN

When dealing with kernel methods, one has to decide which kernel and which values for the hyperparameters to use. Resampling techniques can address this issue but these procedures are time-consuming. This problem is particularly challenging when dealing with structured data, in particular with graphs, since several kernels for graph data have been proposed in literature, but no clear relationship among them in terms of learning properties is defined. In these cases, exhaustive search seems to be the only reasonable approach. Recently, the global Rademacher complexity (RC) and local Rademacher complexity (LRC), two powerful measures of the complexity of a hypothesis space, have shown to be suited for studying kernels properties. In particular, the LRC is able to bound the generalization error of an hypothesis chosen in a space by disregarding those ones which will not be taken into account by any learning procedure because of their high error. In this paper, we show a new approach to efficiently bound the RC of the space induced by a kernel, since its exact computation is an NP-Hard problem. Then we show for the first time that RC can be used to estimate the accuracy and expressivity of different graph kernels under different parameter configurations. The authors' claims are supported by experimental results on several real-world graph data sets.

4.

Feed-forward support vector machine without multipliers.

Anguita, Davide; Pischiutta, Stefano; Ridella, Sandro; Sterpi, Dario.

IEEE Trans Neural Netw ; 17(5): 1328-31, 2006 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-17001991

RESUMEN

In this letter, we propose a coordinate rotation digital computer (CORDIC)-like algorithm for computing the feed-forward phase of a support vector machine (SVM) in fixed-point arithmetic, using only shift and add operations and avoiding resource-consuming multiplications. This result is obtained thanks to a hardware-friendly kernel, which greatly simplifies the SVM feed-forward phase computation and, at the same time, maintains good classification performance respect to the conventional Gaussian kernel.

Asunto(s)

Algoritmos , Inteligencia Artificial , Metodologías Computacionales , Reconocimiento de Normas Patrones Automatizadas/métodos , Análisis por Conglomerados , Redes Neurales de la Computación

5.

A local Vapnik-Chervonenkis complexity.

Oneto, Luca; Anguita, Davide; Ridella, Sandro.

Neural Netw ; 82: 62-75, 2016 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-27474843

RESUMEN

We define in this work a new localized version of a Vapnik-Chervonenkis (VC) complexity, namely the Local VC-Entropy, and, building on this new complexity, we derive a new generalization bound for binary classifiers. The Local VC-Entropy-based bound improves on the original Vapnik's results because it is able to discard those functions that, most likely, will not be selected during the learning phase. The result is achieved by applying the localization principle to the original global complexity measure, in the same spirit of the Local Rademacher Complexity. By exploiting and improving a recently developed geometrical framework, we show that it is also possible to relate the Local VC-Entropy to the Local Rademacher Complexity by finding an admissible range for one given the other. In addition, the Local VC-Entropy allows one to reduce the computational requirements that arise when dealing with the Local Rademacher Complexity in binary classification problems.

Asunto(s)

Entropía , Aprendizaje Automático , Aprendizaje Automático/tendencias

6.

Fully Empirical and Data-Dependent Stability-Based Bounds.

Oneto, Luca; Ghio, Alessandro; Ridella, Sandro; Anguita, Davide.

IEEE Trans Cybern ; 45(9): 1913-26, 2015 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-25347893

RESUMEN

The purpose of this paper is to obtain a fully empirical stability-based bound on the generalization ability of a learning procedure, thus, circumventing some limitations of the structural risk minimization framework. We show that assuming a desirable property of a learning algorithm is sufficient to make data-dependency explicit for stability, which, instead, is usually bounded only in an algorithmic-dependent way. In addition, we prove that a well-known and widespread classifier, like the support vector machine (SVM), satisfies this condition. The obtained bound is then exploited for model selection purposes in SVM classification and tested on a series of real-world benchmarking datasets demonstrating, in practice, the effectiveness of our approach.

7.

Local Rademacher Complexity: sharper risk bounds with and without unlabeled samples.

Oneto, Luca; Ghio, Alessandro; Ridella, Sandro; Anguita, Davide.

Neural Netw ; 65: 115-25, 2015 May.

Artículo en Inglés | MEDLINE | ID: mdl-25734890

RESUMEN

We derive in this paper a new Local Rademacher Complexity risk bound on the generalization ability of a model, which is able to take advantage of the availability of unlabeled samples. Moreover, this new bound improves state-of-the-art results even when no unlabeled samples are available.

Asunto(s)

Inteligencia Artificial , Modelos Estadísticos

8.

Quantum optimization for training support vector machines.

Anguita, Davide; Ridella, Sandro; Rivieccio, Fabio; Zunino, Rodolfo.

Neural Netw ; 16(5-6): 763-70, 2003.

Artículo en Inglés | MEDLINE | ID: mdl-12850032

RESUMEN

Refined concepts, such as Rademacher estimates of model complexity and nonlinear criteria for weighting empirical classification errors, represent recent and promising approaches to characterize the generalization ability of Support Vector Machines (SVMs). The advantages of those techniques lie in both improving the SVM representation ability and yielding tighter generalization bounds. On the other hand, they often make Quadratic-Programming algorithms no longer applicable, and SVM training cannot benefit from efficient, specialized optimization techniques. The paper considers the application of Quantum Computing to solve the problem of effective SVM training, especially in the case of digital implementations. The presented research compares the behavioral aspects of conventional and enhanced SVMs; experiments in both a synthetic and real-world problems support the theoretical analysis. At the same time, the related differences between Quadratic-Programming and Quantum-based optimization techniques are considered.

Asunto(s)

Metodologías Computacionales , Teoría Cuántica

9.

A deep connection between the Vapnik-Chervonenkis entropy and the Rademacher complexity.

Anguita, Davide; Ghio, Alessandro; Oneto, Luca; Ridella, Sandro.

IEEE Trans Neural Netw Learn Syst ; 25(12): 2202-11, 2014 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-25420243

RESUMEN

In this paper, we derive a deep connection between the Vapnik-Chervonenkis (VC) entropy and the Rademacher complexity. For this purpose, we first refine some previously known relationships between the two notions of complexity and then derive new results, which allow computing an admissible range for the Rademacher complexity, given a value of the VC-entropy, and vice versa. The approach adopted in this paper is new and relies on the careful analysis of the combinatorial nature of the problem. The obtained results improve the state of the art on this research topic.

10.

An improved analysis of the Rademacher data-dependent bound using its self bounding property.

Oneto, Luca; Ghio, Alessandro; Anguita, Davide; Ridella, Sandro.

Neural Netw ; 44: 107-11, 2013 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-23587720

RESUMEN

The problem of assessing the performance of a classifier, in the finite-sample setting, has been addressed by Vapnik in his seminal work by using data-independent measures of complexity. Recently, several authors have addressed the same problem by proposing data-dependent measures, which tighten previous results by taking in account the actual data distribution. In this framework, we derive some data-dependent bounds on the generalization ability of a classifier by exploiting the Rademacher Complexity and recent concentration results: in addition of being appealing for practical purposes, as they exploit empirical quantities only, these bounds improve previously known results.

Asunto(s)

Inteligencia Artificial , Estadística como Asunto/tendencias , Estadística como Asunto/métodos

11.

In-sample and out-of-sample model selection and error estimation for support vector machines.

Anguita, Davide; Ghio, Alessandro; Oneto, Luca; Ridella, Sandro.

IEEE Trans Neural Netw Learn Syst ; 23(9): 1390-406, 2012 Sep.

Artículo en Inglés | MEDLINE | ID: mdl-24807923

RESUMEN

In-sample approaches to model selection and error estimation of support vector machines (SVMs) are not as widespread as out-of-sample methods, where part of the data is removed from the training set for validation and testing purposes, mainly because their practical application is not straightforward and the latter provide, in many cases, satisfactory results. In this paper, we survey some recent and not-so-recent results of the data-dependent structural risk minimization framework and propose a proper reformulation of the SVM learning algorithm, so that the in-sample approach can be effectively applied. The experiments, performed both on simulated and real-world datasets, show that our in-sample approach can be favorably compared to out-of-sample methods, especially in cases where the latter ones provide questionable results. In particular, when the number of samples is small compared to their dimensionality, like in classification of microarray data, our proposal can outperform conventional out-of-sample approaches such as the cross validation, the leave-one-out, or the Bootstrap methods.

12.

Using unsupervised analysis to constrain generalization bounds for support vector classifiers.

Decherchi, Sergio; Ridella, Sandro; Zunino, Rodolfo; Gastaldo, Paolo; Anguita, Davide.

IEEE Trans Neural Netw ; 21(3): 424-38, 2010 Mar.

Artículo en Inglés | MEDLINE | ID: mdl-20123572

RESUMEN

A crucial issue in designing learning machines is to select the correct model parameters. When the number of available samples is small, theoretical sample-based generalization bounds can prove effective, provided that they are tight and track the validation error correctly. The maximal discrepancy (MD) approach is a very promising technique for model selection for support vector machines (SVM), and estimates a classifier's generalization performance by multiple training cycles on random labeled data. This paper presents a general method to compute the generalization bounds for SVMs, which is based on referring the SVM parameters to an unsupervised solution, and shows that such an approach yields tight bounds and attains effective model selection. When one estimates the generalization error, one uses an unsupervised reference to constrain the complexity of the learning machine, thereby possibly decreasing sharply the number of admissible hypothesis. Although the methodology has a general value, the method described in the paper adopts vector quantization (VQ) as a representation paradigm, and introduces a biased regularization approach in bound computation and learning. Experimental results validate the proposed method on complex real-world data sets.

Asunto(s)

Algoritmos , Inteligencia Artificial , Generalización Psicológica , Redes Neurales de la Computación , Reconocimiento de Normas Patrones Automatizadas , Simulación por Computador , Humanos , Reproducibilidad de los Resultados

RESUMEN

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA