Your browser doesn't support javascript.
loading
Coresets for the Average Case Error for Finite Query Sets.
Maalouf, Alaa; Jubran, Ibrahim; Tukan, Murad; Feldman, Dan.
Afiliación
  • Maalouf A; Robotics & Big Data Labs, University of Haifa, Haifa 3498838, Israel.
  • Jubran I; Robotics & Big Data Labs, University of Haifa, Haifa 3498838, Israel.
  • Tukan M; Robotics & Big Data Labs, University of Haifa, Haifa 3498838, Israel.
  • Feldman D; Robotics & Big Data Labs, University of Haifa, Haifa 3498838, Israel.
Sensors (Basel) ; 21(19)2021 Oct 08.
Article en En | MEDLINE | ID: mdl-34641008
ABSTRACT
Coreset is usually a small weighted subset of an input set of items, that provably approximates their loss function for a given set of queries (models, classifiers, hypothesis). That is, the maximum (worst-case) error over all queries is bounded. To obtain smaller coresets, we suggest a natural relaxation coresets whose average error over the given set of queries is bounded. We provide both deterministic and randomized (generic) algorithms for computing such a coreset for any finite set of queries. Unlike most corresponding coresets for the worst-case error, the size of the coreset in this work is independent of both the input size and its Vapnik-Chervonenkis (VC) dimension. The main technique is to reduce the average-case coreset into the vector summarization problem, where the goal is to compute a weighted subset of the n input vectors which approximates their sum. We then suggest the first algorithm for computing this weighted subset in time that is linear in the input size, for n≫1/ε, where ε is the approximation error, improving, e.g., both [ICML'17] and applications for principal component analysis (PCA) [NIPS'16]. Experimental results show significant and consistent improvement also in practice. Open source code is provided.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Clinical_trials Idioma: En Revista: Sensors (Basel) Año: 2021 Tipo del documento: Article País de afiliación: Israel

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Clinical_trials Idioma: En Revista: Sensors (Basel) Año: 2021 Tipo del documento: Article País de afiliación: Israel
...