Your browser doesn't support javascript.
loading
A proposal of prior probability-oriented clustering in feature encoding strategies.
Shinomiya, Yuki; Hoshino, Yukinobu.
Afiliação
  • Shinomiya Y; School of System Engineering, Kochi University of Technology, Kami, Kochi, Japan.
  • Hoshino Y; School of System Engineering, Kochi University of Technology, Kami, Kochi, Japan.
PLoS One ; 14(1): e0210146, 2019.
Article em En | MEDLINE | ID: mdl-30629616
Codebook-based feature encodings are a standard framework for image recognition issues. A codebook is usually constructed by clusterings, such as the k-means and the Gaussian Mixture Model (GMM). A codebook size is an important factor to decide the trade-off between recognition performance and computational complexity and a traditional framework has the disadvantage to image recognition issues when a large codebook; the number of unique clusters becomes smaller than a designated codebook size because some clusters converge to close positions. This paper focusses on the disadvantage from a perspective of the distribution of prior probabilities and presents a clustering framework including two objectives that are alternated to the k-means and the GMM. Our approach is first evaluated with synthetic clustering datasets to analyze a difference to traditional clustering. In the experiment section, although our approach alternated to the k-means generates similar results to the k-means results, our approach is able to finely tune clusters for our objective. Our approach alternated to the GMM significantly improves our objective and constructs intuitively appropriate clusters, especially for huge and complicatedly distributed samples. In the experiment on image recognition issues, two state-of-the-art encodings, the Fisher Vector (FV) using the GMM and the Vector of Locally Aggregated Descriptors (VLAD) using the k-means, are evaluated with two publicly available image datasets, the Birds and the Butterflies. For the results of the VLAD with our approach, the recognition performances tend to be worse compared to the original VLAD results. On the other hand, the FV using our approach is able to improve the performance, especially in a larger codebook size.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento de Imagem Assistida por Computador / Reconhecimento Automatizado de Padrão / Probabilidade / Máquina de Vetores de Suporte Tipo de estudo: Prognostic_studies Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Japão País de publicação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento de Imagem Assistida por Computador / Reconhecimento Automatizado de Padrão / Probabilidade / Máquina de Vetores de Suporte Tipo de estudo: Prognostic_studies Idioma: En Revista: PLoS One Assunto da revista: CIENCIA / MEDICINA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Japão País de publicação: Estados Unidos