Your browser doesn't support javascript.
loading
Unsupervised Learning for Monaural Source Separation Using Maximization⁻Minimization Algorithm with Time⁻Frequency Deconvolution.
Woo, Wai Lok; Gao, Bin; Bouridane, Ahmed; Ling, Bingo Wing-Kuen; Chin, Cheng Siong.
Afiliación
  • Woo WL; School of Electrical and Electronic Engineering, Newcastle University, Newcastle upon Tyne NE1 7RU, UK. lok.woo@ncl.ac.uk.
  • Gao B; School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China. bin_gao@uestc.edu.cn.
  • Bouridane A; Department of Computer and Information Sciences, Northumbria University, Newcastle upon Tyne NE1 8ST, UK. ahmed.bouridane@northumbria.ac.uk.
  • Ling BW; Faculty of Information Engineering, Guangdong University of Technology, Guangzhou 510006, China. yongquanling@gdut.edu.cn.
  • Chin CS; Faculty of Science Agriculture and Engineering, Newcastle University, Singapore 599493, Singapore. cheng.chin@ncl.ac.uk.
Sensors (Basel) ; 18(5)2018 Apr 27.
Article en En | MEDLINE | ID: mdl-29702629
ABSTRACT
This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time⁻frequency deconvolution with optimized fractional ß-divergence. The ß-divergence is a group of cost functions parametrized by a single parameter ß. The Itakura⁻Saito divergence, Kullback⁻Leibler divergence and Least Square distance are special cases that correspond to ß=0, 1, 2, respectively. This paper presents a generalized algorithm that uses a flexible range of ß that includes fractional values. It describes a maximization⁻minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time⁻frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional ß value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Clinical_trials / Prognostic_studies Idioma: En Revista: Sensors (Basel) Año: 2018 Tipo del documento: Article País de afiliación: Reino Unido Pais de publicación: CH / SUIZA / SUÍÇA / SWITZERLAND

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Clinical_trials / Prognostic_studies Idioma: En Revista: Sensors (Basel) Año: 2018 Tipo del documento: Article País de afiliación: Reino Unido Pais de publicación: CH / SUIZA / SUÍÇA / SWITZERLAND