On the compromise between noise reduction and speech/noise spatial information preservation in binaural speech enhancement.
Leng, Xin; Chen, Jingdong; Benesty, Jacob.
Affiliation
  • Leng X; Center of Intelligent Acoustics and Immersive Communications and School of Marine Science and Technology, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China.
  • Chen J; Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China.
  • Benesty J; Institut National de la Recherche Scientifique-Énergie, Matériaux et Télécommunications, University of Quebec, Montreal, Québec H5A 1K6, Canada.
J Acoust Soc Am; 149(5): 3151, 2021 May.
Article in English | MEDLINE | ID: mdl-34241094
ABSTRACT
Spatial information is important for human perception of speech and sound signals. However, this information is often distorted or neglected entirely in noise reduction because it is difficult to achieve optimal noise reduction and accurate spatial information preservation at the same time. This paper studies the problem of binaural speech enhancement. By jointly diagonalizing the speech and noise correlation matrices, we present a method to construct the noise reduction filter as a linear combination of eigenvectors that span a subspace of the entire space. The dimension of this subspace sets the trade-off between noise reduction and speech/noise spatial information preservation. At one extreme, if the dimension is equal to 1, maximum noise reduction is achieved but at the price of significant spatial information distortion. At the other extreme, if the dimension of the subspace is equal to that of the entire space, spatial information is accurately preserved but at the cost of no noise reduction. Therefore, one can achieve different levels of compromise between the amount of noise reduction and the level of speech/noise spatial information preservation by adjusting the dimension of the subspace used.
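
The trade-off described in the abstract can be illustrated with a minimal numerical sketch. It is not the authors' filter: the abstract does not give the filter formula, so the construction below (projecting a reference-channel selector onto the Q leading generalized eigenvectors) is an assumption chosen because it reproduces the two extremes the abstract states. The function names `joint_diagonalize`, `subspace_filter`, and `output_snr`, the 4-channel setup, and the random correlation matrices are all hypothetical.

```python
import numpy as np
from scipy.linalg import eigh


def joint_diagonalize(phi_x, phi_v):
    """Jointly diagonalize speech (phi_x) and noise (phi_v) correlation matrices.

    Returns (lam, T) with T^H phi_v T = I and T^H phi_x T = diag(lam),
    eigenvalues sorted in descending order.
    """
    lam, T = eigh(phi_x, phi_v)  # generalized Hermitian problem, ascending order
    order = np.argsort(lam)[::-1]
    return lam[order], T[:, order]


def subspace_filter(T, phi_v, ref, Q):
    """Assumed illustrative filter: combine the Q leading eigenvectors.

    Because T T^H = inv(phi_v), using all eigenvectors (Q = M) returns the
    selector of channel `ref`, i.e. the unprocessed reference signal.
    """
    T_q = T[:, :Q]
    i_ref = np.zeros(T.shape[0])
    i_ref[ref] = 1.0
    return T_q @ (T_q.conj().T @ phi_v @ i_ref)


def output_snr(h, phi_x, phi_v):
    """Ratio of filtered speech power to filtered noise power."""
    num = np.real(h.conj() @ phi_x @ h)
    den = np.real(h.conj() @ phi_v @ h)
    return num / den


# Hypothetical example: M = 4 channels, random positive-definite matrices.
rng = np.random.default_rng(0)
M = 4
A = rng.standard_normal((M, M))
B = rng.standard_normal((M, M))
phi_x = A @ A.T                      # stand-in speech correlation matrix
phi_v = B @ B.T + 0.1 * np.eye(M)    # stand-in noise correlation matrix

lam, T = joint_diagonalize(phi_x, phi_v)
for Q in range(1, M + 1):
    h_left = subspace_filter(T, phi_v, ref=0, Q=Q)  # channel 0 as "left" reference
    print(Q, output_snr(h_left, phi_x, phi_v))
```

In this sketch the output SNR equals the largest generalized eigenvalue when Q = 1 and falls to the input SNR of the reference channel when Q = M, where the filter passes the reference signal unchanged and its spatial cues are therefore untouched. Intermediate values of Q give intermediate output SNRs.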

Full text: 1 | Database: MEDLINE | Main subject: Sound Localization / Speech Perception | Limits: Humans | Language: English | Publication year: 2021 | Document type: Article
