A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor.

Zhou, Yi; Chen, Yufan; Ma, Yongbao; Liu, Hongqing

Zhou, Yi; Chen, Yufan; Ma, Yongbao; Liu, Hongqing.

Affiliation

Zhou Y; School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
Chen Y; School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
Ma Y; Suresense Technology, Chongqing 400065, China.
Liu H; School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.

Sensors (Basel) ; 20(18)2020 Sep 05.

Article in En | MEDLINE | ID: mdl-32899533

ABSTRACT

ABSTRACT

The quality and intelligibility of the speech are usually impaired by the interference of background noise when using internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a simple recurrent unit (SRU)-based neural network postfilter for real-time speech enhancement. Assisted by the BC sensor, which is insensitive to the environmental noise compared to the regular air-conduction (AC) microphone, the accurate voice activity detection (VAD) can be obtained from the BC signal and incorporated into the adaptive noise canceller (ANC) and adaptive block matrix (ABM). The SRU-based postfilter consists of a recurrent neural network with a small number of parameters, which improves the computational efficiency. The sub-band signal processing is designed to compress the input features of the neural network, and the scale-invariant signal-to-distortion ratio (SI-SDR) is developed as the loss function to minimize the distortion of the desired speech signal. Experimental results demonstrate that the proposed real-time speech enhancement system provides significant speech sound quality and intelligibility improvements for all noise types and levels when compared with the AC-only beamformer with a postfiltering algorithm.

Subject(s)

Speech Perception; Speech; Algorithms; Bone Conduction; Noise

Key words

array signal processing; beamforming; bone conduction; deep learning; real time; speech enhancement

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Speech / Speech Perception Type of study: Prognostic_studies Language: En Journal: Sensors (Basel) Year: 2020 Document type: Article Affiliation country: China

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google