[Speaker gender identification based on audio fractal dimension and pitch feature].

Wang, Zhenhua; Yang, Cuirong; Wu, Wei; Fan, Yingle

Affiliation

Wang Z; Biomedical Engineering & Instrument Institute, Hangzhou Dianzi University, Hangzhou 310018, China. zhenhua0987@eyou.com

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi ; 25(4): 805-10, 2008 Aug.

Article in Chinese | MEDLINE | ID: mdl-18788284

ABSTRACT

Automatic speaker gender identification from voice features is an important task in voice processing and analysis. This paper adds non-linear parameters, such as the fractal dimension, to the feature space in order to improve on the description of speaker gender obtained with conventional linear parameters alone. Pitch is extracted using the lifting scheme, and the audio fractal dimension is computed. Then, based on Takens' theorem, the time-delay method is used to reconstruct the phase space of the fractal dimension sequence, and the fractal dimension complexity is obtained by calculating the approximate entropy. Three-dimensional feature vectors comprising the pitch, the fractal dimension, and the fractal dimension complexity are then used for speaker gender identification. Experimental results show that, compared with using a single linear parameter such as pitch, adding the non-linear parameters makes the proposed method more accurate and robust, providing a new approach to speaker gender identification.
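
The time-delay reconstruction and approximate entropy steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the embedding dimension, the delay, the tolerance, or the way the fractal dimension sequence is produced, so the defaults below (`dim=2`, `delay=1`, tolerance of 0.2 times the standard deviation) are common textbook choices assumed here, and the function names `delay_embed` and `approximate_entropy` are hypothetical. This is not the authors' implementation.

```python
import math


def delay_embed(series, dim, delay=1):
    """Time-delay embedding: build vectors [x[i], x[i+delay], ..., x[i+(dim-1)*delay]]."""
    n_vectors = len(series) - (dim - 1) * delay
    if n_vectors <= 0:
        raise ValueError("series too short for this dim and delay")
    return [[series[i + k * delay] for k in range(dim)] for i in range(n_vectors)]


def _phi(series, dim, delay, tol):
    """Average log-fraction of embedded vectors lying within tol of each vector."""
    vectors = delay_embed(series, dim, delay)
    n = len(vectors)
    total = 0.0
    for a in vectors:
        # Chebyshev (max-coordinate) distance; the self-match keeps count >= 1.
        count = sum(
            1 for b in vectors
            if max(abs(p - q) for p, q in zip(a, b)) <= tol
        )
        total += math.log(count / n)
    return total / n


def approximate_entropy(series, dim=2, delay=1, r_factor=0.2):
    """Approximate entropy: phi(dim) - phi(dim + 1).

    Assumed defaults: dim=2, delay=1, tolerance = r_factor * standard deviation.
    """
    mean = sum(series) / len(series)
    std = math.sqrt(sum((x - mean) ** 2 for x in series) / len(series))
    tol = r_factor * std
    return _phi(series, dim, delay, tol) - _phi(series, dim + 1, delay, tol)
```

In this sketch, `series` stands for a sequence of per-frame fractal dimension values (an assumption about the input). A strictly periodic sequence yields a value near zero, while an irregular one yields a larger value, which is the sense in which approximate entropy serves as a complexity measure.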

Subjects

Pattern Recognition, Automated/methods; Pitch Discrimination; Sex Characteristics; Signal Processing, Computer-Assisted; Voice; Algorithms; Artificial Intelligence; Biometry/methods; Humans; Nonlinear Dynamics; Speech; Speech Acoustics


Collections: 01-international Database: MEDLINE Main subject: Pitch Discrimination / Voice / Signal Processing, Computer-Assisted / Pattern Recognition, Automated / Sex Characteristics Study type: Diagnostic_studies / Prognostic_studies Limits: Humans Language: Chinese Publication year: 2008 Document type: Article
