[Speaker gender identification based on audio fractal dimension and pitch feature].
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi
; 25(4): 805-10, 2008 Aug.
Article
em Zh
| MEDLINE
| ID: mdl-18788284
ABSTRACT
Automatic speaker gender identification based on voice feature is an important task in voice processing and analysis fields. In this paper non-linear parameters such as fractal dimension are applied to be one part of feature space for improving the ability of describing speaker gender feature through conventional linear parameters method. Pitch is picked using lifting scheme, and audio fractal dimension is extracted. Then based on Takens theory, the time delay method is used to reconstruct the phase space of fractal dimension sequence. And fractal dimension complexity is obtained by calculating Approximate Entropy. Three dimension feature vectors, including the pitch, the fractal dimension and the fractal dimension complexity, are applied to speaker gender identification. Experiment results show that through adding non-linear parameters, compared with the linear parameter using one dimension only such as pitch, the proposed method is more accurate and robust, and thus provides a new way for speaker gender identification.
Buscar no Google
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Discriminação da Altura Tonal
/
Voz
/
Processamento de Sinais Assistido por Computador
/
Reconhecimento Automatizado de Padrão
/
Caracteres Sexuais
Tipo de estudo:
Diagnostic_studies
/
Prognostic_studies
Limite:
Humans
Idioma:
Zh
Ano de publicação:
2008
Tipo de documento:
Article