INVESTIGADORES
MILONE Diego Humberto
artículos
Título:
Multiresolution information measures applied to speech recognition
Autor/es:
TORRES, M. E.; RUFINER, H. L.; MILONE, D. H.; CHERNIZ, A.
Revista:
PHYSICA A - STATISTICAL AND THEORETICAL PHYSICS
Editorial:
Elsevier Science
Referencias:
Lugar: Amsterdam; Año: 2007 vol. 2007 p. 319 - 332
ISSN:
0378-4371
Resumen:
Considerable advances in automatic speech recognition have been made in the last decades, thanks specially to the use of hidden Markov models. In the field of speech signal analysis, different techniques have been developed. However, deterioration in the performance of the speech recognizers has been observed when they are trained with clean signal and tested with noisy signals. This is still an open problem in this field. Continuous multiresolution entropy has been shown to be robust to additive noise in applications to different physiological signals. In previous works we have included Shannon and Tsallis entropies, and their corresponding divergences, in different speech analysis and recognition systems. In this paper we present an extension of the continuous multiresolution entropy to different divergences and we propose them as new dimensions for the pre-processing stage of a speech recognition system. This approach takes into account information about changes in the dynamics of speech signal at different scales. The methods proposed here are tested with speech signals corrupted with babble and white noise. Their performance is compared with classical mel cepstral parametrization. The results suggest that these continuous multiresolution entropy related measures provide valuable information to the speech recognition system and that they could be considered to be included as an extra component in the pre-processing stage.