CIFASIS   20631
CENTRO INTERNACIONAL FRANCO ARGENTINO DE CIENCIAS DE LA INFORMACION Y DE SISTEMAS
Executing Unit - UE
Conferences and scientific meetings
Title:
Isolated Word Speech Recognition improvements based on the fusion of Audio, Video and Audio-Video Classifiers
Author(s):
GONZALO SAD; LUCAS D. TERISSI; JUAN C. GÓMEZ
Location:
Río Negro
Meeting:
Conference; XV Reunión de Trabajo en Procesamiento de la Información y Control; 2013
Organizing institution:
Universidad Nacional de Río Negro
Abstract:
This paper describes an isolated word speech recognition system based on audio-visual features. Visual features related to mouth movements are included to improve recognition rates, mainly under noisy audio conditions. The proposed system combines three classifiers based on audio, visual and audio-visual information, respectively. A Spanish audio-visual database is employed to test the proposed system. The experimental results show that a significant improvement is achieved when the visual information is considered. The structure of the proposed system allows the recognition rates to be improved over a wide range of signal-to-noise ratios.
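
The abstract does not state how the three classifiers are combined. As a rough illustration only, the sketch below assumes a simple late-fusion rule: each classifier returns a log-likelihood per candidate word, and the scores are merged with a weighted sum. The function name `fuse_and_classify`, the classifier labels, the example words, the scores and the weights are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Tuple


def fuse_and_classify(
    scores_by_classifier: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> Tuple[str, Dict[str, float]]:
    """Combine per-word log-likelihoods from several classifiers.

    Assumption: fusion is a weighted sum of log-likelihoods. This is one
    common late-fusion rule, not necessarily the rule used in the paper.
    """
    fused: Dict[str, float] = {}
    for name, scores in scores_by_classifier.items():
        w = weights[name]
        for word, log_likelihood in scores.items():
            fused[word] = fused.get(word, 0.0) + w * log_likelihood
    # The recognized word is the one with the highest fused score.
    best_word = max(fused, key=fused.get)
    return best_word, fused


# Hypothetical log-likelihoods for two Spanish words.
scores = {
    "audio": {"uno": -12.0, "dos": -10.0},
    "video": {"uno": -8.0, "dos": -11.0},
    "audio_video": {"uno": -9.0, "dos": -9.5},
}

# Hypothetical weights: trust audio less when the signal-to-noise ratio is low.
noisy_weights = {"audio": 0.2, "video": 0.5, "audio_video": 0.3}
clean_weights = {"audio": 0.8, "video": 0.1, "audio_video": 0.1}

noisy_word, noisy_fused = fuse_and_classify(scores, noisy_weights)
clean_word, clean_fused = fuse_and_classify(scores, clean_weights)
```

With these made-up numbers, the decision follows the visual classifier when the audio weight is low and follows the audio classifier when it is high, which is the kind of behavior a fusion scheme would need in order to hold up across a range of signal-to-noise ratios.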