CIFASIS   20631
CENTRO INTERNACIONAL FRANCO ARGENTINO DE CIENCIAS DE LA INFORMACION Y DE SISTEMAS
Unidad Ejecutora - UE
congresos y reuniones científicas
Título:
Isolated Spanish Digit Recognition based on Audio-Visual Features
Autor/es:
GONZALO SAD; LUCAS D. TERISSI; JUAN C. GÓMEZ
Lugar:
Mar del Plata
Reunión:
Conferencia; XIX Congreso Argentino de Ciencias de la Computación - CACIC 2013; 2013
Institución organizadora:
Red de Universidades Nacionales con carreras en Informática (RedUNCI)
Resumen:
The performance of classical speech recognition techniques based on audio features is degraded in noisy environments. The inclusion of visual features related to mouth movements into the recognition process improves the performance of the system. This paper proposes an isolated word speech recognition system based on audio-visual features. The proposed system combines three classifiers based on audio, visual and audio-visual information, respectively. An audio-visual database composed by the utterances of the digits (in Spanish language) is employed to test the proposed system. The experimental results show a significant improvement on the recognition rates through a wide range of signal-to-noise ratios.