INVESTIGADORES
SAD Gonzalo Daniel
congresos y reuniones científicas
Título:
Isolated Spanish Digit Recognition based on Audio-Visual Features
Autor/es:
GONZALO SAD; LUCAS TERISSI; JUAN CARLOS GÓMEZ
Lugar:
Mar del Plata, Buenos Aires
Reunión:
Congreso; XIX Congreso Argentino de Ciencias de la Computación; 2013
Resumen:
The performance of classical speech recognition techniques based on audio features is degraded in noisy environments. The inclusion of visual features related to mouth movements into the recognition process improves the performance of the system. This paper proposes an isolated word speech recognition system based on audio-visual features. The proposed system combines three classifiers based on audio, visual and audio-visual information, respectively. An audio-visual database composed by the utterances of the digits (in Spanish language) is employed to test the proposed system. The experimental results show a significant improvement on the recognition rates through a wide range of signal-to-noise ratios.