TORRES Humberto Maximiliano
congresos y reuniones científicas
N-BEST RESCORING BASED ON INTONATION PREDICTION FOR A SPANISH ASR SYSTEM
DIEGO A. EVIN; JORGE A. GURLEKIAN; HUMBERTO M. TORRES
Conferencia; 21 Konferenz Elektronische Sprachsignalverarbeitung ( ESSV 2010); 2010
Departament of Computer Scince and Media Beuth University of Applied Scinces
This paper presents a novel method for rescoring the n-best recognition hypotheses using intonation knowledge. The model synthesizes the f0 contours for each of the n-best hypotheses and estimates an intonative matching index between the synthetic shapes and the real f0 contour. This index is applied in the rescoring process, and can be viewed as a degree of intonation compatibility between the hypotheses and the input sentence. The f0 prediction is based on classification and regression trees and the Fujisaki model. We evaluate our approach using a single speaker of the Buenos Aires Spanish LIS-SECYT database under clean and babble-noisy conditions. Considering the systems under no grammar condition, the proposed model reduces the mean absolute word error rate in 3.1% with respect to the baseline system, in a consistent manner and under different noise conditions.