TORRES Humberto Maximiliano
Novel Estimation Method for the Superpositional Intonation Model
HUMBERTO MAXIMILIANO TORRES; JORGE ALBERTO GURLEKIAN
IEEE/ACM Transactions on Audio, Speech, and Language Processing
IEEE Signal Processing Society
Place: New York; Year: 2016 vol. 24 p. 151 - 151
Fujisaki's intonation model parameterizes the F0 contour efficiently and, because of its strong physiological basis, has been successfully tested in different languages. One problem that has not been fully addressed is the extraction of the model's parameters, i.e., given a sentence, which parameter values best describe its intonation. Most of the proposed methods strive to optimize the parameters so as to obtain the best global fit to the F0 contour. In this paper we propose to use text information from the sentence as the main guide or reference for adjusting the parameters. We present a method that defines a set of rules to fix and optimize the model's parameters. The optimization never loses sight of the text-structure events that give rise to it. When the text information is not enough, the algorithm predicts parameters from the F0 contour and ties them to the text. The process of parameter estimation can thus be seen as a way to go from text information to the F0 contour. Parameter optimization is carried out to fit the F0 contour locally. Our novel approach can be implemented manually or automatically. We present examples of the manual implementation and quantitative results for the automatic one. Tested on three corpora in Spanish, English and German, our automatic method performs 34% better than the other methods tested.
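For readers unfamiliar with the model, the following is a minimal sketch of the standard Fujisaki formulation the abstract refers to: log F0 is a base value plus phrase components (impulse responses) and accent components (step responses). It is not the estimation method of the paper. The function names, the constants alpha = 2.0, beta = 20.0 and gamma = 0.9, and the helper `local_log_rmse` are illustrative assumptions; the helper only shows what measuring the fit over a local time window, rather than over the whole contour, might look like.

```python
import math

def phrase_response(t, alpha=2.0):
    """Phrase-control impulse response: alpha^2 * t * exp(-alpha * t) for t >= 0."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0

def accent_response(t, beta=20.0, gamma=0.9):
    """Accent-control step response, clipped at the ceiling gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def fujisaki_f0(t, fb, phrases, accents, alpha=2.0, beta=20.0, gamma=0.9):
    """F0 in Hz at time t (seconds).

    fb      -- base frequency in Hz
    phrases -- list of (T0, Ap): onset time and magnitude of each phrase command
    accents -- list of (T1, T2, Aa): onset, offset and amplitude of each accent command
    """
    log_f0 = math.log(fb)
    for t0, ap in phrases:
        log_f0 += ap * phrase_response(t - t0, alpha)
    for t1, t2, aa in accents:
        log_f0 += aa * (accent_response(t - t1, beta, gamma)
                        - accent_response(t - t2, beta, gamma))
    return math.exp(log_f0)

def local_log_rmse(times, observed_f0, t_start, t_end, fb, phrases, accents):
    """Hypothetical helper: RMS error in log F0 restricted to [t_start, t_end]."""
    errors = [
        (math.log(f_obs) - math.log(fujisaki_f0(t, fb, phrases, accents))) ** 2
        for t, f_obs in zip(times, observed_f0)
        if t_start <= t <= t_end and f_obs > 0  # skip unvoiced frames (F0 = 0)
    ]
    return math.sqrt(sum(errors) / len(errors)) if errors else 0.0

# Example with made-up command values: one phrase command, one accent command.
phrases = [(0.0, 0.5)]
accents = [(0.3, 0.6, 0.4)]
times = [i * 0.01 for i in range(150)]
contour = [fujisaki_f0(t, 100.0, phrases, accents) for t in times]
```

In a text-guided scheme of the kind the abstract describes, the command times would be anchored to events in the sentence and the error would be evaluated around each event rather than over the whole utterance; how the paper does this exactly is not specified in the abstract.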