RESEARCHERS
FERRER Luciana
conferences and scientific meetings
Title:
SRI’s 2004 NIST speaker recognition evaluation system
Author(s):
SACHIN S. KAJAREKAR; LUCIANA FERRER; ELIZABETH SHRIBERG; KEMAL SÖNMEZ; ANDREAS STOLCKE; ANAND VENKATARAMAN; J. ZHENG
Place:
Philadelphia
Meeting:
Conference; IEEE Conference on Acoustics, Speech, and Signal Processing (ICASSP); 2004
Organizing institution:
IEEE
Abstract:
This paper describes our recent efforts in exploring longer-range features and their statistical modeling techniques for speaker recognition. In particular, we describe a system that uses discriminant features from cepstral coefficients, and systems that use discriminant models from word n-grams and syllable-based NERF n-grams. These systems, together with a cepstral baseline system, are evaluated on the 2004 NIST speaker recognition evaluation dataset. The effect of the development set is measured using two different datasets, one from the Switchboard databases and another from the FISHER database. Results show that the difference between the development and evaluation sets affects the performance of the systems only when more training data is available. Results also show that combining the systems that use longer-range features with the baseline yields about a 31% improvement over the baseline system with 1-side training and about a 61% improvement with 8-side training.