TORRES Humberto Maximiliano
congresos y reuniones científicas
Database for Automatic Speech Recognition of Argentine Spanish
J. GURLEKIAN; L. COLANTONI; H. TORRES; A. RINCÓN; A. MORENO; J. MARIÑO
Workshop; IRCS Workshop on Linguistic Databases; 2001
University of Pennsylvania
The goal of this project was the design and realisation of a database to be used in an automatic speech recognition system for a fixed telephone network. One thousand speakers, native to five Argentine dialectal regions, were recorded. Each speaker answered five questions and read 38 texts, which consisted of numbers, names, last names, corporation names, and phonetically rich sentences. Two hundred sets of nine sentences, selected from a pool of 7000 sentences, were used to generate 1,000 prompt sheets. These sentences contained all of the contextual allophones transcribed with the SAMPA alphabet. A strategy of uniform information collection and recording was designed. Phone calls were received through a digital line connected to a computer, and monitored by acquisition software. This procedure resulted in an efficient way to collect data from a large number of speakers representing several different dialects in only two months.