IIIE   20352
INSTITUTO DE INVESTIGACIONES EN INGENIERIA ELECTRICA "ALFREDO DESAGES"
Executing Unit - UE
Conferences and scientific meetings
Title:
A multimodal-corpus data collection system for cognitive acoustic scene analysis
Author(s):
GEORGIOU, J. ; POULIQUEN, P. ; CASSIDY, A. ; GARREAU, G. ; ANDREOU, C. ; STUARTS, G. ; D'URBAL, C. ; ANDREOU, A.G. ; DENHAM, S. ; WENNEKERS, T. ; MILL, R. ; WINKLER, I. ; BOHM, T. ; SZALARDY, O. ; KLUMP, G.M. ; JONES, S. ; BENDIXEN, A.
Location:
Baltimore
Meeting:
Conference; Information Sciences and Systems (CISS), 2011 45th Annual Conference on; 2011
Abstract:
We report on the design and the collection of a multi-modal data corpus
for cognitive acoustic scene analysis. Sounds are generated by
stationary and moving sources (people), that is, by omni-directional
speakers mounted on people's heads. One or two subjects walk along
predetermined systematic and random paths, in synchrony and out of sync.
Sound is captured by multiple microphone systems, including a directional
array of four MEMS microphones, two electret microphones situated in the
ears of a stuffed gerbil head, and a Head Acoustics head-shoulder unit
with ICP microphones. Three micro-Doppler units operating at different
frequencies were employed to capture gait and articulatory
signatures as well as the location of the people in the scene. Three ground
vibration sensors recorded the footsteps of the walking people. A
3D MESA camera as well as a web-cam provided 2D and 3D visual data for
system calibration and ground truth. Data were collected in three
environments: a well-controlled environment (anechoic
chamber), an indoor environment (large classroom), and the natural
environment of an outside courtyard. A software tool has been developed
for the browsing and visualization of the data.