CONICET | Buscador de Institutos y Recursos Humanos

BECAS

BIANCHI Bruno

datos académicos

artículos

congresos y reuniones científicas

Título:

Using LSTM-based Language Models and human Eye Movements metrics to understand next-word predictions

Autor/es:

UMFURER, ALFREDO ; KAMIENKOWSKI, JUAN E.; BIANCHI, BRUNO

Lugar:

Buenos Aires

Reunión:

Congreso; Jornada Argentina de Informática; 2022

Institución organizadora:

SADIO

Resumen:

Modern Natural Language Processing (NLP) models canachieve great results resolving different types of linguistic tasks. Thisis possible thanks to a high volume of internal parametersthat are optimized during the training phase. They allow to model high-level linguisticproperties. For example, LSTM-based language models have the abilityto find long-term dependencies between words on a text, and use them tomake predictions about upcoming words. Nevertheless, their complexitymakes it hard to understand which features they use to generate predictions.The neurolinguistic field faces a similar issue when studying how ourbrain processes language. For example, every adult reader has the abilityto understand long texts and to make predictions of upcoming words.Nevertheless, our understanding on how these predictions are driven islimited. During the last decades, the study of eye movements duringreading have shed some light on this topic, finding a relation betweenthe time spent on a word (gaze duration) and its processing cost.Here, we aim to understand how LSTM-based models predict futurewords and these predictions relate with human predictions, fitting statistical models commonly used in the neurolinguistic field with gaze duration as the dependent variable. We found that an AWD-LSTM LanguageModel can partially model eye movements, with high overlap with bothhuman-Predictability and lexical frequency. Interestingly, this last overlap is seen to depend on the training corpus, being lower when the modelis fine-tuned with a corpus similar to the one used for testing.

enviar mensaje