ICIC   25583
INSTITUTO DE CIENCIAS E INGENIERIA DE LA COMPUTACION
Unidad Ejecutora - UE
congresos y reuniones científicas
Título:
Assessing Causality Structures learned from Digital Text Media
Autor/es:
DELBIANCO, FERNANDO; EVANGELOS, E. MILIOS; TOHMÉ, FERNANDO ABEL; MAISONNAVE, MARIANO; MAGUITMAN, ANA GABRIELA
Lugar:
virtual
Reunión:
Simposio; ACM Symposium on Document Engineering 2020 (DocEng); 2020
Institución organizadora:
ACM
Resumen:
In this paper we describe a framework to uncover potential causal relations between event mentions from streaming text of news media. This framework relies on a dataset of manually labeled events to train a recurrent neural network for event detection. It then creates a time series of event clusters, where clusters are based on BERT contextual word embedding representations of the identified events. Using these time series dataset, we assess four methods based on Granger causality for inferring causal relations. Granger causality is a statistical concept of causality that is based on forecasting. It states that a cause occurs before the effect, and the cause produces unique changes in the effect, so past values of the cause help predict future values of the effect. The four analyzed methods are the pairwise Granger test, VAR(1), BigVar and SiMoNe. The framework is applied to the New York Times dataset, which covers news for a period of 246 months. This preliminary analysis delivers important insights into the nature of each method, identifies differences and commonalities, and points out some of their strengths and weaknesses