ICIC   25583
INSTITUTO DE CIENCIAS E INGENIERIA DE LA COMPUTACION
Executing Unit - UE
Articles
Title:
ATR-Vis: Visual and Interactive Information Retrieval for Parliamentary Discussions in Twitter
Author(s):
CARVALHO, EDER; MAKKI, RAHELEH; OLIVEIRA, MARIA CRISTINA FERREIRA DE; BROOKS, STEPHEN; MINGHIM, ROSANE; SOTO, AXEL J.; MILIOS, EVANGELOS
Journal:
ACM Transactions on Knowledge Discovery from Data
Publisher:
ACM
Reference:
Place: New York; Year: 2018; vol. 12; pp. 1-33
ISSN:
1556-4681
Abstract:
The worldwide adoption of Twitter turned it into one of the most popular platforms for content analysis as it serves as a gauge of the public's feeling and opinion on a variety of topics. This is particularly true of political discussions and lawmakers' actions and initiatives. Yet, one common but unrealistic assumption is that the data of interest for analysis is readily available in a comprehensive and accurate form. Data need to be retrieved, but due to the brevity and noisy nature of Twitter content, it is difficult to formulate user queries that match relevant posts that use different terminology without introducing a considerable volume of unwanted content. This problem is aggravated when the analysis must contemplate multiple and related topics of interest, for which comments are being concurrently posted. This article presents Active Tweet Retrieval Visualization (ATR-Vis), a user-driven visual approach for the retrieval of Twitter content applicable to this scenario. The method proposes a set of active retrieval strategies to involve an analyst in such a way that a major improvement in retrieval coverage and precision is attained with minimal user effort. ATR-Vis enables non-technical users to benefit from the aforementioned active learning strategies by providing visual aids to facilitate the requested supervision. This supports the exploration of the space of potentially relevant tweets, and affords a better understanding of the retrieval results. We evaluate our approach in scenarios in which the task is to retrieve tweets related to multiple parliamentary debates within a specific time span. We collected two Twitter datasets, one associated with debates in the Canadian House of Commons during a particular week in May 2014, and another associated with debates in the Brazilian Federal Senate during a selected week in May 2015. The two use cases illustrate the effectiveness of ATR-Vis for the retrieval of relevant tweets, while quantitative results show that our approach achieves high retrieval quality with a modest amount of supervision. Finally, we evaluated our tool with three external users who perform searching in social media as part of their professional work.