PERSONAL DE APOYO
HERNANDEZ nidia Alejandra
artículos
Título:
Digital Narratives of COVID-19: A Twitter Dataset for Text Analysis in Spanish
Autor/es:
HERNANDEZ, NIDIA
Revista:
Journal of Open Humanities Data
Editorial:
Ubiquity Press
Referencias:
Año: 2021
Resumen:
Digital Narratives of COVID-19 (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter?s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanish-speaking areas spanning North and Central America (Mexico, Colombia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.