ICSOH   24899
INSTITUTO DE INVESTIGACIONES EN CIENCIAS SOCIALES Y HUMANIDADES
Executing Unit (Unidad Ejecutora) - UE
Conferences and scientific meetings
Title:
Rebuilding the Story of a Hero: Information Extraction in Ancient Argentinian Texts
Author(s):
MECHACA, ANA LIDIA; MARMANILLO, WALTER GABRIEL; XAMENA, EDUARDO
Place:
Salta
Meeting:
Conference; Jornadas Argentinas de Informática e Investigación Operativa; 2019
Organizing institution:
Sociedad Argentina de Informática e Investigación Operativa (SADIO)
Abstract:
Large numbers of ancient documents on Argentinian history have become available in recent years. This makes it possible to find interesting and useful aggregated information. This work proposes the application of Natural Language Processing, Text Mining and Visualization tools to Argentinian ancient document repositories. Conceptual maps and entity networks are the first target of this preliminary paper. The first step is the normalization of OCR-acquired books of General Güemes. Exploratory analyses reveal numerous spelling errors caused by the OCR acquisition of the volumes. We propose automatic methods for overcoming this issue during normalization. In addition, a first topic landscape of a subset of volumes is obtained and analysed via Topic Modelling tools.