INVESTIGADORES
SOTO Axel Juan
congresos y reuniones científicas
Título:
Adaptive Visualization of Text Documents Incorporating Domain Knowledge
Autor/es:
AXEL J. SOTO; MARC STRICKERT; GUSTAVO E. VAZQUEZ; EVANGELOS MILIOS
Lugar:
Vancouver
Reunión:
Workshop; Challenges of Data Visualization. NIPS 2010 Workshop; 2010
Institución organizadora:
NIPS
Resumen:
We present a method for visualizing text corpora that are assumed to contain labeled and unlabeled documents. Our method aims at learning data mappings of labeled documents including the terms that are most relevant for label discrimination. We can use this information to visualize mapped unlabeled documents as well. We also show how this method allows the inclusion of user's feedback. This feedback is supplied in an iterative process, so that the user can use the output of the method to provide its domain knowledge of the data. At the same time, this technique is well suited for providing a new low-dimensional space where traditional clustering or classification methods can be applied. Even though our approach is able to deal with document labels that are discrete classes, continuous values, or associated vectors, we confine the experiments of this article to labelsthat represent non-overlapped topics. This approach is evaluated using a set of short and noisy documents, which is considered as a challenging task in the text mining literature.