INVESTIGADORES
KAMENETZKY Laura
artículos
Título:
Clustering biological data with SOMs: on topology preservation in non-linear dimensional reduction
Autor/es:
MILONE, D.; STEGMAYER, G.; KAMENETZKY, L.; LÓPEZ, M.; CARRARI, F.
Revista:
EXPERT SYSTEMS WITH APPLICATIONS
Editorial:
PERGAMON-ELSEVIER SCIENCE LTD
Referencias:
Lugar: Amsterdam; Año: 2012
ISSN:
0957-4174
Resumen:
Dimensional reduction is a widely used technique for exploratory analysis of large volume of data. In biologicaldatasets, each object is described by a large number of variables (or dimensions) and it is crucial toperform their analyses in a smaller space, to extract useful information. Kohonen self-organizing maps(SOMs) have been recently proposed in systems biology as a useful tool for exploratory analysis, dataintegration and discovery of new relationships in *omics datasets. SOMs have been traditionally usedfor clustering in several data mining problems, mainly due to their ability to preserve input data topologyand reduce a high dimensional input space into a 2-D map. In spite of this, the above-mentioned dimensionalreduction can lead to counterintuitive results. Sometimes, maps having almost the same size,trained on the same dataset, and with identical learning algorithms and parameters, may find differentclusters. However, one would expect that small changes in map sizes or another training condition wouldnot result in an abrupt different location of any of the grouped patterns. The aim of this work is to analyzeand explain this issue through a real case study involving transcriptomic and metabolomic data, since itmight have an important impact when interpreting clustering results over a biological dataset.