INVESTIGADORES
GRANITTO Pablo Miguel
artículos
Título:
Improved multiclass feature selection via list combination
Autor/es:
IZETTA, JAVIER; VERDES, PABLO F.; GRANITTO, PABLO M.
Revista:
EXPERT SYSTEMS WITH APPLICATIONS
Editorial:
PERGAMON-ELSEVIER SCIENCE LTD
Referencias:
Año: 2017 vol. 88C p. 205 - 216
ISSN:
0957-4174
Resumen:
Feature selection is a crucial machine learning technique aimed at reducing the dimensionality of the input space. By discarding useless or redundant variables, not only it improves model performance but also facilitates its interpretability. The well-known Support Vector Machines--Recursive Feature Elimination (SVM-RFE) algorithm provides good performance with moderate computational efforts, in particular for wide datasets. When using SVM-RFE on a multiclass classification problem, the usual strategy is to decompose it into a series of binary ones, and to generate an importance statistics for each feature on each binary problem. These importances are then averaged over the set of binary problems to synthesize a single value for feature ranking. In some cases, however, this procedure can lead to poor selection. In this paper we discuss six new strategies, based on list combination, designed to yield improved selections starting from the importances given by the binary problems. We evaluate them on artificial and real-world datasets, using both One--Vs--One (OVO) and One--Vs--All (OVA) strategies. Our results suggest that the OVO decomposition is most effective for feature selection on multiclass problems. We also find that in most situations the new K-First strategy can find better subsets of features than the traditional weight average approach.