INVESTIGADORES
SANCHEZ Jorge Adrian
congresos y reuniones científicas
Título:
Fisher vectors for fine-grained visual categorization
Autor/es:
JORGE SÁNCHEZ; FLORENT PERRONNIN; ZEYNEP AKATA
Lugar:
Colorado-Springs
Reunión:
Workshop; First Workshop on Fine-Grained Visual Categorization; 2011
Institución organizadora:
IEEE, CVPR
Resumen:
The bag-of-visual-words (BOW) is certainly the most popular image representation to date and it has been shown to yield good results in various problems including Fine-Grained Visual Categorization (FGVC) [3, 4]. Our contribution is to show that the Fisher Vector (FV) which describes an image by its deviation from an average model is an alternative which performs much better than the BOW for the FGVC problem. In this extended abstract we first provide a brief introduction to the FV. We then present theoretical as well as practical motivations for using the FV for FGVC. We finally provide experimental results on four ImageNet subsets: fungus, ungulate, vehicle and ImageNet10K. Compared to [4] which uses spatial pyramid (SP) BOW representations, we report significantly higher classification accuracies. For instance, on ImageNet10K we report 16.7% vs 6.4% top-1 accuracy which represents a 160% relative improvement.