ICC   25427
INSTITUTO DE INVESTIGACION EN CIENCIAS DE LA COMPUTACION
Unidad Ejecutora - UE
artículos
Título:
Performance improvements for search systems using an integrated cache of lists + intersections
Autor/es:
MARCHETTI-SPACAMELLA ALBERTO; BECHETTI LUCA; TOLOSA GABRIEL; FEUERSTEIN ESTEBAN
Revista:
INFORMATION RETRIEVAL
Editorial:
SPRINGER
Referencias:
Lugar: Berlin; Año: 2017 p. 1 - 27
ISSN:
1386-4564
Resumen:
Modern information retrieval systems use several levels of caching to speedup computation by exploiting frequent, recent or costly data used in the past. Previous studies show that the use of caching techniques is crucial in search engines, as it helps reducing query response times and processing workloads on search servers. In this work we propose and evaluate a static cache that acts simultaneously as list and intersection cache, offering a more efficient way of handling cache space. We also use a query resolution strategy that takes advantage of the existence of this cache to reorder the query execution sequence. In addition, we propose effective strategies to select the term pairs that should populate the cache. We also represent the data in cache in both raw and compressed forms and evaluate the differences between them using different configurations of cache sizes. The results show that the proposed Integrated Cache outperforms the standard posting lists cache in most of the cases, taking advantage not only of the intersection cache but also the query resolution strategy.