eprintid: 1449
rev_number: 10
eprint_status: archive
userid: 43
dir: disk0/00/00/14/49
datestamp: 2012-12-19 09:15:48
lastmod: 2013-03-12 14:57:01
status_changed: 2012-12-19 09:15:48
type: book_section
metadata_visibility: show
creators_name: Ceccarelli, Diego
creators_name: Lucchese, Claudio
creators_name: Orlando, Salvatore
creators_name: Perego, Raffaele
creators_name: Silvestri, Fabrizio
creators_id: diego.ceccarelli@imtlucca.it
creators_id:
creators_id:
creators_id:
creators_id:
title: Caching query-biased snippets for efficient retrieval
ispublished: pub
subjects: QA75
divisions: EIC
full_text_status: none
keywords: caching, efficiency, snippet generation, throughput, web search engines
note: EDBT/ICDT 2011 - 14th International Conference on Extending Database Technology - March 21-25, 2011, Uppsala, Sweden
abstract: Web Search Engines' result pages contain references to the top-k documents relevant for the query submitted by a user. Each document is represented by a title, a snippet and a URL. Snippets, i.e. short sentences showing the portions of the document relevant to the query, help users to select the most interesting results. The snippet generation process is very expensive, since it may require accessing a number of documents for each issued query. We assert that caching, a popular technique used to enhance performance at various levels of any computing system, can be very effective in this context. We design and experiment with several cache organizations, and we introduce the concept of supersnippet, that is the set of sentences in a document that are more likely to answer future queries. We show that supersnippets can be built by exploiting query logs, and that in our experiments a supersnippet cache answers up to 62% of the requests, remarkably outperforming other caching approaches.
date: 2011
date_type: published
publisher: EDBT/ICDT
pagerange: 93-104
event_title: Proceedings of the 14th International Conference on Extending Database Technology
refereed: TRUE
isbn: 978-1-4503-0528-0
book_title: Proceedings of EDBT ’11: 14th International Conference on Extending Database Technology
official_url: http://www.edbt.org/Proceedings/2011-Uppsala/papers/edbt/a10-ceccarelli.pdf
citation: Ceccarelli, Diego and Lucchese, Claudio and Orlando, Salvatore and Perego, Raffaele and Silvestri, Fabrizio Caching query-biased snippets for efficient retrieval. In: Proceedings of EDBT ’11: 14th International Conference on Extending Database Technology. EDBT/ICDT, pp. 93-104. ISBN 978-1-4503-0528-0 (2011)