eprintid: 1450 rev_number: 14 eprint_status: archive userid: 43 dir: disk0/00/00/14/50 datestamp: 2012-12-19 09:37:52 lastmod: 2013-03-12 14:57:00 status_changed: 2012-12-19 09:37:52 type: book_section metadata_visibility: show creators_name: Campinas, Stephane creators_name: Ceccarelli, Diego creators_name: Perry, Thomas E. creators_name: Delbru, Renaud creators_name: Balog, Krisztian creators_name: Tummarello, Giovanni creators_id: creators_id: diego.ceccarelli@imtlucca.it creators_id: creators_id: creators_id: creators_id: title: The Sindice-2011 Dataset for Entity-Oriented Search in the Web of Data ispublished: pub subjects: QA75 divisions: CSA full_text_status: none keywords: Entity search, Web of Data, Entity corpus note: Proceedings of the ACM SIGIR Workshop “Entity-Oriented Search (EOS)” held in conjunction with the 34th Annual International ACM SIGIR Conference 28 July 2011, Beijing, China abstract: The task of entity retrieval becomes increasingly prevalent as more and more (semi-) structured information about objects is available on the Web in the form of documents embedding metadata (RDF,RDFa, Microformats, and others). However, research and development in that direction is dependent on (1) the availability of a representative corpus of entities that are found on the Web, and (2) the availability of an entity-oriented search infrastructure for experimenting with new retrieval models. In this paper, we introduce the Sindice-2011 data collection which is derived from data collected by the Sindice semantic search engine. The data collection (available at http://data.sindice.com/trec2011/) is especially designed for supporting research in the domain of web entity retrieval. We describe how the corpus is organised, discuss statistics of the data collection, and introduce a search infrastructure to foster research and development. date: 2011 date_type: published publisher: ACM pagerange: 26-32 pages: 79 refereed: TRUE isbn: 978-94-6186-000-2 book_title: Proceedings of the 1st International Workshop on Entity-Oriented Search (EOS) official_url: http://research.microsoft.com/en-us/um/beijing/events/eos2011/13.pdf citation: Campinas, Stephane and Ceccarelli, Diego and Perry, Thomas E. and Delbru, Renaud and Balog, Krisztian and Tummarello, Giovanni The Sindice-2011 Dataset for Entity-Oriented Search in the Web of Data. In: Proceedings of the 1st International Workshop on Entity-Oriented Search (EOS). ACM, pp. 26-32. ISBN 978-94-6186-000-2 (2011)