<XML><RECORDS><RECORD><REFERENCE_TYPE>31</REFERENCE_TYPE><REFNUM>8525</REFNUM><AUTHORS><AUTHOR>Azzopardi,L.</AUTHOR><AUTHOR>de Rijke,M.</AUTHOR><AUTHOR>Balog,K.</AUTHOR></AUTHORS><YEAR>2007</YEAR><TITLE>Building Simulated Queries for Known-Item Topics: An Analysis using Six European Languages</TITLE><PLACE_PUBLISHED>To appear in the Proceedings of the 30th Annual ACM Conference on Research and Development in Information Retrieval (SIGIR 2007)</PLACE_PUBLISHED><PUBLISHER>N/A</PUBLISHER><LABEL>Azzopardi:2007:8525</LABEL><KEYWORDS><KEYWORD>Simulation</KEYWORD></KEYWORDS<ABSTRACT>There has been increased interest in the use of simulated queries for evaluation and estimation purposes in Informa- tion Retrieval. However, there are still many unaddressed is-sues regarding their usage and impact on evaluation because their quality, in terms of retrieval performance, is unlike real queries. In this paper, we focus on methods for building simulated known-item topics and explore their quality against real known-item topics. Using existing generation models as our starting point, we explore factors which may influence the generation of the known-item topic. Informed by this detailed analysis (on six European languages) we propose a model with improved document and term selection properties, showing that simulated known-item topics can be generated that are comparable to real known-item topics. This is a significant step towards validating the potential usefulness of simulated queries: for evaluation purposes, and because building models of querying behavior provides a deeper insight into the querying process so that better retrieval mechanisms can be developed to support the user.</ABSTRACT></RECORD></RECORDS></XML>