<XML><RECORDS><RECORD><REFERENCE_TYPE>3</REFERENCE_TYPE><REFNUM>8806</REFNUM><AUTHORS><AUTHOR>Balog,K.</AUTHOR><AUTHOR>Azzopardi,L.</AUTHOR><AUTHOR>de Rijke,M.</AUTHOR></AUTHORS><YEAR>2008</YEAR><TITLE>Personal Name Resolution of Web People Search</TITLE><PLACE_PUBLISHED>Proceedings of the WWW 2008 Workshop: NLP Challenges in the Information Explosion Era (NLPIX 2008)</PLACE_PUBLISHED><PUBLISHER>N/A</PUBLISHER><LABEL>Balog:2008:8806</LABEL><ABSTRACT>Disambiguating personal names in a set of documents (such as a set of web pages returned in response to a person name) is a difficult and challenging task. In this paper, we explore the extent to which the “cluster hypothesis” for this task holds (i.e., that similar documents tend to represent the same person). We explore two clustering techniques which used either (1) term based matching (single pass clustering) or (2) semantic based matching (Probabilistic Latent Semantic Analysis). We compare and contrast these strategies and provide strong evidence to suggest that the hypothesis holds for the former. And in fact, on the new evaluation platform of the SemEval 2007 Web People Search task, we show that using single pass clustering with a standard IR document representations fits well with the assumptions about the data and the task which yields state-of-the-art performance.</ABSTRACT></RECORD></RECORDS></XML>