<XML><RECORDS><RECORD><REFERENCE_TYPE>3</REFERENCE_TYPE><REFNUM>8699</REFNUM><AUTHORS><AUTHOR>Klampanos,I.A.</AUTHOR><AUTHOR>Jose,J.M.</AUTHOR><AUTHOR>Van Rijsbergen,C.J.</AUTHOR></AUTHORS><YEAR>2006</YEAR><TITLE>Single-pass clustering for peer-to-peer information retrieval: the effect of document ordering</TITLE><PLACE_PUBLISHED>INFOSCALE 2006</PLACE_PUBLISHED><PUBLISHER>N/A</PUBLISHER><LABEL>Klampanos:2006:8699</LABEL><KEYWORDS><KEYWORD>single-pass</KEYWORD></KEYWORDS<ABSTRACT>Document clustering has been a particularly active research field within the Information Retrieval (IR) community. Among the numerous clustering algorithms proposed, single-pass clustering stands out in terms of both time and space efficiency. However, it is generally acknowledged that single-pass clustering has a major defect, namely its output depends on the order in which documents are presented. Building on our previous work, and having identified single-pass clustering as potentially useful for P2P IR, we study the extent to which this is true in practical terms. We do so by experimenting with two large web-based testbeds, which are suitable for Peer-to-Peer IR evaluation. The results of our study show that document ordering does not practically matter for single-pass clustering.</ABSTRACT></RECORD></RECORDS></XML>