Badger: An Entropy-Based Web Search Clustering System with Randomization and Voting

dc.contributor.authorWang, Lidanen_US
dc.contributor.authorSchulze, Chloe Whyteen_US
dc.date.accessioned2012-03-15T17:19:30Z
dc.date.available2012-03-15T17:19:30Z
dc.date.created2005en_US
dc.date.issued2005en_US
dc.description.abstractWe have implemented and improved an entropy-based clustering algorithm. In addition to utilizing entropy as a clustering mechanism, our algorithm, Badger, uses randomization and a voting scheme to improve the quality of the resulting clusters. Using parsed web search result snippets, we have tested our algorithm and compared it against EigenCluster, a clustering meta-search engine developed by a research group at MIT. Our algorithm performs comparably to EigenCluster, but with slightly more overhead due to the extra work of the randomization step. We have found entropy to be a valid and interesting measure of document similarity and additionally we find it produces cohesive clusters.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationTR1537en_US
dc.identifier.urihttp://digital.library.wisc.edu/1793/60458
dc.publisherUniversity of Wisconsin-Madison Department of Computer Sciencesen_US
dc.titleBadger: An Entropy-Based Web Search Clustering System with Randomization and Votingen_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR1537.pdf
Size:
1.15 MB
Format:
Adobe Portable Document Format