Badger: An Entropy-Based Web Search Clustering System with Randomization and Voting
| dc.contributor.author | Wang, Lidan | en_US |
| dc.contributor.author | Schulze, Chloe Whyte | en_US |
| dc.date.accessioned | 2012-03-15T17:19:30Z | |
| dc.date.available | 2012-03-15T17:19:30Z | |
| dc.date.created | 2005 | en_US |
| dc.date.issued | 2005 | en_US |
| dc.description.abstract | We have implemented and improved an entropy-based clustering algorithm. In addition to utilizing entropy as a clustering mechanism, our algorithm, Badger, uses randomization and a voting scheme to improve the quality of the resulting clusters. Using parsed web search result snippets, we have tested our algorithm and compared it against EigenCluster, a clustering meta-search engine developed by a research group at MIT. Our algorithm performs comparably to EigenCluster, but with slightly more overhead due to the extra work of the randomization step. We have found entropy to be a valid and interesting measure of document similarity and additionally we find it produces cohesive clusters. | en_US |
| dc.format.mimetype | application/pdf | en_US |
| dc.identifier.citation | TR1537 | en_US |
| dc.identifier.uri | http://digital.library.wisc.edu/1793/60458 | |
| dc.publisher | University of Wisconsin-Madison Department of Computer Sciences | en_US |
| dc.title | Badger: An Entropy-Based Web Search Clustering System with Randomization and Voting | en_US |
| dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1