Dragging: Density-Ratio Bagging
| dc.contributor.author | Zhu, Xiaojin | |
| dc.contributor.author | Tan, Yimin | |
| dc.date.accessioned | 2013-06-07T21:29:47Z | |
| dc.date.available | 2013-06-07T21:29:47Z | |
| dc.date.issued | 2013-06-06 | |
| dc.description.abstract | We propose density-ratio bagging (dragging), a semi-supervised extension of bootstrap aggregation (bagging) method. Additional unlabeled training data are used to calculate the weight on each labeled training point by a density-ratio estimator. The weight is then used to construct a weighted labeled empirical distribution, from which bags of bootstrap samples are drawn. Asymptotically, dragging is proved to be no worse than bagging and requires no semi-supervised learning assumptions other than $iid$-ness. We show that compared to bagging, the dragging predictor achieves less asymptotic variance, which leads to a smaller MSE. We conduct real experiments on several regression and classification tasks. The performance of dragging, bagging, semi-supervised learning with density-ratio estimator, and basic supervised learning is compared and discussed. | en |
| dc.identifier.citation | TR1795 | en |
| dc.identifier.uri | http://digital.library.wisc.edu/1793/65831 | |
| dc.subject | bagging | en |
| dc.subject | density ratio | en |
| dc.subject | semi-supervised learning | en |
| dc.title | Dragging: Density-Ratio Bagging | en |
| dc.type | Technical Report | en |