Large Scale Kernel Regression via Linear Programming

Loading...
Thumbnail Image

Date

Authors

Musicant, David
Mangasarian, Olvi

Advisors

License

DOI

Type

Technical Report

Journal Title

Journal ISSN

Volume Title

Publisher

Grantor

Abstract

The problem of tolerant data tting by a nonlinear surface, in- duced by a kernel-based support vector machine [24], is formulated as a linear program with fewer number of variables than that of other linear programming formulations [21]. A generalization of the lin- ear programming chunking algorithm [2] for arbitrary kernels [13] is implemented for solving problems with very large datasets wherein chunking is performed on both data points and problem variables. The proposed approach tolerates a small error, which is adjusted paramet- rically, while tting the given data. This leads to improved tting of noisy data (over ordinary least error solutions) as demonstrated com- putationally. Comparative numerical results indicate an average time reduction as high as 26.0% over other formulations, with a maximal time reduction of 79.7%. Additionally, linear programs with as many as 16,000 data points and more than a billion nonzero matrix elements are solved.

Description

Related Material and Data

Citation

99-02

Sponsorship

Endorsement

Review

Supplemented By

Referenced By