Chunking-Synthetic Approaches to Large-Scale Kernel Machines

Authors

Meyer, Robert
Gonzalez-Castano, Francisco

Type

Technical Report

Abstract

We consider a kernel-based approach to nonlinear classification that combines the generation of "synthetic" points (to be used in the kernel) with "chunking" (working with subsets of the data) in order to significantly reduce the size of the optimization problems required to construct classifiers for massive datasets. Rather than solving a single massive classification problem involving all points in the training set, we employ a series of problems that gradually increase in size and which consider kernels based on small numbers of synthetic points. These synthetic points are generated by solving relatively small nonlinear unconstrained optimization problems. In addition to greatly reducing optimization problem size, the procedure that we describe also has the advantage of being easily parallelized. Computational results show that our method efficiently generates high-performance classifiers on a variety of problems involving both real and randomly generated datasets.
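
The general idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' formulation: the Gaussian kernel, the logistic loss, plain gradient descent, the chunk schedule, and every function and parameter name below (`fit_chunk`, `train_chunked`, `n_synth`, `gamma`, `chunk_sizes`) are assumptions chosen for brevity. The sketch only shows a classifier whose kernel is built on a handful of movable synthetic points and which is refit on chunks of increasing size.

```python
import math
import random

def gaussian_kernel(x, s, gamma):
    """K(x, s) = exp(-gamma * ||x - s||^2)."""
    return math.exp(-gamma * sum((xi - si) ** 2 for xi, si in zip(x, s)))

def score(x, synth, w, b, gamma):
    """Decision value f(x) = sum_j w_j * K(x, s_j) + b."""
    return sum(wj * gaussian_kernel(x, sj, gamma) for wj, sj in zip(w, synth)) + b

def fit_chunk(chunk, synth, w, b, gamma, lr=0.5, epochs=200):
    """Gradient descent on the mean logistic loss over one chunk.

    The variables are the weights, the bias, and the synthetic points
    themselves, so each chunk poses a small unconstrained nonlinear problem.
    (Assumed loss and optimizer; the report may use different ones.)
    """
    n = len(chunk)
    for _ in range(epochs):
        gw = [0.0] * len(w)
        gb = 0.0
        gs = [[0.0] * len(s) for s in synth]
        for x, y in chunk:  # labels y are +1 or -1
            k = [gaussian_kernel(x, s, gamma) for s in synth]
            f = sum(wj * kj for wj, kj in zip(w, k)) + b
            z = max(-50.0, min(50.0, y * f))  # clamp to avoid overflow in exp
            g = -y / (1.0 + math.exp(z))      # d/df of log(1 + exp(-y * f))
            gb += g
            for j, s in enumerate(synth):
                gw[j] += g * k[j]
                for d in range(len(s)):
                    # dK/ds_d = 2 * gamma * (x_d - s_d) * K
                    gs[j][d] += g * w[j] * 2.0 * gamma * (x[d] - s[d]) * k[j]
        w = [wj - lr * gj / n for wj, gj in zip(w, gw)]
        b -= lr * gb / n
        synth = [[sd - lr * gd / n for sd, gd in zip(s, g_s)]
                 for s, g_s in zip(synth, gs)]
    return synth, w, b

def train_chunked(data, n_synth=4, gamma=0.5, chunk_sizes=(20, 40, 80), seed=0):
    """Refit the same small model on chunks of gradually increasing size."""
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)
    # Assumed initialisation: start synthetic points at random training points.
    synth = [list(x) for x, _ in rng.sample(shuffled, n_synth)]
    w, b = [0.0] * n_synth, 0.0
    for size in chunk_sizes:
        synth, w, b = fit_chunk(shuffled[:size], synth, w, b, gamma)
    return synth, w, b

def accuracy(data, synth, w, b, gamma):
    """Fraction of points whose predicted sign matches the label."""
    hits = sum(1 for x, y in data if (score(x, synth, w, b, gamma) >= 0) == (y > 0))
    return hits / len(data)
```

In this sketch the number of optimization variables depends on `n_synth` and the input dimension rather than on the number of training points, which is the size reduction the abstract refers to. Independent chunks could in principle be fit in parallel, but that is not shown here.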

Citation

00-04
