Massive Data Discrimination via Linear Suppot Vector Machines

dc.contributor.authorMangasarian, O.L.
dc.contributor.authorBradley, P.S.
dc.date.accessioned2013-06-28T19:09:55Z
dc.date.available2013-06-28T19:09:55Z
dc.date.issued1999-03-31
dc.description.abstractA linear support vector machine formulation is used to generate a fast, finitely-terminating linear-programming algorithm for discriminating between two massive sets in n-dimensional space, where the number of points can be orders of magnitude larger than n. The algorithm creates a succession of sufficiently small linear programs that separate chunks of the data at a time. The key idea is that a small number of support vectors, corresponding to linear programming constrains with positive dual variables, are carried over between the successive small linear programs, each of which containing a chunk of the data. We prove that this procedure is monotonic and terminates in a finite number of steps at an exact solution leads to an optimal separating plane for the entire data set. Numerical results on full dense publicly available datasets, number 20,000 to 1 million points in 32-dimensional space, confirm the theoretical results and demonstrate the ability to handle very large problems.en
dc.identifier.citation98-05en
dc.identifier.urihttp://digital.library.wisc.edu/1793/66093
dc.subjectlinear programming chunkingen
dc.subjectsupport vector machinesen
dc.titleMassive Data Discrimination via Linear Suppot Vector Machinesen
dc.typeTechnical Reporten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
98-05.pdf
Size:
180.55 KB
Format:
Adobe Portable Document Format
Description:
Massive Data Discrimination via Linear Support Vector Machines

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.03 KB
Format:
Item-specific license agreed upon to submission
Description: