Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

mei, shike; zhu, xiaojin

Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

Files

Primary TR1819.pdf (455.66 KB)

Date

2014-11-16

Authors

mei, shike

zhu, xiaojin

Type

Technical Report

Abstract

We investigate a problem at the intersection of machine learning and security: training-set attacks on machine learners. In such attacks an attacker contaminates the training data so that a specific learning algorithm would produce a model profitable to the attacker. Understanding training-set attacks is important as more intelligent agents (e.g. spam filters and robots) are equipped with learning capability and can potentially be hacked via data they receive from the environment. This paper identifies the optimal training-set attack on a broad family of machine learners. First we show that optimal training-set attack can be formulated as a bilevel optimization problem. Then we show that for machine learners with certain Karush-Kuhn-Tucker conditions we can solve the bilevel problem efficiently using gradient methods on an implicit function. As examples, we demonstrate optimal trainingset attacks on Support Vector Machines, logistic regression, and linear regression with extensive experiments. Finally, we discuss potential defenses against such attacks.

Citation

TR1819

URI

http://digital.library.wisc.edu/1793/72617

Collections

CS Technical Reports

Full item page

Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

Files

Date

Authors

Advisors

License

DOI

Type

Journal Title

Journal ISSN

Volume Title

Publisher

Grantor

Abstract

Description

Keywords

Related Material and Data

Citation

Sponsorship

URI

Collections

Endorsement

Review

Supplemented By

Referenced By