论文标题
捍卫SVM免受中毒攻击:硬度和DBSCAN方法
Defending SVMs against Poisoning Attacks: the Hardness and DBSCAN Approach
论文作者
论文摘要
近年来,对抗机器学习引起了很多关注。在中毒攻击中,对手可以将少量精心制作的样本注入训练数据中,从而使决策边界严重偏离并导致意外的错误分类。由于支持向量机(SVM)的重要性和大众使用,我们考虑捍卫SVM免受本文中毒的攻击。我们研究了捍卫的两种常用策略:设计强大的SVM算法和数据消毒。尽管以前已经提出过几种强大的SVM算法,但其中大多数要么缺乏对抗性弹药,要么依靠对数据分布或攻击者行为的强有力的假设。此外,对其复杂性的研究仍然非常有限。据我们所知,我们是第一个证明即使是最简单的硬质量单级SVM具有异常值问题的SVM也是NP完整的,除非p $ = $ np,否则没有完全的PTA(这意味着很难实现甚至达到近似近似算法的算法)。对于数据消毒防御,我们将其链接到数据的内在维度;特别是,我们为指标提供了一个抽样定理,以解释DBSCAN(作为基于密度的基于密度的离群方法)的有效性来防御中毒攻击。在我们的经验实验中,我们比较了包括DBSCAN和鲁棒SVM方法在内的几种防御措施,并研究了固有维度和数据密度与其性能的影响。
Adversarial machine learning has attracted a great amount of attention in recent years. In a poisoning attack, the adversary can inject a small number of specially crafted samples into the training data which make the decision boundary severely deviate and cause unexpected misclassification. Due to the great importance and popular use of support vector machines (SVM), we consider defending SVM against poisoning attacks in this paper. We study two commonly used strategies for defending: designing robust SVM algorithms and data sanitization. Though several robust SVM algorithms have been proposed before, most of them either are in lack of adversarial-resilience, or rely on strong assumptions about the data distribution or the attacker's behavior. Moreover, the research on their complexities is still quite limited. We are the first, to the best of our knowledge, to prove that even the simplest hard-margin one-class SVM with outliers problem is NP-complete, and has no fully PTAS unless P$=$NP (that means it is hard to achieve an even approximate algorithm). For the data sanitization defense, we link it to the intrinsic dimensionality of data; in particular, we provide a sampling theorem in doubling metrics for explaining the effectiveness of DBSCAN (as a density-based outlier removal method) for defending against poisoning attacks. In our empirical experiments, we compare several defenses including the DBSCAN and robust SVM methods, and investigate the influences from the intrinsic dimensionality and data density to their performances.