Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Type: Preprint

Publication Date: 2024-04-11

Citations: 0

DOI: https://doi.org/10.48550/arxiv.2404.08069

Abstract

There are a number of hypotheses underlying the existence of adversarial examples for classification problems. These include the high-dimensionality of the data, high codimension in the ambient space of the data manifolds of interest, and that the structure of machine learning models may encourage classifiers to develop decision boundaries close to data points. This article proposes a new framework for studying adversarial examples that does not depend directly on the distance to the decision boundary. Similarly to the smoothed classifier literature, we define a (natural or adversarial) data point to be $(\gamma,\sigma)$-stable if the probability of the same classification is at least $\gamma$ for points sampled in a Gaussian neighborhood of the point with a given standard deviation $\sigma$. We focus on studying the differences between persistence metrics along interpolants of natural and adversarial points. We show that adversarial examples have significantly lower persistence than natural examples for large neural networks in the context of the MNIST and ImageNet datasets. We connect this lack of persistence with decision boundary geometry by measuring angles of interpolants with respect to decision boundaries. Finally, we connect this approach with robustness by developing a manifold alignment gradient metric and demonstrating the increase in robustness that can be achieved when training with the addition of this metric.

Locations

  • arXiv (Cornell University) - View - PDF

Similar Works

Action Title Year Authors
+ On the Geometry of Adversarial Examples 2018 Marc Khoury
Dylan Hadfield-Menell
+ A Boundary Tilting Persepective on the Phenomenon of Adversarial Examples 2016 Thomas Tanay
Lewis D. Griffin
+ PDF Chat A Geometric Framework for Adversarial Vulnerability in Machine Learning 2024 Brian Bell
+ A Boundary Tilting Persepective on the Phenomenon of Adversarial Examples 2016 Thomas Tanay
Lewis D. Griffin
+ The Dimpled Manifold Model of Adversarial Examples in Machine Learning 2021 Adi Shamir
Odelia Melamed
Oriel BenShmuel
+ Understanding the Interaction of Adversarial Training with Noisy Labels 2021 Jianing Zhu
Jingfeng Zhang
Bo Han
Tongliang Liu
Gang Niu
Hongxia Yang
Mohan Kankanhalli
Masashi Sugiyama
+ Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness 2023 Ambar Pal
Jeremias Sulam
René Vidal
+ What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries 2022 Stanislav Fort
Ekin D. Cubuk
Surya Ganguli
Samuel S. Schoenholz
+ Adversarial Training with Voronoi Constraints 2019 Marc Khoury
Dylan Hadfield-Menell
+ Adversarial Training with Voronoi Constraints 2019 Marc Khoury
Dylan Hadfield-Menell
+ Unique properties of adversarially trained linear classifiers on Gaussian data 2020 Jamie Hayes
+ On the Robustness of Neural Collapse and the Neural Collapse of Robustness 2023 Jingtong Su
Ya Shi Zhang
Nikolaos Tsilivis
Julia Kempe
+ PDF Chat Adversarial Vulnerability as a Consequence of On-Manifold Inseparibility 2024 Rajdeep Haldar
Yue Xing
Qifan Song
Guang Lin
+ The Manifold Assumption and Defenses Against Adversarial Perturbations 2017 Xi Wu
Uyeong Jang
Lingjiao Chen
Somesh Jha
+ When adversarial examples are excusable 2022 Pieter-Jan Kindermans
Charles Staats
+ Origins of Low-dimensional Adversarial Perturbations 2022 Elvis Dohmatob
Chuan Guo
Morgane Goibert
+ Decision boundaries and convex hulls in the feature space that deep learning functions learn from images 2022 Roozbeh Yousefzadeh
+ Adversarial Spheres 2018 Justin Gilmer
Luke Metz
Fartash Faghri
Samuel S. Schoenholz
Maithra Raghu
Martin Wattenberg
Ian Goodfellow
+ A Theoretical Framework for Robustness of (Deep) Classifiers against Adversarial Examples 2016 Beilun Wang
Ji Gao
Yanjun Qi
+ PDF Chat Decision boundaries and convex hulls in the feature space that deep learning functions learn from images 2022 Roozbeh Yousefzadeh

Works That Cite This (0)

Action Title Year Authors

Works Cited by This (0)

Action Title Year Authors