arxiv:2306.06081

CARSO: Counter-Adversarial Recall of Synthetic Observations

Published on May 25, 2023

Authors:

Emanuele Ballarin ,

Alessio Ansuini ,

Abstract

In this paper, we propose a novel adversarial defence mechanism for image classification -- CARSO -- inspired by cues from cognitive neuroscience. The method is synergistically complementary to adversarial training and relies on knowledge of the internal representation of the attacked classifier. Exploiting a generative model for adversarial purification, conditioned on such representation, it samples reconstructions of inputs to be finally classified. Experimental evaluation by a well-established benchmark of varied, strong adaptive attacks, across diverse image datasets and classifier architectures, shows that CARSO is able to defend the classifier significantly better than state-of-the-art adversarial training alone -- with a tolerable clean accuracy toll. Furthermore, the defensive architecture succeeds in effectively shielding itself from unforeseen threats, and end-to-end attacks adapted to fool stochastic defences. Code and pre-trained models are available at https://github.com/emaballarin/CARSO .

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment

No dataset linking this paper

Cite arxiv.org/abs/2306.06081 in a dataset README.md to link it from this page.

No Space linking this paper

Cite arxiv.org/abs/2306.06081 in a Space README.md to link it from this page.

No Collection including this paper

Add this paper to a collection to link it from this page.