2019
Conference article  Open Access

Exploiting CNN layer activations to improve adversarial image classification

Caldelli R., Becarelli R., Carrara F., Falchi F., Amato G.

adversarial detection, neural networks, layer activations, adversarial images

Neural networks are now used in many sectors of our daily life thanks to the efficient solutions such instruments provide for diverse tasks. Leaving artificial intelligence to make choices on behalf of humans, however, inevitably exposes these tools to fraudulent attacks. In fact, adversarial examples, intentionally crafted to fool a neural network, can dangerously induce a misclassification while appearing innocuous to a human observer. On this basis, this paper focuses on the problem of image classification and proposes an analysis to gain better insight into what happens inside a convolutional neural network (CNN) when it evaluates an adversarial example. In particular, the activations of the internal network layers are analyzed and exploited to design possible countermeasures that reduce CNN vulnerability. Experimental results confirm that layer activations can be used to detect adversarial inputs.
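
As a rough illustration of the kind of signal the paper leverages, the following is a minimal sketch of collecting internal CNN layer activations with PyTorch forward hooks. The model (ResNet-18), the choice of hooked layers, and the pooled summary are illustrative assumptions, not the authors' exact pipeline.

	# Minimal sketch: collect internal layer activations from a CNN.
	# Model and layer choices are illustrative, not the paper's setup.
	import torch
	import torchvision.models as models

	model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
	model.eval()

	activations = {}

	def make_hook(name):
	    def hook(module, inputs, output):
	        # Store a compact summary: globally average-pooled feature map.
	        activations[name] = output.detach().mean(dim=(2, 3))
	    return hook

	# Register forward hooks on a few internal stages (illustrative choice).
	layer_names = ["layer1", "layer2", "layer3", "layer4"]
	for name in layer_names:
	    getattr(model, name).register_forward_hook(make_hook(name))

	x = torch.randn(1, 3, 224, 224)  # placeholder input image batch
	with torch.no_grad():
	    logits = model(x)

	# Concatenate per-layer summaries into one feature vector.
	feature = torch.cat([activations[n] for n in layer_names], dim=1)
	print(feature.shape)  # e.g., torch.Size([1, 960]) for ResNet-18

A separate detector trained on such per-layer feature vectors could then be used to flag adversarial inputs, in the spirit of the countermeasures the paper discusses.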

Source: ICIP 2019 - IEEE International Conference on Image Processing, pp. 2289–2293, Taipei, Taiwan, 22–25 September 2019

Publisher: Institute of Electrical and Electronics Engineers (IEEE), Red Hook, NY, United States of America


BibTeX entry
@inproceedings{oai:it.cnr:prodotti:422758,
	title = {Exploiting CNN layer activations to improve adversarial image classification},
	author = {Caldelli R. and Becarelli R. and Carrara F. and Falchi F. and Amato G.},
	publisher = {Institute of Electrical and Electronics Engineers (IEEE), Red Hook, NY, United States of America},
	doi = {10.1109/icip.2019.8803776},
	booktitle = {ICIP 2019 - IEEE International Conference on Image Processing},
	pages = {2289--2293},
	address = {Taipei, Taiwan},
	year = {2019}
}