
Diagnosing Vulnerability of Variational Auto-Encoders to Adversarial Attacks

Abstract

In this work, we explore adversarial attacks on Variational Autoencoders (VAEs). We show how to modify a data point to obtain a prescribed latent code (supervised attack) or simply a drastically different code (unsupervised attack). We examine the influence of model modifications (β-VAE, NVAE) on the robustness of VAEs and suggest metrics to quantify it.
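
To make the two attack types concrete, here is a minimal sketch on a toy model. It is not the paper's method: the linear "encoder" `encode`, its weights `W` and `B`, the squared-distance objective on encoder means, the L-infinity budget `eps_max`, and the projected-gradient loop are all illustrative assumptions. The supervised attack pulls the latent code of the perturbed input toward the code of a chosen target; the unsupervised attack pushes it away from the original code.

```python
# Toy stand-in for a VAE encoder mean: mu(x) = W x + b.
# W and B are hypothetical weights chosen for illustration only.
W = [[1.0, -2.0, 0.5],
     [0.5, 1.0, -1.0]]
B = [0.1, -0.2]

def encode(x):
    """Latent mean of the toy linear encoder."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(W, B)]

def sq_dist(u, v):
    """Squared Euclidean distance between two latent codes."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def grad_wrt_input(x, z_ref):
    """Gradient of ||mu(x) - z_ref||^2 w.r.t. x: 2 W^T (mu(x) - z_ref)."""
    diff = [a - b for a, b in zip(encode(x), z_ref)]
    return [2.0 * sum(W[k][j] * diff[k] for k in range(len(W)))
            for j in range(len(x))]

def attack(x, z_ref, supervised, eps_max=0.1, step=0.01, iters=200):
    """Projected gradient attack under an assumed L-infinity budget eps_max.

    supervised=True  : descend, moving mu(x + eps) toward z_ref.
    supervised=False : ascend, moving mu(x + eps) away from z_ref.
    """
    sign = -1.0 if supervised else 1.0
    # The ascent gradient is exactly zero at eps = 0 when z_ref = mu(x),
    # so the unsupervised attack starts from a tiny nonzero perturbation.
    eps = [0.0] * len(x) if supervised else [1e-3] * len(x)
    for _ in range(iters):
        x_adv = [xi + e for xi, e in zip(x, eps)]
        g = grad_wrt_input(x_adv, z_ref)
        # Gradient step followed by clipping back into the budget.
        eps = [max(-eps_max, min(eps_max, e + sign * step * gi))
               for e, gi in zip(eps, g)]
    return [xi + e for xi, e in zip(x, eps)]

x = [0.5, 0.2, -0.3]          # clean input (made up)
x_target = [-0.5, 0.8, 0.1]   # input whose code the supervised attack aims for

x_sup = attack(x, encode(x_target), supervised=True)
x_unsup = attack(x, encode(x), supervised=False)
```

Comparing `sq_dist(encode(x_sup), encode(x_target))` with `sq_dist(encode(x), encode(x_target))` shows how far the supervised attack moved the code toward the target, while `sq_dist(encode(x_unsup), encode(x))` measures the latent shift caused by the unsupervised attack. Both quantities are only simple proxies and should not be read as the robustness metrics proposed in the paper.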
