v1v2 (latest)

Support is All You Need for Certified VAE Training

International Conference on Learning Representations (ICLR), 2025

16 April 2025

ArXiv (abs)PDF HTML Github

Main:10 Pages

3 Figures

Bibliography:5 Pages

3 Tables

Appendix:6 Pages

Abstract

Variational Autoencoders (VAEs) have become increasingly popular and deployed in safety-critical applications. In such applications, we want to give certified probabilistic guarantees on performance under adversarial attacks. We propose a novel method, CIVET, for certified training of VAEs. CIVET depends on the key insight that we can bound worst-case VAE error by bounding the error on carefully chosen support sets at the latent layer. We show this point mathematically and present a novel training algorithm utilizing this insight. We show in an extensive evaluation across different datasets (in both the wireless and vision application areas), architectures, and perturbation magnitudes that our method outperforms SOTA methods achieving good standard performance with strong robustness guarantees.

View on arXiv

Comments on this paper