303
v1v2 (latest)

Support is All You Need for Certified VAE Training

International Conference on Learning Representations (ICLR), 2025
Main:10 Pages
3 Figures
Bibliography:5 Pages
3 Tables
Appendix:6 Pages
Abstract

Variational Autoencoders (VAEs) have become increasingly popular and deployed in safety-critical applications. In such applications, we want to give certified probabilistic guarantees on performance under adversarial attacks. We propose a novel method, CIVET, for certified training of VAEs. CIVET depends on the key insight that we can bound worst-case VAE error by bounding the error on carefully chosen support sets at the latent layer. We show this point mathematically and present a novel training algorithm utilizing this insight. We show in an extensive evaluation across different datasets (in both the wireless and vision application areas), architectures, and perturbation magnitudes that our method outperforms SOTA methods achieving good standard performance with strong robustness guarantees.

View on arXiv
Comments on this paper