Disposable-key-based image encryption for collaborative learning of Vision Transformer

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024

11 August 2024

Main:4 Pages

5 Figures

Bibliography:2 Pages

2 Tables

Abstract

We propose a novel method for securely training the vision transformer (ViT) with sensitive data shared from multiple clients similar to privacy-preserving federated learning. In the proposed method, training images are independently encrypted by each client where encryption keys can be prepared by each client, and ViT is trained by using these encrypted images for the first time. The method allows clients not only to dispose of the keys but to also reduce the communication costs between a central server and the clients. In image classification experiments, we verify the effectiveness of the proposed method on the CIFAR-10 dataset in terms of classification accuracy and the use of restricted random permutation matrices.

View on arXiv

Comments on this paper