Disposable-key-based image encryption for collaborative learning of
Vision Transformer
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2024
- ViTFedML
Main:4 Pages
5 Figures
Bibliography:2 Pages
2 Tables
Abstract
We propose a novel method for securely training the vision transformer (ViT) with sensitive data shared from multiple clients similar to privacy-preserving federated learning. In the proposed method, training images are independently encrypted by each client where encryption keys can be prepared by each client, and ViT is trained by using these encrypted images for the first time. The method allows clients not only to dispose of the keys but to also reduce the communication costs between a central server and the clients. In image classification experiments, we verify the effectiveness of the proposed method on the CIFAR-10 dataset in terms of classification accuracy and the use of restricted random permutation matrices.
View on arXivComments on this paper
