Supercharging Federated Intelligence Retrieval
Dimitris Stripelis
Patrick Foley
Mohammad Naseri
William Lindskog-Münzing
Chong Shen Ng
Daniel Janes Beutel
Nicholas D. Lane
- FedMLSILM
Main:3 Pages
1 Figures
Bibliography:2 Pages
2 Tables
Appendix:1 Pages
Abstract
RAG typically assumes centralized access to documents, which breaks down when knowledge is distributed across private data silos. We propose a secure Federated RAG system built using Flower that performs local silo retrieval, while server-side aggregation and text generation run inside an attested, confidential compute environment, enabling confidential remote LLM inference even in the presence of honest-but-curious or compromised servers. We also propose a cascading inference approach that incorporates a non-confidential third-party model (e.g., Amazon Nova) as auxiliary context without weakening confidentiality.
View on arXivComments on this paper
