ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.18086
262
14
v1v2v3 (latest)

Differentially Private Stream Processing at Scale

Proceedings of the VLDB Endowment (PVLDB), 2023
31 March 2023
Bing Zhang
Vadym Doroshenko
Peter Kairouz
Thomas Steinke
Abhradeep Thakurta
Zi-Tang Ma
Eidan Cohen
Himani Apte
ArXiv (abs)PDFHTML
Abstract

We design, to the best of our knowledge, the first differentially private (DP) stream processing system at scale. Our system --Differential Privacy SQL Pipelines (DP-SQLP)-- is built using a streaming framework similar to Spark streaming, and is built on top of the Spanner database and the F1 query engine from Google. Towards designing DP-SQLP we make both algorithmic and systemic advances, namely, we (i) design a novel DP key selection algorithm that can operate on an unbounded set of possible keys, and can scale to one billion keys that users have contributed, (ii) design a preemptive execution scheme for DP key selection that avoids enumerating all the keys at each triggering time, and (iii) use algorithmic techniques from DP continual observation to release a continual DP histogram of user contributions to different keys over the stream length. We empirically demonstrate the efficacy by obtaining at least 16×16\times16× reduction in error over meaningful baselines we consider.

View on arXiv
Comments on this paper