30
7

The Power of Subsampling in Submodular Maximization

Abstract

We propose subsampling as a unified algorithmic technique for submodular maximization in centralized and online settings. The idea is simple: independently sample elements from the ground set, and use simple combinatorial techniques (such as greedy or local search) on these sampled elements. We show that this approach leads to optimal/state-of-the-art results despite being much simpler than existing methods. In the usual offline setting, we present SampleGreedy, which obtains a (p+2+o(1))(p + 2 + o(1))-approximation for maximizing a submodular function subject to a pp-extendible system using O(n+nk/p)O(n + nk/p) evaluation and feasibility queries, where kk is the size of the largest feasible set. The approximation ratio improves to p+1p+1 and pp for monotone submodular and linear objectives, respectively. In the streaming setting, we present SampleStreaming, which obtains a (4p+2o(1))(4p +2 - o(1))-approximation for maximizing a submodular function subject to a pp-matchoid using O(k)O(k) memory and O(km/p)O(km/p) evaluation and feasibility queries per element, where mm is the number of matroids defining the pp-matchoid. The approximation ratio improves to 4p4p for monotone submodular objectives. We empirically demonstrate the effectiveness of our algorithms on video summarization, location summarization, and movie recommendation tasks.

View on arXiv
Comments on this paper