The Power of Subsampling in Submodular Maximization

We propose subsampling as a unified algorithmic technique for submodular maximization in centralized and online settings. The idea is simple: independently sample elements from the ground set, and use simple combinatorial techniques (such as greedy or local search) on these sampled elements. We show that this approach leads to optimal/state-of-the-art results despite being much simpler than existing methods. In the usual offline setting, we present SampleGreedy, which obtains a -approximation for maximizing a submodular function subject to a -extendible system using evaluation and feasibility queries, where is the size of the largest feasible set. The approximation ratio improves to and for monotone submodular and linear objectives, respectively. In the streaming setting, we present SampleStreaming, which obtains a -approximation for maximizing a submodular function subject to a -matchoid using memory and evaluation and feasibility queries per element, where is the number of matroids defining the -matchoid. The approximation ratio improves to for monotone submodular objectives. We empirically demonstrate the effectiveness of our algorithms on video summarization, location summarization, and movie recommendation tasks.
View on arXiv