ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.02215
56
40
v1v2 (latest)

Distributed Estimation of Graph 4-Profiles

8 October 2015
Ethan R. Elenberg
Karthikeyan Shanmugam
Michael Borokhovich
A. Dimakis
ArXiv (abs)PDFHTML
Abstract

We present a novel distributed algorithm for counting all four-node induced subgraphs in a big graph. These counts, called the 444-profile, describe a graph's connectivity properties and have found several uses ranging from bioinformatics to spam detection. We also study the more complicated problem of estimating the local 444-profiles centered at each vertex of the graph. The local 444-profile embeds every vertex in an 111111-dimensional space that characterizes the local geometry of its neighborhood: vertices that connect different clusters will have different local 444-profiles compared to those that are only part of one dense cluster. Our algorithm is a local, distributed message-passing scheme on the graph and computes all the local 444-profiles in parallel. We rely on two novel theoretical contributions: we show that local 444-profiles can be calculated using compressed two-hop information and also establish novel concentration results that show that graphs can be substantially sparsified and still retain good approximation quality for the global 444-profile. We empirically evaluate our algorithm using a distributed GraphLab implementation that we scaled up to 640640640 cores. We show that our algorithm can compute global and local 444-profiles of graphs with millions of edges in a few minutes, significantly improving upon the previous state of the art.

View on arXiv
Comments on this paper