research-article

Growing a graph matching from a handful of seeds

Authors:
Ehsan Kazemi

School of Computer and Communication Sciences, EPFL

School of Computer and Communication Sciences, EPFL
View Profile

,
S. Hamed Hassani

ETHZ

ETHZ
View Profile

,
Matthias Grossglauser

School of Computer and Communication Sciences, EPFL

School of Computer and Communication Sciences, EPFL
View Profile

Proceedings of the VLDB Endowment Volume 8 Issue 10pp 1010–1021https://doi.org/10.14778/2794367.2794371

Published:01 June 2015Publication History

Proceedings of the VLDB Endowment

Abstract

In many graph--mining problems, two networks from different domains have to be matched. In the absence of reliable node attributes, graph matching has to rely on only the link structures of the two networks, which amounts to a generalization of the classic graph isomorphism problem. Graph matching has applications in social--network reconciliation and de-anonymization, protein--network alignment in biology, and computer vision.

The most scalable graph--matching approaches use ideas from percolation theory, where a matched node pair "infects" neighbouring pairs as additional potential matches. This class of matching algorithm requires an initial seed set of known matches to start the percolation. The size and correctness of the matching is very sensitive to the size of the seed set.

In this paper, we give a new graph--matching algorithm that can operate with a much smaller seed set than previous approaches, with only a small increase in matching errors. We characterize a phase transition in matching performance as a function of the seed set size, using a random bigraph model and ideas from bootstrap percolation theory. We also show the excellent performance in matching several real large-scale social networks, using only a handful of seeds.

References

F. Abel, N. Henze, E. Herder, and D. Krause. Interweaving Public User Profiles on the Web. In UMAP, HI, USA, pages 16--27, September 2010. Google ScholarDigital Library
L. Babai, P. Erdös, and S. M. Selkow. Random Graph Isomorphism. SIAM J. Comput., 9(3):628--635, 1980.Google ScholarCross Ref
L. Backstrom, C. Dwork, and J. M. Kleinberg. Wherefore art thou R3579X?: Anonymized Social Networks, Hidden Patterns, and Structural Steganography. Commun. ACM, 54(12):133--141, 2011. Google ScholarDigital Library
A.-L. Barabási and R. Albert. Emergence of Scaling in Random Networks. science, 286(5439):509--512, 1999.Google Scholar
B. Bollobás and O. Riordan. The Diameter of a Scale-Free Random Graph. Combinatorica, 24(1):5--34, 2004. Google ScholarDigital Library
F. Chung and L. Lu. Connected Components in Random Graphs with Given Expected Degree Sequences. Annals of Combinatorics, 6(2):125--145, 2002.Google ScholarCross Ref
A. Egozi, Y. Keller, and H. Guterman. A Probabilistic Approach to Spectral Graph Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(1):18--27, 2013. Google ScholarDigital Library
P. Erdös and A. Rényi. On random graphs I. Publ. Math. Debrecen, 6:290--297, 1959.Google ScholarCross Ref
M. R. Garey and D. S. Johnson. Computers and Intractability; A Guide to the Theory of NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1990. Google ScholarDigital Library
K. Henderson, B. Gallagher, L. Li, L. Akoglu, T. Eliassi-Rad, H. Tong, and C. Faloutsos. It's Who You Know: Graph Mining Using Recursive Structural Features. In SIGKDD, San Diego, CA, USA, pages 663--671, August 2011. Google ScholarDigital Library
S. Janson, T. Luczak, T. Turova, and T. Vallier. Bootstrap Percolation On The Random Graph G_n,p. The Annals of Applied Probability, 22(5):1989--2047, 2012.Google ScholarCross Ref
R. M. Karp, C. Schindelhauer, S. Shenker, and B. Vöcking. Randomized Rumor Spreading. In FOCS, Redondo Beach, California, USA, pages 565--574, November 2000. Google ScholarDigital Library
G. Klau. A New Graph--Based Method for Pairwise Global Network Alignment. BMC Bioinformatics, 10(Suppl 1):S59, 2009.Google ScholarCross Ref
N. Korula and S. Lattanzi. An Efficient Reconciliation Algorithm for Social Networks. PVLDB, 7(5):377--388, 2014. Google ScholarDigital Library
J. Leskovec. Stanford Network Analysis Project. http://snap.stanford.edu/index.html.Google Scholar
A. Malhotra, L. C. Totti, W. M. Jr., P. Kumaraguru, and V. Almeida. Studying User Footprints in Different Online Social Networks. In ASONAM, Istanbul, Turkey, pages 1065--1070, August 2012. Google ScholarDigital Library
S. Melnik, H. Garcia-Molina, and E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching. In ICDE, San Jose, CA, USA, pages 117--128, 2002. Google ScholarDigital Library
A. Mislove, B. Viswanath, P. K. Gummadi, and P. Druschel. You Are Who You Know: Inferring User Profiles in Online Social Networks. In WSDM, New York, NY, USA, pages 251--260, February 2010. Google ScholarDigital Library
A. Narayanan and V. Shmatikov. De-anonymizing Social Networks. In S&P, Oakland, California, USA, pages 173--187, May 2009. Google ScholarDigital Library
A. Nunes, P. Calado, and B. Martins. Resolving User Identities Over Social Networks Through Supervised Learning and Rich Similarity Features. In SAC, Riva, Trento, Italy, pages 728--729, March 2012. Google ScholarDigital Library
P. Pedarsani, D. R. Figueiredo, and M. Grossglauser. A Bayesian Method for Matching Two Similar Graphs without Seeds. In Conference on Communication, Control, and Computing, Allerton Park, Monticello, IL, USA, pages 1598--1607, October 2013.Google ScholarCross Ref
P. Pedarsani and M. Grossglauser. On the Privacy of Anonymized Networks. In SIGKDD, San Diego, CA, USA, pages 1235--1243, August 2011. Google ScholarDigital Library
D. M. Powers. Evaluation: from Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1):37--63, 2011.Google ScholarCross Ref
D. Shah. Gossip Algorithms. Now Publishers Inc, 2009.Google Scholar
R. Singh, J. Xu, and B. Berger. Global alignment of multiple protein interaction networks with application to functional orthology detection. Proceedings of the National Academy of Sciences, (35):12763--12768.Google Scholar
L. Torresani, V. Kolmogorov, and C. Rother. Feature Correspondence Via Graph Matching: Models and Global Optimization. In ECCV, Marseille, France, pages 596--609, October 2008. Google ScholarDigital Library
B. Von Bahr and A. Martin-Löf. Threshold Limit Theorems for Some Epidemic Processes. Advances in Applied Probability, pages 319--349, 1980.Google Scholar
L. Wiskott, J.-M. Fellous, N. Kuiger, and C. Von Der Malsburg. Face Recognition by Elastic Bunch Braph Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(7):775--779, 1997. Google ScholarDigital Library
G. Wondracek, T. Holz, E. Kirda, and C. Kruegel. A practical attack to de-anonymize social network users. In S&P, Berleley/Oakland, California, USA, pages 223--238, May 2010. Google ScholarDigital Library
L. Yartseva and M. Grossglauser. On the performance of percolation graph matching. In COSN, Boston, MA, USA, pages 119--130, October 2013. Google ScholarDigital Library

Index Terms

Growing a graph matching from a handful of seeds

Recommendations

Graph matching with partially-correct seeds

Graph matching aims to find the latent vertex correspondence between two edge-correlated graphs and has found numerous applications across different fields. In this paper, we study a seeded graph matching problem, which assumes that a set of seeds, i.e., ...
Read More
Connectivity of Matching Graph of Hypercube

The matching graph $\mathcal{M}(G)$ of a graph $G$ has a vertex set of all perfect matchings of $G$, with two vertices being adjacent whenever the union of the corresponding perfect matchings forms a Hamiltonian cycle. We prove that the matching graph $\...
Read More
Many-to-many graph matching: a continuous relaxation approach
ECMLPKDD'10: Proceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part III

Graphs provide an efficient tool for object representation in various machine learning applications. Once graph-based representations are constructed, an important question is how to compare graphs. This problem is often formulated as a graph matching ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
Proceedings of the VLDB Endowment Volume 8, Issue 10
June 2015
168 pages
ISSN:2150-8097
Editors:
Chen Li
University of California, Irvine
,
Volker Markl
TU Berlin
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
VLDB Endowment
Publication History
- Published: 1 June 2015
Published in pvldb Volume 8, Issue 10
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 32
  Total Citations
  View Citations
- 376
  Total Downloads
- Downloads (Last 12 months)34
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Growing a graph matching from a handful of seeds

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Graph matching with partially-correct seeds

Connectivity of Matching Graph of Hypercube

Many-to-many graph matching: a continuous relaxation approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Growing a graph matching from a handful of seeds

Proceedings of the VLDB Endowment

Abstract

References

Cited By

Index Terms

Recommendations

Graph matching with partially-correct seeds

Connectivity of Matching Graph of Hypercube

Many-to-many graph matching: a continuous relaxation approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media