Abstract
In many graph--mining problems, two networks from different domains have to be matched. In the absence of reliable node attributes, graph matching has to rely on only the link structures of the two networks, which amounts to a generalization of the classic graph isomorphism problem. Graph matching has applications in social--network reconciliation and de-anonymization, protein--network alignment in biology, and computer vision.
The most scalable graph--matching approaches use ideas from percolation theory, where a matched node pair "infects" neighbouring pairs as additional potential matches. This class of matching algorithm requires an initial seed set of known matches to start the percolation. The size and correctness of the matching is very sensitive to the size of the seed set.
In this paper, we give a new graph--matching algorithm that can operate with a much smaller seed set than previous approaches, with only a small increase in matching errors. We characterize a phase transition in matching performance as a function of the seed set size, using a random bigraph model and ideas from bootstrap percolation theory. We also show the excellent performance in matching several real large-scale social networks, using only a handful of seeds.
- F. Abel, N. Henze, E. Herder, and D. Krause. Interweaving Public User Profiles on the Web. In UMAP, HI, USA, pages 16--27, September 2010. Google ScholarDigital Library
- L. Babai, P. Erdös, and S. M. Selkow. Random Graph Isomorphism. SIAM J. Comput., 9(3):628--635, 1980.Google ScholarCross Ref
- L. Backstrom, C. Dwork, and J. M. Kleinberg. Wherefore art thou R3579X?: Anonymized Social Networks, Hidden Patterns, and Structural Steganography. Commun. ACM, 54(12):133--141, 2011. Google ScholarDigital Library
- A.-L. Barabási and R. Albert. Emergence of Scaling in Random Networks. science, 286(5439):509--512, 1999.Google Scholar
- B. Bollobás and O. Riordan. The Diameter of a Scale-Free Random Graph. Combinatorica, 24(1):5--34, 2004. Google ScholarDigital Library
- F. Chung and L. Lu. Connected Components in Random Graphs with Given Expected Degree Sequences. Annals of Combinatorics, 6(2):125--145, 2002.Google ScholarCross Ref
- A. Egozi, Y. Keller, and H. Guterman. A Probabilistic Approach to Spectral Graph Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(1):18--27, 2013. Google ScholarDigital Library
- P. Erdös and A. Rényi. On random graphs I. Publ. Math. Debrecen, 6:290--297, 1959.Google ScholarCross Ref
- M. R. Garey and D. S. Johnson. Computers and Intractability; A Guide to the Theory of NP-Completeness. W. H. Freeman & Co., New York, NY, USA, 1990. Google ScholarDigital Library
- K. Henderson, B. Gallagher, L. Li, L. Akoglu, T. Eliassi-Rad, H. Tong, and C. Faloutsos. It's Who You Know: Graph Mining Using Recursive Structural Features. In SIGKDD, San Diego, CA, USA, pages 663--671, August 2011. Google ScholarDigital Library
- S. Janson, T. Luczak, T. Turova, and T. Vallier. Bootstrap Percolation On The Random Graph Gn,p. The Annals of Applied Probability, 22(5):1989--2047, 2012.Google ScholarCross Ref
- R. M. Karp, C. Schindelhauer, S. Shenker, and B. Vöcking. Randomized Rumor Spreading. In FOCS, Redondo Beach, California, USA, pages 565--574, November 2000. Google ScholarDigital Library
- G. Klau. A New Graph--Based Method for Pairwise Global Network Alignment. BMC Bioinformatics, 10(Suppl 1):S59, 2009.Google ScholarCross Ref
- N. Korula and S. Lattanzi. An Efficient Reconciliation Algorithm for Social Networks. PVLDB, 7(5):377--388, 2014. Google ScholarDigital Library
- J. Leskovec. Stanford Network Analysis Project. http://snap.stanford.edu/index.html.Google Scholar
- A. Malhotra, L. C. Totti, W. M. Jr., P. Kumaraguru, and V. Almeida. Studying User Footprints in Different Online Social Networks. In ASONAM, Istanbul, Turkey, pages 1065--1070, August 2012. Google ScholarDigital Library
- S. Melnik, H. Garcia-Molina, and E. Rahm. Similarity Flooding: A Versatile Graph Matching Algorithm and Its Application to Schema Matching. In ICDE, San Jose, CA, USA, pages 117--128, 2002. Google ScholarDigital Library
- A. Mislove, B. Viswanath, P. K. Gummadi, and P. Druschel. You Are Who You Know: Inferring User Profiles in Online Social Networks. In WSDM, New York, NY, USA, pages 251--260, February 2010. Google ScholarDigital Library
- A. Narayanan and V. Shmatikov. De-anonymizing Social Networks. In S&P, Oakland, California, USA, pages 173--187, May 2009. Google ScholarDigital Library
- A. Nunes, P. Calado, and B. Martins. Resolving User Identities Over Social Networks Through Supervised Learning and Rich Similarity Features. In SAC, Riva, Trento, Italy, pages 728--729, March 2012. Google ScholarDigital Library
- P. Pedarsani, D. R. Figueiredo, and M. Grossglauser. A Bayesian Method for Matching Two Similar Graphs without Seeds. In Conference on Communication, Control, and Computing, Allerton Park, Monticello, IL, USA, pages 1598--1607, October 2013.Google ScholarCross Ref
- P. Pedarsani and M. Grossglauser. On the Privacy of Anonymized Networks. In SIGKDD, San Diego, CA, USA, pages 1235--1243, August 2011. Google ScholarDigital Library
- D. M. Powers. Evaluation: from Precision, Recall and F-measure to ROC, Informedness, Markedness and Correlation. Journal of Machine Learning Technologies, 2(1):37--63, 2011.Google ScholarCross Ref
- D. Shah. Gossip Algorithms. Now Publishers Inc, 2009.Google Scholar
- R. Singh, J. Xu, and B. Berger. Global alignment of multiple protein interaction networks with application to functional orthology detection. Proceedings of the National Academy of Sciences, (35):12763--12768.Google Scholar
- L. Torresani, V. Kolmogorov, and C. Rother. Feature Correspondence Via Graph Matching: Models and Global Optimization. In ECCV, Marseille, France, pages 596--609, October 2008. Google ScholarDigital Library
- B. Von Bahr and A. Martin-Löf. Threshold Limit Theorems for Some Epidemic Processes. Advances in Applied Probability, pages 319--349, 1980.Google Scholar
- L. Wiskott, J.-M. Fellous, N. Kuiger, and C. Von Der Malsburg. Face Recognition by Elastic Bunch Braph Matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(7):775--779, 1997. Google ScholarDigital Library
- G. Wondracek, T. Holz, E. Kirda, and C. Kruegel. A practical attack to de-anonymize social network users. In S&P, Berleley/Oakland, California, USA, pages 223--238, May 2010. Google ScholarDigital Library
- L. Yartseva and M. Grossglauser. On the performance of percolation graph matching. In COSN, Boston, MA, USA, pages 119--130, October 2013. Google ScholarDigital Library
Index Terms
- Growing a graph matching from a handful of seeds
Recommendations
Graph matching with partially-correct seeds
Graph matching aims to find the latent vertex correspondence between two edge-correlated graphs and has found numerous applications across different fields. In this paper, we study a seeded graph matching problem, which assumes that a set of seeds, i.e., ...
Connectivity of Matching Graph of Hypercube
The matching graph $\mathcal{M}(G)$ of a graph $G$ has a vertex set of all perfect matchings of $G$, with two vertices being adjacent whenever the union of the corresponding perfect matchings forms a Hamiltonian cycle. We prove that the matching graph $\...
Many-to-many graph matching: a continuous relaxation approach
ECMLPKDD'10: Proceedings of the 2010th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part IIIGraphs provide an efficient tool for object representation in various machine learning applications. Once graph-based representations are constructed, an important question is how to compare graphs. This problem is often formulated as a graph matching ...
Comments