ABSTRACT
Community detection for attributed graphs, also called attributed graph clustering, is a new challenging issue in data mining due to the increasing emergence of different kinds of real-word networks with rich attributes. The existing works for the attributed graph clustering can be divided into two classes, namely distanced-based approaches and model-based approaches. In this paper, we focus on a model-based approach called clustered attributed graph model proposed by Xu et al. [12]. Instead of the original variational Bayes EM algorithm (VBEM) for solving this model, we propose a new variational EM algorithm (VEM). Comparing with the VBEM algorithm, our proposed VEM algorithm can reduce the number of parameters when fitting the model, which brings the lower computational complexity and easier implementation in practice. Additionally, a good model selection criterion ICL can be easily derived under the VEM framework. Our proposed VEM algorithm is demonstrated to perform competitively over the existing state of the art VBEM algorithm in terms of the extensive simulations and the real data.
- L. A. Adamic and N. Glance. The political blogosphere and the 2004 us election: divided they blog. In Proceedings of the 3rd international workshop on Link discovery, pages 36--43. ACM, 2005. Google ScholarDigital Library
- C. Biernacki, G. Celeux, and G. Govaert. Assessing a mixture model for clustering with the integrated completed likelihood. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22(7):719--725, 2000. Google ScholarDigital Library
- H. Cheng, Y. Zhou, and J. X. Yu. Clustering large attributed graphs: A balance between structural and attribute similarities. ACM Transactions on Knowledge Discovery from Data (TKDD), 5(2):12, 2011. Google ScholarDigital Library
- J.-J. Daudin, F. Picard, and S. Robin. A mixture model for random graphs. Statistics and computing, 18(2):173--183, 2008. Google ScholarDigital Library
- S. Fortunato. Community detection in graphs. Physics Reports, 486(3):75--174, 2010.Google ScholarCross Ref
- P. W. Holland, K. B. Laskey, and S. Leinhardt. Stochastic blockmodels: First steps. Social networks, 5(2):109--137, 1983.Google ScholarCross Ref
- Z. G. Matthew J Beal. The Variational Bayesian EM Algorithm for Incomplete Data: with Application to Scoring Graphical Model Structures, volume 7. Oxford University Press, Oxford, 2003.Google Scholar
- M. E. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical review E, 69(2):026113, 2004.Google Scholar
- J. Scott and P. J. Carrington. The SAGE handbook of social network analysis. SAGE publications, 2011. Google ScholarDigital Library
- J. Shi and J. Malik. Normalized cuts and image segmentation. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22(8):888--905, 2000. Google ScholarDigital Library
- Y. Wang, X. Chang, R. Li, and Z. Xu. Sparse k-means with the ℓq(0 < q < 1) constraint for high-dimensional data clustering. In Data Mining (ICDM), 2013 IEEE 13th International Conference on, pages 797--806. IEEE, 2013.Google ScholarCross Ref
- Z. Xu, Y. Ke, Y. Wang, H. Cheng, and J. Cheng. A model-based approach to attributed graph clustering. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pages 505--516. ACM, 2012. Google ScholarDigital Library
- H. Zanghi, S. Volant, and C. Ambroise. Clustering based on random graph model embedding vertex features. Pattern Recognition Letters, 31(9):830--836, 2010. Google ScholarDigital Library
- Y. Zhou, H. Cheng, and J. X. Yu. Graph clustering based on structural/attribute similarities. Proceedings of the VLDB Endowment, 2(1):718--729, 2009. Google ScholarDigital Library
- Y. Zhou, H. Cheng, and J. X. Yu. Clustering large attributed graphs: An efficient incremental approach. In Data Mining (ICDM), 2010 IEEE 10th International Conference on, pages 689--698. IEEE, 2010. Google ScholarDigital Library
Index Terms
- Community Detection for Clustered Attributed Graphs via a Variational EM Algorithm
Recommendations
A variational EM algorithm for the separation of time-varying convolutive audio mixtures
This paper addresses the problem of separating audio sources from time-varying convolutive mixtures. We propose a probabilistic framework based on the local complex-Gaussian model combined with non-negative matrix factorization. The time-varying mixing ...
Exploring multiple clusterings in attributed graphs
SAC '15: Proceedings of the 30th Annual ACM Symposium on Applied ComputingMany graph clustering algorithms aim at generating a single partitioning (clustering) of the data. However, there can be many ways a dataset can be clustered. From a exploratory analisys perspective, given a dataset, the availability of many different ...
A model-based approach to attributed graph clustering
SIGMOD '12: Proceedings of the 2012 ACM SIGMOD International Conference on Management of DataGraph clustering, also known as community detection, is a long-standing problem in data mining. However, with the proliferation of rich attribute information available for objects in real-world graphs, how to leverage structural and attribute ...
Comments