research-article

Community detection in content-sharing social networks

Authors:
Nagarajan Natarajan

UT Austin

UT Austin
View Profile

,
Prithviraj Sen

IBM Research - Almaden

IBM Research - Almaden
View Profile

,
Vineet Chaoji

Amazon, Bangalore

Amazon, Bangalore
View Profile

ASONAM '13: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and MiningAugust 2013Pages 82–89https://doi.org/10.1145/2492517.2492546

Published:25 August 2013Publication History

ASONAM '13: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Pages 82–89

ABSTRACT

Network structure and content in microblogging sites like Twitter influence each other ---user A on Twitter follows user B for the tweets that B posts on the network, and A may then re-tweet the content shared by B to his/her own followers. In this paper, we propose a probabilistic model to jointly model link communities and content topics by leveraging both the social graph and the content shared by users. We model a community as a distribution over users, use it as a source for topics of interest, and jointly infer both communities and topics using Gibbs sampling. While modeling communities using the social graph, or modeling topics using content have received a great deal of attention, a few recent approaches try to model topics in content-sharing platforms using both content and social graph. Our work differs from the existing generative models in that we explicitly model the social graph of users along with the user-generated content, mimicking how the two entities co-evolve in content-sharing platforms. Recent studies have found Twitter to be more of a content-sharing network and less a social network, and it seems hard to detect tightly knit communities from the follower-followee links. Still, the question of whether we can extract Twitter communities using both links and content is open. In this paper, we answer this question in the affirmative. Our model discovers coherent communities and topics, as evinced by qualitative results on sub-graphs of Twitter users. Furthermore, we evaluate our model on the task of predicting follower-followee links. We show that joint modeling of links and content significantly improves link prediction performance on a sub-graph of Twitter (consisting of about 0.7 million users and over 27 million tweets), compared to generative models based on only structure or only content and paths-based methods such as Katz.

References

C. J. Anderson, S. Wasserman, and K. Faust. Building stochastic blockmodels. Social Networks, 1992.Google Scholar
B. Ball, B. Karrer, and M. Newman. An efficient and principled method for detecting communities in networks. CoRR, 2011.Google ScholarCross Ref
D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. JMLR, 2003. Google ScholarDigital Library
W. L. Buntine. Operations for learning with graphical models. JAIR'94. Google ScholarDigital Library
D. Cohn and T. Hofmann. The missing link - a probabilistic model of document content and hypertext connectivity. In NIPS, 2000.Google Scholar
L. Dietz, S. Bickel, and T. Scheffer. Unsupervised prediction of citation influences. In ICML, 2007. Google ScholarDigital Library
E. Eroshev, S. Fienberg, and J. Lafferty. Mixed-membership models of scientific publications. PNAS, 2004.Google ScholarCross Ref
S. Fortunato. Community detection in graphs. Physics Reports, 2010.Google ScholarCross Ref
S. Geman and D. Geman. Stochastic relaxation, gibbs distributions, and bayesian restoration of images. PAMI, 1984. Google ScholarDigital Library
M. Girvan and M. Newman. Community structure in social and biological networks. In PNAS, 2002.Google ScholarCross Ref
T. Griffiths and M. Steyvers. Finding scientific topics. In PNAS, 2004.Google ScholarCross Ref
B. Hu, Z. Song, and M. Ester. User features and social networks for topic modeling in online social media. In ASONAM, 2012, pages 202--209. IEEE, 2012. Google ScholarDigital Library
B. Karrer and M. Newman. Stochastic blockmodels and community structure in networks. Phys. Rev. E, 2011.Google ScholarCross Ref
H. Kwak, C. Lee, H. Park, and S. Moon. What is Twitter, a social network or a news media? In WWW, 2010. Google ScholarDigital Library
J. Leskovec, D. Chakrabarti, J. Kleinberg, C. Faloutsos, and Z. Ghahramani. Kronecker graphs: An approach to modeling networks. JMLR'10. Google ScholarDigital Library
J. Leskovec and C. Faloutsos. Sampling from large graphs. In KDD'06. Google ScholarDigital Library
D. Liben-Nowell and J. Kleinberg. The link-prediction problem for social networks. JASIST, 2007. Google ScholarDigital Library
Y. Liu, A. Niculescu-Mizil, and W. Gryc. Topic-link lda: Joint models of topic and author community. In ICML, 2009. Google ScholarDigital Library
Z. Lu, B. Savas, W. Tang, and I. Dhillon. Supervised link prediction using multiple sources. In ICDM, 2010. Google ScholarDigital Library
A. McCallum, A. Corrada-Emmanuel, and X. Wang. Topic and role discovery in social networks. In IJCAI, 2005. Google ScholarDigital Library
A. McCallum, X. Wang, and A. Corrada-Emmanuel. Topic and role discovery in social networks with experiments on enron and academic email. JAIR, 2007. Google ScholarDigital Library
T. P. Minka. Estimating a dirichlet distribution. Technical report, Microsoft Research, 2003.Google Scholar
R. Nallapati and W. Cohen. Link-plsa-lda: A new unsupervised model for topics and influence of blogs. In ICWSM, 2008.Google Scholar
R. M. Nallapati, A. Ahmed, E. P. Xing, and W. Cohen. Joint latent topic models for text and citations. In KDD, 2008. Google ScholarDigital Library
M. Newman. Detecting community structure in networks. The European Physical Journal B, 2004.Google ScholarCross Ref
M. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review, 2004.Google ScholarCross Ref
N. Pathak, C. Delong, A. Banerjee, and K. Erickson. Social Topic Models for Community Extraction. In SNA-KDD, 2008.Google Scholar
I. Porteous, D. Newman, A. Ihler, A. Asuncion, P. Smyth, and M. Welling. Fast collapsed gibbs sampling for LDA. KDD'08. Google ScholarDigital Library
M. Rosen-Zvi, T. Griffiths, M. Steyvers, and P. Smyth. The author-topic model for authors and documents. In UAI, 2004. Google ScholarDigital Library
M. Sachan, D. Contractor, T. Faruquie, and L. V. Subramaniam. Using content and interactions for discovering communities in social networks. In WWW, 2012. Google ScholarDigital Library
Y. Teh, M. Jordan, M. Beal, and D. Blei. Hierarchical dirichlet processes. Journal of American Statistical Association, 2005.Google Scholar
J. Yang and J. Leskovec. Patterns of temporal variation in online media. In WSDM, 2011. Google ScholarDigital Library
W. Zachary. An information flow model for conflict and fission in small groups. Journal of anthropological research, 1977.Google Scholar
D. Zhou, E. Manavoglu, J. Li, C. L. Giles, and H. Zha. Probabilistic models for discovering e-communities. WWW'06. Google ScholarDigital Library

Recommendations

Community detection in large-scale social networks
WebKDD/SNA-KDD '07: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis

Recent years have seen that WWW is becoming a flourishing social media which enables individuals to easily share opinions, experiences and expertise at the push of a single button. With the pervasive usage of instant messaging systems and the ...
Read More
Overlapping community detection in networks: The state-of-the-art and comparative study

This article reviews the state-of-the-art in overlapping community detection algorithms, quality measures, and benchmarks. A thorough comparison of different algorithms (a total of fourteen) is provided. In addition to community-level evaluation, we ...
Read More
Efficient community detection in large networks using content and links
WWW '13: Proceedings of the 22nd international conference on World Wide Web

In this paper we discuss a very simple approach of combining content and link information in graph structures for the purpose of community discovery, a fundamental task in network analysis. Our approach hinges on the basic intuition that many networks ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ASONAM '13: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
August 2013
1558 pages
ISBN:9781450322409
DOI:10.1145/2492517
General Chairs:
Jon Rokne
University of Calgary, Calgary, AB, Canada
,
Christos Faloutsos
Carnegie Mellon University, Pittsburgh, PA
Copyright © 2013 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 August 2013
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate116of549submissions,21%
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 34
  Total Citations
  View Citations
- 403
  Total Downloads
- Downloads (Last 12 months)18
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Community detection in content-sharing social networks

ASONAM '13: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

ABSTRACT

References

Cited By

Recommendations

Community detection in large-scale social networks

Overlapping community detection in networks: The state-of-the-art and comparative study

Efficient community detection in large networks using content and links