skip to main content
10.1145/1081870.1081925acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

Modeling and predicting personal information dissemination behavior

Published:21 August 2005Publication History

ABSTRACT

In this paper, we propose a new way to automatically model and predict human behavior of receiving and disseminating information by analyzing the contact and content of personal communications. A personal profile, called CommunityNet, is established for each individual based on a novel algorithm incorporating contact, content, and time information simultaneously. It can be used for personal social capital management. Clusters of CommunityNets provide a view of informal networks for organization management. Our new algorithm is developed based on the combination of dynamic algorithms in the social network field and the semantic content classification methods in the natural language processing and machine learning literatures. We tested CommunityNets on the Enron Email corpus and report experimental results including filtering, prediction, and recommendation capabilities. We show that the personal behavior and intention are somewhat predictable based on these models. For instance, "to whom a person is going to send a specific email" can be predicted by one's personal social network and content analysis. Experimental results show the prediction accuracy of the proposed adaptive algorithm is 58% better than the social network-based predictions, and is 75% better than an aggregated model based on Latent Dirichlet Allocation with social network enhancement. Two online demo systems we developed that allow interactive exploration of CommunityNet are also discussed.

References

  1. B. A. Nardi, S. Whittaker, and H. Schwarz. "It's not what you know, it's who you know: work in the information age," First Mon., 5, 2000.Google ScholarGoogle Scholar
  2. D. Krackhardt, "Panel on Informal Networks within Formal Organizations," XXV Intl. Social Network Conf., Feb. 2005.Google ScholarGoogle Scholar
  3. D. Krackhardt and M. Kilduff, "Structure, culture and Simmelian ties in entrepreneurial firms," Social Networks, Vol. 24, 2002.Google ScholarGoogle Scholar
  4. B. Nardi, S. Whittaker, E. Isaacs, M. Creech, J. Johnson, and J. Hainsworth, "ContactMap: Integrating Communication and Information Through Visualizing Personal Social Networks," Com. of the Association for Computing Machinery. April, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. https://www.linkedin.com/home?trk=logo.Google ScholarGoogle Scholar
  6. https://www.orkut.com/Login.aspx.Google ScholarGoogle Scholar
  7. http://www.friendster.com/.Google ScholarGoogle Scholar
  8. N. Lin, "Social Capital," Cambridge Univ. Press, 2001.Google ScholarGoogle Scholar
  9. W. Cohen. http://www-2.cs.cmu.edu/~enron/.Google ScholarGoogle Scholar
  10. S. Milgram. "The Small World Problem," Psychology Today, pp 60--67, May 1967.Google ScholarGoogle Scholar
  11. M. Schwartz and D. Wood, "Discovering Shared Interests Among People Using Graph Analysis", Comm. ACM, v. 36, Aug. 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. Kautz, B. Selman, and M. Shah. "Referral Web: Combining social networks and collaborative filtering," Comm. ACM, March 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. G. W. Flake, S. Lawrence, C. Lee Giles, and F. M. Coetzee. "Self-organization and identification of Web communities," IEEE Computer, 35(3):66--70, March 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Tyler, D. Wilkinson, and B. A. Huberman. "Email as spectroscopy: Automated Discovery of Community Structure Within Organizations," Intl. Conf. on Communities and Technologies., 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. L. Page, S. Brin, R. Motwani and T. Winograd. "The PageRank Citation Ranking: Bringing Order to the Web," Stanford Digital Libraries Working Paper, 1998.Google ScholarGoogle Scholar
  16. J. Kleinberg. "Authoritative sources in a hyperlinked environment," In Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Wasserman, and P. E. Pattison, "Logit models and logistic regression for social networks: I. An introduction to Markov graphs and p*", Psychometrika, 61: 401-- 425, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  18. T. A.B. Snijders. "Models for Longitudinal Network Data," Chapter 11 in Models and methods in social network analysis, New York: Cambridge University Press, 2004.Google ScholarGoogle Scholar
  19. D. L.-Nowell and J. Kleinberg, "The Link Prediction Problem for Social Networks," In Proceedings of the 12th Intl. Conf. on Information and Knowledge Management, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Kubica, A. Moore, J. Schneider, and Y. Yang. "Stochastic Link and Group Detection," In Proceedings of the 2002 AAAI Conference. Edmonton, Alberta, 798--804, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Handcock and D. Hunter, "Curved Exponential Family Models for Networks," XXV Intl. Social Network Conf., Feb. 2005.Google ScholarGoogle Scholar
  22. T. Hofmann, "Probabilistic Latent Semantic Analysis," Proc. of the Conf. on Uncertainty in Artificial Intelligence, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. D. Blei, A. Ng, and M. Jordan, "Latent Dirichlet allocation," Journal of Machine Learning Research, 3:993--1022, January 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. T. Griffiths and M. Steyvers, "Finding Scientific Topics," Proc. of the National Academy of Sciences, 5228--5235, 2004.Google ScholarGoogle Scholar
  25. M. R.-Zvi, T. Griffiths, M. Steyvers and P. Smyth, "The Author-Topic Model for Authors and Documents", Proc. of the Conference on Uncertainty in Artificial Intelligence volume 21, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. A. McCallum, A. Corrada-Emmanuel, and X. Wang, "The Author-Recipient-Topic Model for Topic and Role Discovery in Social Networks: Experiments with Enron and Academic Email," Technical Report UM-CS-2004-096, 2004.Google ScholarGoogle Scholar
  27. X. Song, B. L. Tseng, C.-Y. Lin, and M.-T. Sun, "ExpertiseNet: Relational and Evolutionary Expert Modeling," 10th Intl. Conf. on User Modeling, Edinburgh, UK, July 24-30, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. J. Allan, R. Papka, and V. Lavrenko. "On-line New Event Detection and Tracking," Proc. of 21st ACM SIGIR, pp.37--45, August 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. http://en.wikipedia.org/wiki/Timeline_of_the_Enron_scandal.Google ScholarGoogle Scholar
  30. J. Breese, D. Heckerman, and C. Kadie. "Empirical analysis of predictive algorithms for collaborative filtering," Conf. on Uncertainty in Artificial Intelligence, Madison,WI, July 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Modeling and predicting personal information dissemination behavior

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
      August 2005
      844 pages
      ISBN:159593135X
      DOI:10.1145/1081870

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 August 2005

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader