ABSTRACT
In this paper, we propose a new way to automatically model and predict how people receive and disseminate information by analyzing the contact and content of their personal communications. A personal profile, called a CommunityNet, is established for each individual using a novel algorithm that incorporates contact, content, and time information simultaneously. It can be used for personal social capital management. Clusters of CommunityNets provide a view of informal networks for organization management. Our algorithm combines dynamic algorithms from the social network field with semantic content classification methods from the natural language processing and machine learning literature. We tested CommunityNets on the Enron email corpus and report experimental results covering filtering, prediction, and recommendation capabilities. We show that personal behavior and intention are somewhat predictable based on these models. For instance, "to whom a person is going to send a specific email" can be predicted from that person's personal social network and content analysis. Experimental results show that the prediction accuracy of the proposed adaptive algorithm is 58% better than that of social network-based predictions and 75% better than that of an aggregated model based on Latent Dirichlet Allocation with social network enhancement. We also discuss two online demo systems that we developed to allow interactive exploration of CommunityNet.
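To make the recipient-prediction idea concrete, the following is a minimal Python sketch, not the CommunityNet algorithm itself. It assumes a simple linear blend of two signals: how often the sender has written to each contact, and how similar the new message is to past messages sent to that contact. The function names, the `alpha` weight, and the toy data are all hypothetical, and the time information used by the proposed algorithm is omitted.

```python
from collections import Counter
import math


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_recipients(sent_history, new_text, alpha=0.5):
    """Rank one sender's past contacts as candidate recipients of new_text.

    sent_history: list of (recipient, text) pairs from the sender's past mail.
    alpha: hypothetical weight on contact frequency; (1 - alpha) goes to
           content similarity. This blend is an assumption for illustration.
    """
    # Contact signal: share of past messages sent to each recipient.
    contact = Counter(recipient for recipient, _ in sent_history)
    total = sum(contact.values())

    # Content signal: word counts of everything previously sent to each recipient.
    words = {}
    for recipient, text in sent_history:
        words.setdefault(recipient, Counter()).update(text.lower().split())

    query = Counter(new_text.lower().split())
    scores = {
        r: alpha * contact[r] / total + (1 - alpha) * cosine(query, words[r])
        for r in contact
    }
    # Highest combined score first.
    return sorted(scores, key=scores.get, reverse=True)


# Toy data, invented for illustration only.
history = [
    ("alice", "quarterly budget forecast"),
    ("alice", "budget review meeting"),
    ("bob", "pipeline maintenance schedule"),
]
ranking = rank_recipients(history, "maintenance schedule update")
```

With `alpha=1.0` the ranking falls back to contact frequency alone, which loosely corresponds to a social-network-only baseline; lowering `alpha` lets the message content shift the ranking toward contacts who have received similar text before.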
Index Terms
- Modeling and predicting personal information dissemination behavior