research-article

Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts

Authors:
Aaron Schein

University of Massachusetts Amherst, Amherst, MA, USA

University of Massachusetts Amherst, Amherst, MA, USA
View Profile

,
John Paisley

Columbia University, New York, NY, USA

Columbia University, New York, NY, USA
View Profile

,
David M. Blei

Columbia University, New York, NY, USA

Columbia University, New York, NY, USA
View Profile

,
Hanna Wallach

Microsoft, New York, NY, USA

Microsoft, New York, NY, USA
View Profile

KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data MiningAugust 2015Pages 1045–1054https://doi.org/10.1145/2783258.2783414

Published:10 August 2015Publication History

KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 1045–1054

ABSTRACT

We present a Bayesian tensor factorization model for inferring latent group structures from dynamic pairwise interaction patterns. For decades, political scientists have collected and analyzed records of the form "country i took action a toward country j at time t" - known as dyadic events - in order to form and test theories of international relations. We represent these event data as a tensor of counts and develop Bayesian Poisson tensor factorization to infer a low-dimensional, interpretable representation of their salient patterns. We demonstrate that our model's predictive performance is better than that of standard non-negative tensor factorization methods. We also provide a comparison of our variational updates to their maximum likelihood counterparts. In doing so, we identify a better way to form point estimates of the latent factors than that typically used in Bayesian Poisson matrix factorization. Finally, we showcase our model as an exploratory analysis tool for political scientists. We show that the inferred latent factor matrices capture interpretable multilateral relations that both conform to and inform our knowledge of international a airs.

Supplemental Material

p1045.mp4

mp4

341.1 MB

Download

References

A. Cemgil. Bayesian inference for nonnegative matrix factorisation models. Computational Intelligence and Neuroscience, 2009. Google ScholarDigital Library
E. Chi and T. Kolda. On tensors, sparsity, and nonnegative factorizations. SIAM Journal on Matrix Analysis and Applications, 33(4):1272--1299, 2012.Google ScholarDigital Library
R. Dorfman. A formula for the Gini coefficient. The Review of Economics and Statistics, pages 146--149, 1979.Google ScholarCross Ref
R. Erikson, P. Pinto, and K. Rader. Dyadic analysis in international relations: A cautionary tale. Political Analysis, 22(4):457--463.Google ScholarCross Ref
B. Ermis and A. Cemgil. A Bayesian tensor factorization model via variational inference for link prediction. arXiv:1409.8276, 2014.Google Scholar
D. Gerner, P. Schrodt, R. Abu-Jabr, and Ö. Yilmaz. Conflict and mediation event observations (CAMEO): A new event data framework for the analysis of foreign policy interactions. Working paper.Google Scholar
N. Gillis and F. Glineur. Nonnegative factorization and the maximum edge biclique problem. arXiv:0810.4225, 2008.Google Scholar
E. Gonzalez and Y. Zhang. Accelerating the Lee-Seung algorithm for non-negative matrix factorization. Technical Report TR-05-02, Department of Computational and Applied Mathematics, Rice University, 2005.Google Scholar
P. Gopalan and D. Blei. Efficient discovery of overlapping communities in massive networks. Proceedings of the National Academy of Sciences, 2013.Google ScholarCross Ref
P. Gopalan, S. Gerrish, M. Freedman, D. Blei, and D. Mimno. Scalable inference of overlapping communities. In Advances in Neural Information Processing Systems Twenty-Five, 2012.Google Scholar
P. Gopalan, J. Hofman, and D. Blei. Scalable recommendation with Poisson factorization. In Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015.Google ScholarDigital Library
D. Green, S. Kim, and D. Yoon. Dirty pool. International Organization, 55(2):441--468, 2001.Google ScholarCross Ref
R. Harshman. Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multimodal factor analysis. UCLA Working Papers in Phonetics, 16:1--84, 1970.Google Scholar
P. Hoff. Equivariant and scale-free Tucker decomposition models. arXiv:1312.6397, 2013.Google Scholar
P. Hoff. Multilinear tensor regression for longitudinal relational data. arXiv:1412.0048, 2014.Google Scholar
P. Hoff and M. Ward. Modeling dependencies in international relations networks. Political Analysis, 12(2):160--175, 2004.Google ScholarCross Ref
G. King. Proper nouns and methodological propriety: Pooling dyads in international relations data. International Organization, 55(2):497--507, 2001.Google ScholarCross Ref
T. Kolda and J. Sun. Scalable tensor decompositions for multi-aspect data mining. In Proceedings of the Eighth IEEE International Conference on Data Mining, pages 363--372, 2008. Google ScholarDigital Library
D. Lee and S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature, 401:788--791, 1999.Google ScholarCross Ref
K. Leetaru and P. Schrodt. GDELT: Global data on events, location, and tone, 1979--2012. Working paper, 2013.Google Scholar
D. Liang, J. Paisley, and D. Ellis. Codebook-based scalable music tagging with Poisson matrix factorization. In Proceedings of the Fifteenth International Society for Music Information Retrieval Conference, 2015.Google Scholar
C. Lin. On the convergence of multiplicative update algorithms for nonnegative matrix factorization. IEEE Transactions on Neural Networks, 18(6):1589--1596, 2007. Google ScholarDigital Library
B. Marlin. Collaborative filtering: A machine learning perspective. Master's thesis, University of Toronto, 2004.Google Scholar
S. O'Brien. Crisis early warning and decision support: Contemporary approaches and thoughts on future research. International Studies Review, 12(1):87--104, 2010.Google ScholarCross Ref
J. Paisley, D. Blei, and M. Jordan. Bayesian nonnegative matrix factorization with stochastic variational inference. In Handbook of Mixed Membership Models and Their Applications. Chapman and Hall/CRC, 2014.Google Scholar
P. Poast. (Mis)using dyadic data to analyze multilateral events. Political Analysis, 2010.Google Scholar
A. Schein, J. Paisley, D. Blei, and H. Wallach. Inferring polyadic events with Poisson tensor factorization. In Proceedings of the NIPS 2014 Workshop on "Networks: From Graphs to Rich Data", 2014.Google Scholar
D. Singer and M. S. (producers). Correlates of war project: International and civil war data, 1816-1992 (computer file). Inter-University Consortium for Political and Social Research (distributor), 1994.Google Scholar
B. Stewart. Latent factor regressions for the social sciences. Working paper, 2014.Google Scholar
L. Tucker. Some mathematical notes on three-mode factor analysis. Psychometrika, 31(3):279--311, 1966.Google ScholarCross Ref
M. Ward, A. Beger, J. Cutler, M. Dickenson, C. Doorff, and B. Radford. Comparing GDELT and ICEWS event data. Working paper, 2013.Google Scholar
M. Welling and M. Weber. Positive tensor factorization. Pattern Recognition Letters, 22(12):1255--1261, 2001. Google ScholarDigital Library
Wikipedia. Cruise missile strikes on Afghanistan and Sudan (August 1998). Accessed June 8, 2015.Google Scholar
Wikipedia. Embassy of Ecuador, London. Accessed October 30, 2014.Google Scholar
Wikipedia. Japanese Iraq reconstruction and support group. Accessed June 8, 2015.Google Scholar
Wikipedia. Jyllands-Posten Muhammad cartoons controversy. Accessed June 8, 2015.Google Scholar
Wikipedia. Six-party talks. Accessed June 8, 2015.Google Scholar
M. Zhou and L. Carin. Negative binomial process count and mixture modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(2):307--320, 2015.Google ScholarDigital Library
M. Zhou, L. Hannah, D. Dunson, and L. Carin. Beta-negative binomial process and Poisson factor analysis. arXiv:1112.3605, 2011.Google Scholar

Index Terms

Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts
1. Applied computing
  1. Law, social and behavioral sciences

Recommendations

Tensor Completion Algorithms in Big Data Analytics

Tensor completion is a problem of filling the missing or unobserved entries of partially observed tensors. Due to the multidimensional character of tensors in describing complex datasets, tensor completion algorithms and their applications have received ...
Read More
Bayesian probabilistic matrix factorization using Markov chain Monte Carlo
ICML '08: Proceedings of the 25th international conference on Machine learning

Low-rank matrix approximation methods provide one of the simplest and most effective approaches to collaborative filtering. Such models are usually fitted to data by finding a MAP estimate of the model parameters, a procedure that can be performed ...
Read More
Dynamic Poisson Factorization
RecSys '15: Proceedings of the 9th ACM Conference on Recommender Systems

Models for recommender systems use latent factors to explain the preferences and behaviors of users with respect to a set of items (e.g., movies, books, academic papers). Typically, the latent factors are assumed to be static and, given these factors, ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2015
2378 pages
ISBN:9781450336642
DOI:10.1145/2783258
General Chairs:
Longbing Cao
University of Technology, Sydney
,
Chengqi Zhang
University of Technology, Sydney
,
Program Chairs:
Thorsten Joachims
Cornell University
,
Geoff Webb
Monash University
,
Dragos D. Margineantu
Boeing Research
,
Graham Williams
Australian Taxation Office
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 10 August 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
bayesian inference
dyadic data
international relations
poisson tensor factorization
Qualifiers
- research-article
Conference

Acceptance Rates
KDD '15 Paper Acceptance Rate160of819submissions,20%Overall Acceptance Rate1,133of8,635submissions,13%
More
Upcoming Conference
KDD '24

Sponsor:

sigkdd

sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 43
  Total Citations
  View Citations
- 1,057
  Total Downloads
- Downloads (Last 12 months)48
- Downloads (Last 6 weeks)11
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts

KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

ABSTRACT

Supplemental Material

References

Cited By

Index Terms

Recommendations

Tensor Completion Algorithms in Big Data Analytics

Bayesian probabilistic matrix factorization using Markov chain Monte Carlo

Dynamic Poisson Factorization