ABSTRACT
The purpose of this study is to leverage modern technology (mobile or web apps) to enrich epidemiology data and infer the transmission of disease. We develop hierarchical Graph-Coupled Hidden Markov Models (hGCHMMs) to simultaneously track the spread of infection in a small cell phone community and capture person-specific infection parameters by leveraging a link prior that incorporates additional covariates. In this paper we investigate two link functions, the beta-exponential link and sigmoid link, both of which allow the development of a principled Bayesian hierarchical framework for disease transmission. The results of our model allow us to predict the probability of infection for each persons on each day, and also to infer personal physical vulnerability and the relevant association with covariates. We demonstrate our approach theoretically and experimentally on both simulation data and real epidemiological records.
- R. Beckman, K. R. Bisset, J. Chen, B. Lewis, M. Marathe, and P. Stretz. Isis: A networked-epidemiology based pervasive web app for infectious disease pandemic planning and response. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining , pages 1847--1856. ACM, 2014. Google ScholarDigital Library
- D. M. Blei, T. L. Griffiths, and M. I. Jordan. The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies. Journal of the ACM (JACM), 57(2):7, 2010. Google ScholarDigital Library
- M. Brand, N. Oliver, and A. Pentland. Coupled hidden markov models for complex action recognition. In Computer Vision and Pattern Recognition, 1997. Proceedings., 1997 IEEE Computer Society Conference on, pages 994--999. IEEE, 1997. Google ScholarDigital Library
- G. Celeux, D. Chauveau, J. Diebolt, et al. On stochastic versions of the em algorithm. 1995.Google Scholar
- J. Chang, D. M. Blei, et al. Hierarchical relational models for document networks. The Annals of Applied Statistics , 4(1):124--150, 2010.Google ScholarCross Ref
- R. M. Christley, G. Pinchbeck, R. Bowers, D. Clancy, N. French, R. Bennett, and J. Turner. Infection in social networks: using network analysis to identify high-risk individuals. American journal of epidemiology , 162(10):1024--1031, 2005.Google Scholar
- B. Delyon, M. Lavielle, and E. Moulines. Convergence of a stochastic approximation version of the em algorithm. Annals of Statistics, pages 94--128, 1999.Google Scholar
- W. Dong, B. Lepri, and A. S. Pentland. Modeling the co-evolution of behaviors and social relationships using mobile phone data. In Proceedings of the 10th International Conference on Mobile and Ubiquitous Multimedia, pages 134--143. ACM, 2011. Google ScholarDigital Library
- W. Dong, A. Pentland, and K. A. Heller. Graph-coupled hmms for modeling the spread of infection. Association for Uncertainty in Artificial Intelligence, 2012.Google Scholar
- R. Durrett. Probability: theory and examples. Cambridge university press, 2010. Google ScholarDigital Library
- A. Gruber, Y. Weiss, and M. Rosen-Zvi. Hidden topic markov models. In International Conference on Artificial Intelligence and Statistics, pages 163--170, 2007.Google Scholar
- D. L. Knowles, K. G. Stanley, and N. D. Osgood. A field-validated architecture for the collection of health-relevant behavioural data. In Healthcare Informatics (ICHI), 2014 IEEE International Conference on, pages 79--88. IEEE, 2014. Google ScholarDigital Library
- E. Kuhn and M. Lavielle. Coupling a stochastic approximation version of em with an mcmc procedure. ESAIM: Probability and Statistics, 8:115--131, 2004.Google ScholarCross Ref
- K. Lange. A gradient algorithm locally equivalent to the em algorithm. Journal of the Royal Statistical Society. Series B (Methodological), pages 425--437, 1995.Google Scholar
- A. Madan, M. Cebrian, S. Moturu, K. Farrahi, and A. Pentland. Sensing the" health state" of a community. IEEE Pervasive Computing, 11(4):36--45, 2012. Google ScholarDigital Library
- J. Nocedal and S. Wright. Numerical optimization, series in operations research and financial engineering. Springer, New York, USA, 2006.Google Scholar
- J. Paisley, C. Wang, D. M. Blei, and M. I. Jordan. Nested hierarchical dirichlet processes. 2012.Google Scholar
- R. Price. A useful theorem for nonlinear devices having gaussian inputs. Information Theory, IRE Transactions on, 4(2):69--72, 1958.Google Scholar
- D. J. Rezende, S. Mohamed, and D. Wierstra. Stochastic backpropagation and approximate inference in deep generative models. In Proceedings of The 31st International Conference on Machine Learning, pages 1278--1286, 2014.Google ScholarDigital Library
- M. Salathé, M. Kazandjieva, J. W. Lee, P. Levis, M. W. Feldman, and J. H. Jones. A high-resolution human contact network for infectious disease transmission. Proceedings of the National Academy of Sciences, 107(51):22020--22025, 2010.Google ScholarCross Ref
- Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei. Hierarchical dirichlet processes. Journal of the american statistical association, 101(476), 2006.Google Scholar
- G. C. Wei and M. A. Tanner. A monte carlo implementation of the em algorithm and the poor man's data augmentation algorithms. Journal of the American Statistical Association, 85(411):699--704, 1990Google ScholarCross Ref
Index Terms
- Hierarchical Graph-Coupled HMMs for Heterogeneous Personalized Health Data
Recommendations
Dynamically Modeling Patient's Health State from Electronic Medical Records: A Time Series Approach
KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data MiningIn this paper, we present a method to dynamically estimate the probability of mortality inside the Intensive Care Unit (ICU) by combining heterogeneous data. We propose a method based on Generalized Linear Dynamic Models that models the probability of ...
Patient Subtyping via Time-Aware LSTM Networks
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data MiningIn the study of various diseases, heterogeneity among patients usually leads to different progression patterns and may require different types of therapeutic intervention. Therefore, it is important to study patient subtyping, which is grouping of ...
Predicting Clinical Outcomes Across Changing Electronic Health Record Systems
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data MiningExisting machine learning methods typically assume consistency in how semantically equivalent information is encoded. However, the way information is recorded in databases differs across institutions and over time, often rendering potentially useful ...
Comments