skip to main content
research-article
Public Access

An Unsupervised Approach to Inferring the Localness of People Using Incomplete Geotemporal Online Check-In Data

Published:25 August 2017Publication History
Skip Abstract Section

Abstract

Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time.

References

  1. Amr Ahmed, Yucheng Low, Mohamed Aly, Vanja Josifovski, and Alexander J. Smola. 2011. Scalable distributed inference of dynamic user interests for behavioral targeting. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 114--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Lars Backstrom, Eric Sun, and Cameron Marlow. 2010. Find me if you can: Improving geographical prediction with social and spatial proximity. In Proceedings of the 19th International Conference on World Wide Web. ACM, New York, NY, 61--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Paul N. Bennett, Ryen W. White, Wei Chu, Susan T. Dumais, Peter Bailey, Fedor Borisyuk, and Xiaoyuan Cui. 2012. Modeling the impact of short-and long-term behavior on search personalization. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 185--194. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Giuseppe Cardone, Luca Foschini, Paolo Bellavista, Antonio Corradi, Cristian Borcea, Manoop Talasila, and Reza Curtmola. 2013. Fostering participation in smart cities: A geo-social crowdsensing platform. IEEE Communications Magazine 51, 6, 112--119. Google ScholarGoogle ScholarCross RefCross Ref
  5. Xuefeng Chen, Yifeng Zeng, Gao Cong, Shengchao Qin, Yanping Xiang, and Yuanshun Dai. 2015. On information coverage for location category based point-of-interest recommendation. In Proceedings of the 29th AAAI Conference on Artificial Intelligence.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2010. You are where you tweet: A content-based approach to geo-locating Twitter users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. ACM, New York, NY, 759--768. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2013. A content-driven framework for geolocating microblog users. ACM Transactions on Intelligent Systems and Technology 4, 1, 2.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Zhiyuan Cheng, James Caverlee, Kyumin Lee, and Daniel Z. Sui. 2011. Exploring millions of footprints in location sharing services. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media. 81--88.Google ScholarGoogle Scholar
  9. Jennifer Golbeck. 2009. Trust and nuanced profile similarity in online social networks. ACM Transactions on the Web 3, 4, 12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Marta C. Gonzalez, Cesar A. Hidalgo, and Albert-Laszlo Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196, 779--782. Google ScholarGoogle ScholarCross RefCross Ref
  11. Tian-Ran Hu, Jie-Bo Luo, Henry Kautz, and Adam Sadilek. 2015. Home location inference from sparse and noisy data: Models and applications. In IProceedings of the 2015 IEEE International Conference on Data Mining Workshop. IEEE, Los Alamitos, CA, 1382--1387.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Chao Huang and Dong Wang. 2016a. Exploiting spatial-temporal-social constraints for localness inference using online social media. In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. IEEE, Los Alamitos, CA, 287--294. Google ScholarGoogle ScholarCross RefCross Ref
  13. Chao Huang and Dong Wang. 2016b. Topic-aware social sensing with arbitrary source dependency graphs. In Proceedings of the 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks. IEEE, Los Alamitos, CA, 1--12. Google ScholarGoogle ScholarCross RefCross Ref
  14. Chao Huang, Dong Wang, and Nitesh Chawla. 2015. Towards time-sensitive truth discovery in social sensing applications. In Proceedings of the 2015 IEEE 12th International Conference on Mobile Ad Hoc and Sensor Systems. IEEE, Los Alamitos, CA, 154--162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. David Jurgens. 2013. That’s what friends are for: Inferring location in online social media platforms based on social relationships. In Proceedings of the 7th International AAAI Conference on Weblogs and Social Media. 273--282.Google ScholarGoogle Scholar
  16. Kazuki Kodama, Yuichi Iijima, Xi Guo, and Yoshiharu Ishikawa. 2009. Skyline queries based on user locations and preferences for making location-based recommendations. In Proceedings of the 2009 International Workshop on Location Based Social Networks. ACM, New York, NY, 9--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Cliff A. C. Lampe, Nicole Ellison, and Charles Steinfield. 2007. A familiar face (book): Profile elements as signals in an online social network. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 435--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Rui Li, Shengjie Wang, and Kevin Chen-Chuan Chang. 2012a. Multiple location profiling for users and relationships from social network and content. Proceedings of the VLDB Endowment 5, 11, 1603--1614. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Rui Li, Shengjie Wang, Hongbo Deng, Rui Wang, and Kevin Chen-Chuan Chang. 2012b. Towards social user profiling: Unified and discriminative influence model for inferring home locations. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 1023--1031. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip, and Nageswara S. V. Rao. 2013. Privacy vulnerability of published anonymous mobility traces. IEEE/ACM Transactions on Networking 21, 3, 720--733. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Jermaine Marshall, Munira Syed, and Dong Wang. 2016. Hardness-aware truth discovery in social sensing applications. In Proceedings of the 2016 International Conference on Distributed Computing in Sensor Systems. IEEE, Los Alamitos, CA, 143--152. Google ScholarGoogle ScholarCross RefCross Ref
  22. Jermaine Marshall and Dong Wang. 2016. Mood-sensitive truth discovery for reliable recommendation systems in social sensing. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, New York, NY, 167--174. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Jeffrey McGee, James Caverlee, and Zhiyuan Cheng. 2013. Location prediction in social media based on tie strength. In Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management. ACM, New York, NY, 459--468. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. CUDA NVIDIA. 2008. Programming Guide.Google ScholarGoogle Scholar
  25. Moon-Hee Park, Jin-Hyuk Hong, and Sung-Bae Cho. 2007. Location-based recommendation system using Bayesian users preference model in mobile devices. In Ubiquitous Intelligence and Computing. Springer, 1130--1139. Google ScholarGoogle ScholarCross RefCross Ref
  26. Ulrike Pfeil, Raj Arjan, and Panayiotis Zaphiris. 2009. Age differences in online social networking—a study of user profiles and the social capital divide among teenagers and older users in MySpace. Computers in Human Behavior 25, 3, 643--654. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Foster Provost, Brian Dalessandro, Rod Hook, Xiaohan Zhang, and Alan Murray. 2009. Audience selection for on-line brand advertising: Privacy-friendly social network targeting. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 707--716. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. L. Ramaswamy, P. Deepak, R. Polavarapu, K. Gunasekera, D. Garg, K. Visweswariah, and S. Kalyanaraman. 2009. Caesar: A context-aware, social recommender system for low-end mobile devices. In Proceedings of the 10th International Conference on Mobile Data Management: Systems, Services, and Middleware. IEEE, Los Alamitos, CA, 338--347. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. C. Ratti, S. Williams, D. Frenchman, and R. M. Pulselli. 2006. Mobile landscapes: Using location data from cell phones for urban analysis. Environment and Planning B: Urban Analytics and City Science 33, 5, 727.Google ScholarGoogle Scholar
  30. Dong Wang, Tarek Abdelzaher, and Lance Kaplan. 2014a. Surrogate mobile sensing. IEEE Communications Magazine 52, 8, 36--41. Google ScholarGoogle ScholarCross RefCross Ref
  31. Dong Wang, Md Tanvir Al Amin, Tarek Abdelzaher, Dan Roth, Clare R. Voss, Lance M. Kaplan, Stephen Tratz, Jamal Laoudi, and Douglas Briesch. 2014b. Provenance-assisted classification in social networks. IEEE Journal of Selected Topics in Signal Processing 8, 4, 624--637. Google ScholarGoogle ScholarCross RefCross Ref
  32. Dong Wang and Chao Huang. 2015. Confidence-aware truth estimation in social sensing applications. In Proceedings of the 12th Annual IEEE International Conference on Sensing, Communication, and Networking. IEEE, Los Alamitos, CA, 336--344. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Dong Wang, Lance Kaplan, Hieu Le, and Tarek Abdelzaher. 2012. On truth discovery in social sensing: A maximum likelihood estimation approach. In Proceedings of the 11th International Conference on Information Processing in Sensor Networks. ACM, New York, NY, 233--244. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Ryen W. White, Peter Bailey, and Liwei Chen. 2009. Predicting user interests from contextual information. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 363--370. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Hongzhi Yin, Yizhou Sun, Bin Cui, Zhiting Hu, and Ling Chen. 2013. LCARS: A location-content-aware recommender system. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 221--229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Jia-Dong Zhang and Chi-Yin Chow. 2015. GeoSoCa: Exploiting geographical, social and categorical correlations for point-of-interest recommendations. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 443--452. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An Unsupervised Approach to Inferring the Localness of People Using Incomplete Geotemporal Online Check-In Data

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM Transactions on Intelligent Systems and Technology
            ACM Transactions on Intelligent Systems and Technology  Volume 8, Issue 6
            Survey Paper, Regular Papers and Special Issue: Social Media Processing
            November 2017
            265 pages
            ISSN:2157-6904
            EISSN:2157-6912
            DOI:10.1145/3127339
            • Editor:
            • Yu Zheng
            Issue’s Table of Contents

            Copyright © 2017 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 25 August 2017
            • Accepted: 1 December 2016
            • Revised: 1 October 2016
            • Received: 1 January 2016
            Published in tist Volume 8, Issue 6

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader