Abstract
Inferring the localness of people is to classify people who are local residents in a city from people who visit the city by analyzing online check-in points that are contributed by online users. This information is critical for the urban planning, user profiling, and localized recommendation systems. Supervised learning approaches have been developed to infer the location of people in a city by assuming the availability of high-quality training datasets with complete geotemporal information. In this article, we develop an unsupervised model to accurately identify local people in a city by using the incomplete online check-in data that are publicly available. In particular, we develop an incomplete geotemporal expectation maximization (IGT-EM) scheme, which incorporates a set of hidden variables to represent the localness of people and a set of estimation parameters to represent the likelihood of venues to attract local and nonlocal people, respectively. Our solution can accurately classify local people from nonlocal nones without requiring any training data. We also implement a parallel IGT-EM algorithm by leveraging the computing power of a graphic processing unit (GPU) that consists of 2,496 cores. In the evaluation, we compare our new approach with the existing solutions through four real-world case studies using data from the New York City, Chicago, Boston, and Washington, DC. The results show that our approach can identify the local people and significantly outperform the compared baselines in estimation accuracy and execution time.
- Amr Ahmed, Yucheng Low, Mohamed Aly, Vanja Josifovski, and Alexander J. Smola. 2011. Scalable distributed inference of dynamic user interests for behavioral targeting. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 114--122. Google ScholarDigital Library
- Lars Backstrom, Eric Sun, and Cameron Marlow. 2010. Find me if you can: Improving geographical prediction with social and spatial proximity. In Proceedings of the 19th International Conference on World Wide Web. ACM, New York, NY, 61--70. Google ScholarDigital Library
- Paul N. Bennett, Ryen W. White, Wei Chu, Susan T. Dumais, Peter Bailey, Fedor Borisyuk, and Xiaoyuan Cui. 2012. Modeling the impact of short-and long-term behavior on search personalization. In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 185--194. Google ScholarDigital Library
- Giuseppe Cardone, Luca Foschini, Paolo Bellavista, Antonio Corradi, Cristian Borcea, Manoop Talasila, and Reza Curtmola. 2013. Fostering participation in smart cities: A geo-social crowdsensing platform. IEEE Communications Magazine 51, 6, 112--119. Google ScholarCross Ref
- Xuefeng Chen, Yifeng Zeng, Gao Cong, Shengchao Qin, Yanping Xiang, and Yuanshun Dai. 2015. On information coverage for location category based point-of-interest recommendation. In Proceedings of the 29th AAAI Conference on Artificial Intelligence.Google ScholarDigital Library
- Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2010. You are where you tweet: A content-based approach to geo-locating Twitter users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. ACM, New York, NY, 759--768. Google ScholarDigital Library
- Zhiyuan Cheng, James Caverlee, and Kyumin Lee. 2013. A content-driven framework for geolocating microblog users. ACM Transactions on Intelligent Systems and Technology 4, 1, 2.Google ScholarDigital Library
- Zhiyuan Cheng, James Caverlee, Kyumin Lee, and Daniel Z. Sui. 2011. Exploring millions of footprints in location sharing services. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media. 81--88.Google Scholar
- Jennifer Golbeck. 2009. Trust and nuanced profile similarity in online social networks. ACM Transactions on the Web 3, 4, 12.Google ScholarDigital Library
- Marta C. Gonzalez, Cesar A. Hidalgo, and Albert-Laszlo Barabasi. 2008. Understanding individual human mobility patterns. Nature 453, 7196, 779--782. Google ScholarCross Ref
- Tian-Ran Hu, Jie-Bo Luo, Henry Kautz, and Adam Sadilek. 2015. Home location inference from sparse and noisy data: Models and applications. In IProceedings of the 2015 IEEE International Conference on Data Mining Workshop. IEEE, Los Alamitos, CA, 1382--1387.Google ScholarDigital Library
- Chao Huang and Dong Wang. 2016a. Exploiting spatial-temporal-social constraints for localness inference using online social media. In Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. IEEE, Los Alamitos, CA, 287--294. Google ScholarCross Ref
- Chao Huang and Dong Wang. 2016b. Topic-aware social sensing with arbitrary source dependency graphs. In Proceedings of the 2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks. IEEE, Los Alamitos, CA, 1--12. Google ScholarCross Ref
- Chao Huang, Dong Wang, and Nitesh Chawla. 2015. Towards time-sensitive truth discovery in social sensing applications. In Proceedings of the 2015 IEEE 12th International Conference on Mobile Ad Hoc and Sensor Systems. IEEE, Los Alamitos, CA, 154--162. Google ScholarDigital Library
- David Jurgens. 2013. That’s what friends are for: Inferring location in online social media platforms based on social relationships. In Proceedings of the 7th International AAAI Conference on Weblogs and Social Media. 273--282.Google Scholar
- Kazuki Kodama, Yuichi Iijima, Xi Guo, and Yoshiharu Ishikawa. 2009. Skyline queries based on user locations and preferences for making location-based recommendations. In Proceedings of the 2009 International Workshop on Location Based Social Networks. ACM, New York, NY, 9--16. Google ScholarDigital Library
- Cliff A. C. Lampe, Nicole Ellison, and Charles Steinfield. 2007. A familiar face (book): Profile elements as signals in an online social network. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 435--444. Google ScholarDigital Library
- Rui Li, Shengjie Wang, and Kevin Chen-Chuan Chang. 2012a. Multiple location profiling for users and relationships from social network and content. Proceedings of the VLDB Endowment 5, 11, 1603--1614. Google ScholarDigital Library
- Rui Li, Shengjie Wang, Hongbo Deng, Rui Wang, and Kevin Chen-Chuan Chang. 2012b. Towards social user profiling: Unified and discriminative influence model for inferring home locations. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 1023--1031. Google ScholarDigital Library
- Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip, and Nageswara S. V. Rao. 2013. Privacy vulnerability of published anonymous mobility traces. IEEE/ACM Transactions on Networking 21, 3, 720--733. Google ScholarDigital Library
- Jermaine Marshall, Munira Syed, and Dong Wang. 2016. Hardness-aware truth discovery in social sensing applications. In Proceedings of the 2016 International Conference on Distributed Computing in Sensor Systems. IEEE, Los Alamitos, CA, 143--152. Google ScholarCross Ref
- Jermaine Marshall and Dong Wang. 2016. Mood-sensitive truth discovery for reliable recommendation systems in social sensing. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, New York, NY, 167--174. Google ScholarDigital Library
- Jeffrey McGee, James Caverlee, and Zhiyuan Cheng. 2013. Location prediction in social media based on tie strength. In Proceedings of the 22nd ACM International Conference on Conference on Information and Knowledge Management. ACM, New York, NY, 459--468. Google ScholarDigital Library
- CUDA NVIDIA. 2008. Programming Guide.Google Scholar
- Moon-Hee Park, Jin-Hyuk Hong, and Sung-Bae Cho. 2007. Location-based recommendation system using Bayesian users preference model in mobile devices. In Ubiquitous Intelligence and Computing. Springer, 1130--1139. Google ScholarCross Ref
- Ulrike Pfeil, Raj Arjan, and Panayiotis Zaphiris. 2009. Age differences in online social networking—a study of user profiles and the social capital divide among teenagers and older users in MySpace. Computers in Human Behavior 25, 3, 643--654. Google ScholarDigital Library
- Foster Provost, Brian Dalessandro, Rod Hook, Xiaohan Zhang, and Alan Murray. 2009. Audience selection for on-line brand advertising: Privacy-friendly social network targeting. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 707--716. Google ScholarDigital Library
- L. Ramaswamy, P. Deepak, R. Polavarapu, K. Gunasekera, D. Garg, K. Visweswariah, and S. Kalyanaraman. 2009. Caesar: A context-aware, social recommender system for low-end mobile devices. In Proceedings of the 10th International Conference on Mobile Data Management: Systems, Services, and Middleware. IEEE, Los Alamitos, CA, 338--347. Google ScholarDigital Library
- C. Ratti, S. Williams, D. Frenchman, and R. M. Pulselli. 2006. Mobile landscapes: Using location data from cell phones for urban analysis. Environment and Planning B: Urban Analytics and City Science 33, 5, 727.Google Scholar
- Dong Wang, Tarek Abdelzaher, and Lance Kaplan. 2014a. Surrogate mobile sensing. IEEE Communications Magazine 52, 8, 36--41. Google ScholarCross Ref
- Dong Wang, Md Tanvir Al Amin, Tarek Abdelzaher, Dan Roth, Clare R. Voss, Lance M. Kaplan, Stephen Tratz, Jamal Laoudi, and Douglas Briesch. 2014b. Provenance-assisted classification in social networks. IEEE Journal of Selected Topics in Signal Processing 8, 4, 624--637. Google ScholarCross Ref
- Dong Wang and Chao Huang. 2015. Confidence-aware truth estimation in social sensing applications. In Proceedings of the 12th Annual IEEE International Conference on Sensing, Communication, and Networking. IEEE, Los Alamitos, CA, 336--344. Google ScholarDigital Library
- Dong Wang, Lance Kaplan, Hieu Le, and Tarek Abdelzaher. 2012. On truth discovery in social sensing: A maximum likelihood estimation approach. In Proceedings of the 11th International Conference on Information Processing in Sensor Networks. ACM, New York, NY, 233--244. Google ScholarDigital Library
- Ryen W. White, Peter Bailey, and Liwei Chen. 2009. Predicting user interests from contextual information. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 363--370. Google ScholarDigital Library
- Hongzhi Yin, Yizhou Sun, Bin Cui, Zhiting Hu, and Ling Chen. 2013. LCARS: A location-content-aware recommender system. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, New York, NY, 221--229. Google ScholarDigital Library
- Jia-Dong Zhang and Chi-Yin Chow. 2015. GeoSoCa: Exploiting geographical, social and categorical correlations for point-of-interest recommendations. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 443--452. Google ScholarDigital Library
Index Terms
- An Unsupervised Approach to Inferring the Localness of People Using Incomplete Geotemporal Online Check-In Data
Recommendations
Exploiting spatial-temporal-social constraints for localness inference using online social media
ASONAM '16: Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and MiningThe localness inference problem is to identify whether a person is a local resident in a city or not and the likelihood of a venue to attract local people. This information is critical for many applications such as targeted ads of local business, urban ...
Inferring visiting time distributions of locations from incomplete check-in data
WWW '14 Companion: Proceedings of the 23rd International Conference on World Wide WebOnline location-based services, such as Foursquare and Facebook, provide a great resource for location recommendation. As we know the time is one of the important factors on recommending places with proper time for users, since the pleasure of visiting ...
VGI as a dynamically updating data source in location-based services in urban environments
UbiCrowd '11: Proceedings of the 2nd international workshop on Ubiquitous crowdsouringUrban environments, or living cities, share the characteristic of a high degree of organized complexity. This complexity arises from the components of the urban fabric: streets, shops, offices, houses, pedestrian zones, green spaces, plazas, parking ...
Comments