skip to main content
research-article

Methods for extracting place semantics from Flickr tags

Published:17 January 2009Publication History
Skip Abstract Section

Abstract

We describe an approach for extracting semantics for tags, unstructured text-labels assigned to resources on the Web, based on each tag's usage patterns. In particular, we focus on the problem of extracting place semantics for tags that are assigned to photos on Flickr, a popular-photo sharing Web site that supports location (latitude/longitude) metadata for photos. We propose the adaptation of two baseline methods, inspired by well-known burst-analysis techniques, for the task; we also describe two novel methods, TagMaps and scale-structure identification. We evaluate the methods on a subset of Flickr data. We show that our scale-structure identification method outperforms existing techniques and that a hybrid approach generates further improvements (achieving 85% precision at 81% recall). The approach and methods described in this work can be used in other domains such as geo-annotated Web pages, where text terms can be extracted and associated with usage patterns.

References

  1. Ahern, S., Naaman, M., Nair, R., and Yang, J. H.-I. 2007. World Explorer: Visualizing aggregate data from unstructured text in geo-referenced collections. In Proceedings of the Conference on Digital Libraries (JCDL). ACM, New York, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aipperspach, R., Rattenbury, T., Woodruff, A., and Canny, J. 2006. A quantitative method for revealing and comparing places in the home. In Proceedings of the International Conference on Ubiquitous Computing (Ubicomp). Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ames, M. and Naaman, M. 2007. Why we tag: Motivations for annotation in mobile and online media. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Amitay, E., Har'El, N., Sivan, R., and Soffer, A. 2004. Web-a-Where: Geotagging Web content. In Proceedings of the 27th Annual International Conference on Research and Development in Information Retrieval (SIGIR). ACM Press, 273--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Arampatzis, A., van Kreveld, M., Reinbacher, I., Clough, P., Joho, H., Sanderson, M., Jones, C. B., Vaid, S., Benkert, M., and Wolff, A. 2004. Web-Based delineation of imprecise regions. In Proceedings of the Workshop on Geographic Information Retrieval.Google ScholarGoogle Scholar
  6. Box, G. and Jenkins, G. 1976. Time Series Analysis: Forecasting and Control. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Brunsdon, C. 1995. Estimating probability surfaces for geographical point data: An adaptive kernel algorithm. In Comput. Geosci. 21, 7, 877--894. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Brunsdon, C., Fotheringham, A., and Charlton, M. 2002. Geographically weighted summary statistics: A framework for localized exploratory data analysis. In Comput. Environm. Urban Syst. 26, 501--524.Google ScholarGoogle ScholarCross RefCross Ref
  9. Bulterman, D. C. 2004. Is it time for a moratorium on metadata? IEEE MultiMedia 11, 4 (Oct.), 10--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Buyukokkten, O., Cho, J., Garcia-Molina, H., Gravano, L., and Shivakumar, N. 1999. Exploiting geographical location information of Web pages. In Proceedings of the Workshop on Web Databases (WebDB). Held in conjunction with ACM SIGMOD'99. http://dbpubs.stanford.edu/pub/1999-4.Google ScholarGoogle Scholar
  11. Cai, L. and Hofmann, T. 2003. Text categorization by boosting automatically extracted concepts. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, 182--189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Davis, M., King, S., Good, N., and Sarvas, R. 2004. From context to content: Leveraging context to infer media metadata. In Proceedings of the ACM International Conference on Multimedia. ACM, 188--195. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ding, J., Gravano, L., and Shivakumar, N. 2000. Computing geographical scopes of Web resources. In Proceedings of the 26th International Conference on Very Large Databases. Morgan Kaufmann, 545--556. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Dubinko, M., Kumar, R., Magnani, J., Novak, J., Raghavan, P., and Tomkins, A. 2006. Visualizing tags over time. In Proceedings of the 15th International Conference on World Wide Web (WWW). ACM Press, New York, 193--202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Epshtein, B., Ofek, E., Wexler, Y., and Zhang, P. 2007. Hierarchical photo organization using geo-relevance. In Proceedings of the ACM International Symposium on Advances in Geographic Information Systems. ACM, 1--7. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Golder, S. A. and Huberman, B. A. 2006. Usage patterns of collaborative tagging systems. J. Inf. Sci. 32, 2, 198--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Guralnik, V. and Srivastava, J. 1999. Event detection from time series data. In Proceedings of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM Press, New York, 33--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Jaffe, A., Naaman, M., Tassa, T., and Davis, M. 2006. Generating summaries and visualization for large collections of geo-referenced photographs. In Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval (MIR). ACM Press, New York, 89--98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Jones, C., Alani, H., and Tudhope, D. 2001. Geographical information retrieval with ontologies of place. In Proceedings of the Conference on Spatial Information Theory. Vol. 2205. Springer, 322--335. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Kleinberg, J. 2003. Bursty and hierarchical structure in streams. Data Mining Knowl. Discov. 7, 4, 373--397. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Kruskal, J. B. 1956. On the shortest spanning subtree of a graph and the traveling salesman problem. In Proc. Amer. Math. Soc. 7, 1, 48--50.Google ScholarGoogle ScholarCross RefCross Ref
  22. Kulldorff, M. 1999. Spatial scan statistics: Models, calculations, and applications. In Scan Statistics and Applications, Glaz and Balakrishnan, eds., Springer, Boston, Birkhauser, 303--322.Google ScholarGoogle Scholar
  23. Marlow, C., Naaman, M., Boyd, D., and Davis, M. 2006. Ht06, tagging paper, taxonomy, flickr, academic article, to read. In Proceedings of the 7th Conference on Hypertext and Hypermedia. ACM, 31--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. McDowall, D., McCleary, R., Meidinger, E. E., and Jr., R. A. H. 1980. Interrupted Time Series Analysis. Sage University PaperSeries on Quantitative Applications in the Social Sciences.Google ScholarGoogle Scholar
  25. Naaman, M., Paepcke, A., and Garcia-Molina, H. 2003. From where to what: Metadata sharing for digital photographs with geographic coordinates. In Proceedings of the 10th International Conference on Cooperative Information Systems (CoopIS). Springer, Berlin, 196--217.Google ScholarGoogle Scholar
  26. Ng, A., Jordan, M., and Weiss, Y. 2001. On spectral clustering: Analysis and an algorithm. In Advances in Neural Information Processing Systems. Vol. 14.Google ScholarGoogle Scholar
  27. Openshaw, S. 1984. The Modifiable Areal Unit Problem: Concepts and Techniques in Modern Geography. Geo Books, Norwich.Google ScholarGoogle Scholar
  28. Openshaw, S., Charlton, M., Wymer, C., and Craft, A. 1987. A mark 1 geographical analysis machine for the automated analysis of point data sets. Int. J. Geograph. Inf. Syst. 1, 4, 335--358.Google ScholarGoogle ScholarCross RefCross Ref
  29. Purves, R., Clough, P., and Joho, H. 2005. Identifying imprecise regions for geographic information retrieval using the web. In Proceedings of the Conference GISRUK.Google ScholarGoogle Scholar
  30. Rattenbury, T., Good, N., and Naaman, M. 2007. Towards automatic extraction of event and place semantics from Flickr tags. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 103--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Sarin, S., Nagahashi, T., Miyosawa, T., and Kameyama, W. 2007. Exploiting users' personal and public information for personal photo annotation. In Proceedings of the IEEE International Conference on Multimedia. IEEE, 564--567.Google ScholarGoogle Scholar
  32. Schmitz, P. 2006. Inducing ontology from Flickr tags. In Proceedings of the Workshop on Collaborative Web Tagging at WWW2006.Google ScholarGoogle Scholar
  33. Vlachos, M., Meek, C., Vagena, Z., and Gunopulos, D. 2004. Identifying similarities, periodicities and bursts for online search queries. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM Press, New York, 131--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Wang, C., Wang, J., Xie, X., and Ma, W.-Y. 2007. Mining geographic knowledge using location aware topic model. In Proceedings of the ACM Workshop on Geographical Information Retrieval. ACM, 65--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Wang, L., Wang, C., Xie, X., Forman, J., Lu, Y., Ma, W.-Y., and Li, Y. 2005. Detecting dominant locations from search queries. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 424--431. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Witkin, A. 1983. Scale space filtering. In Proceedings of the International Joint Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  37. Zaragoza, H., Rode, H., Mika, P., Atserias, J., Ciaramita, M., and Attardi, G. 2007. Ranking very many typed entities on Wikipedia. In Proceedings of the ACM Conference on Information and Knowledge Management. ACM, 1015--1018. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Zhou, C., Frankowski, D., Ludford, P., Shekhar, S., and Terveen, L. 2007. Discovering personally meaningful places: An interactive clustering approach. ACM Trans. Inf. Syst. 25, 3, 1--31. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Methods for extracting place semantics from Flickr tags

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on the Web
      ACM Transactions on the Web  Volume 3, Issue 1
      January 2009
      123 pages
      ISSN:1559-1131
      EISSN:1559-114X
      DOI:10.1145/1462148
      Issue’s Table of Contents

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 January 2009
      • Accepted: 1 August 2008
      • Revised: 1 June 2008
      • Received: 1 December 2007
      Published in tweb Volume 3, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader