skip to main content
10.1145/2461466.2461481acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Geo-visual ranking for location prediction of social images

Authors Info & Claims
Published:16 April 2013Publication History

ABSTRACT

Predicting geographic location using exclusively the visual content of images holds the promise of greatly benefiting users' access to media collections. In this paper, we present a visual-content-based approach that predicts where in the world a social image was taken. We employ a ranking method that assigns a query photo the geo-location of its most likely geo-visual neighbor in the social image collection. The novelty of the approach is that ranking makes use not only of the photos themselves, but also their geo-visual neighbors. In contrast to other approaches, we do not restrict the locations we predict to landmarks or specific cities. The approach is evaluated on a set of 3 million geo-tagged photos from Flickr, released by MediaEval 2012. Experiments show that the proposed system delivers a substantive performance improvement compared with previously proposed, related visual content-based approaches. The discussion illustrates how photo densities, geo-visual redundancy and uploader patterns characteristic of social image collections impacts the performance.

References

  1. H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool. Speeded-up robust features (SURF). Computer vision and image understanding, 110(3):346--359, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Chen et al. City-scale landmark identification on mobile devices. In Proc. CVPR '11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In Proc. WWW '09, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Friedland, J. Choi, H. Lei, and A. Janin. Multimodal location estimation on Flickr videos. In Proc. WSM '11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Q. Hao et al. Travelscope: standing on the shoulders of dedicated travelers. In Proc. MM '09, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. C. Hauff and G. Houben. Placing images on the world map: a microblog-based enrichment approach. In Proc. SIGIR '12, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Hays and A. Efros. IM2GPS: estimating geographic information from a single image. In Proc. CVPR '08, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  8. H. Jégou, M. Douze, and C. Schmid. Improving bag-of-features for large scale image search. International Journal of Computer Vision, 87(3):316--336, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. Juan and O. Gwun. A comparison of SIFT, PCA-SIFT and SURF. International Journal of Image Processing (IJIP), 3(4):143--152, 2009.Google ScholarGoogle Scholar
  10. L. S. Kennedy and M. Naaman. Generating diverse and representative image search results for landmarks. In Proc. WWW '08, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. H. Kretzschmar, C. Stachniss, C. Plagemann, and W. Burgard. Estimating landmark locations from geo-referenced photographs. In Proc. IROS '08, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  12. M. Larson et al. Automatic tagging and geotagging in video collections and communities. In Proc. ICMR '11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Li, D. Crandall, and D. Huttenlocher. Landmark classification in large-scale image collections. In Proc. ICCV '09, 2009.Google ScholarGoogle Scholar
  14. D. Lowe. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2):91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Luo, D. Joshi, J. Yu, and A. Gallagher. Geotagging in multimedia and computer vision-a survey. Multimedia Tools Appl., 51(1):187--211, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Mikolajczyk and C. Schmid. Scale & affine invariant interest point detectors. International journal of computer vision, 60(1):63--86, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. O. A. B. Penatti, L. T. Li, J. Almeida, and R. da S. Torres. A visual approach for video geocoding using bag-of-scenes. In Proc. ICMR '12, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Rae and P. Kelm. Working notes for the Placing Task at MediaEval 2012. In MediaEval 2012 Workshop, 2012.Google ScholarGoogle Scholar
  19. T. Rattenbury, N. Good, and M. Naaman. Towards automatic extraction of event and place semantics from Flickr tags. In Proc. SIGIR '07, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. P. Serdyukov, V. Murdock, and R. van Zwol. Placing Flickr photos on a map. In Proc. SIGIR '09, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Sivic and A. Zisserman. Video Google: a text retrieval approach to object matching in videos. In Proc. ICCV '03, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. U. Steinhoff et al. How computer vision can help in outdoor positioning. In Proc. AmI '07, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. O. Van Laere, S. Schockaert, and B. Dhoedt. Finding locations of Flickr resources using language models and similarity search. In Proc. ICMR '11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. K. Yannis et al. VIRaL: Visual image retrieval and localization. Multimedia Tools and Applications, 51:555--592, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. R. Zamir and M. Shah. Accurate image localization based on Google Maps street view. In Proc. ECCV '10, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. W. Zhang and J. Kosecka. Image based localization in urban environments. In Proc. 3DPVT '06, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Geo-visual ranking for location prediction of social images

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
        April 2013
        362 pages
        ISBN:9781450320337
        DOI:10.1145/2461466

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 16 April 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        ICMR '13 Paper Acceptance Rate38of96submissions,40%Overall Acceptance Rate254of830submissions,31%

        Upcoming Conference

        ICMR '24
        International Conference on Multimedia Retrieval
        June 10 - 14, 2024
        Phuket , Thailand

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader