research-article
DOI: 10.1145/1743666.1743692

Inferring object relevance from gaze in dynamic scenes

Published: 22 March 2010

ABSTRACT

As prototypes of data glasses with both data augmentation and gaze-tracking capabilities become available, it is now possible to develop proactive gaze-controlled user interfaces that display information about objects, people, and other entities in real-world settings. To decide which objects the augmented information should describe, and how saliently to augment, the system needs an estimate of how important or relevant each object in the scene is to the user at a given time. These estimates can be used to minimize distraction of the user and to manage the spatial layout of the augmented items efficiently. This work is a feasibility study on inferring the relevance of objects in dynamic scenes from gaze. We collected gaze data from subjects watching a video under a pre-defined task. The results show that a simple ordinal logistic regression model ranks scene objects by relevance with promising accuracy.
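The ordinal-logistic-regression approach mentioned in the abstract can be sketched as a proportional-odds model fitted by maximum likelihood. The sketch below is illustrative, not the paper's implementation: the gaze features (total fixation time, fixation count), the synthetic data, and the three-level relevance labels are all assumptions made for the example.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import expit

rng = np.random.default_rng(0)

# Synthetic stand-ins for per-object gaze features, e.g. total fixation
# time and fixation count (hypothetical; the paper's features may differ).
n_objects = 200
X = rng.normal(size=(n_objects, 2))
true_w = np.array([1.5, 0.8])
latent = X @ true_w + rng.logistic(size=n_objects)
y = np.digitize(latent, bins=[-1.0, 1.0])  # ordinal relevance: 0 < 1 < 2

def neg_log_lik(params, X, y):
    """Proportional-odds model: P(y <= k | x) = sigmoid(theta_k - w.x)."""
    w, t0, log_gap = params[:2], params[2], params[3]
    thresholds = np.array([t0, t0 + np.exp(log_gap)])  # keeps theta_0 < theta_1
    eta = X @ w
    cum = expit(thresholds[None, :] - eta[:, None])    # P(y <= 0), P(y <= 1)
    cum = np.hstack([np.zeros((len(y), 1)), cum, np.ones((len(y), 1))])
    # Per-object probability of its observed category.
    p = cum[np.arange(len(y)), y + 1] - cum[np.arange(len(y)), y]
    return -np.sum(np.log(np.clip(p, 1e-12, None)))

res = minimize(neg_log_lik, x0=np.zeros(4), args=(X, y), method="BFGS")
w_hat = res.x[:2]

# Objects are ranked by the fitted latent relevance score w.x;
# higher scores mean higher inferred relevance.
scores = X @ w_hat
ranking = np.argsort(-scores)
```

A proportional-odds model is a natural fit here because relevance labels are ordered but the gaps between levels are not assumed to be equal; ranking then only requires the fitted linear score, not the thresholds.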


Published in

ETRA '10: Proceedings of the 2010 Symposium on Eye-Tracking Research & Applications
March 2010, 353 pages
ISBN: 9781605589947
DOI: 10.1145/1743666

      Copyright © 2010 ACM


Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall acceptance rate: 69 of 137 submissions (50%)
