research-article

Recognizing the Presence of Hidden Visual Markers in Digital Images

Authors:
Liming Xu

University of Nottingham Ningbo China, Ningbo, China

University of Nottingham Ningbo China, Ningbo, China
View Profile

,
Andrew P. French

University of Nottingham, Nottingham, United Kingdom

University of Nottingham, Nottingham, United Kingdom
View Profile

,
Dave Towey

University of Nottingham Ningbo China, Ningbo, China

University of Nottingham Ningbo China, Ningbo, China
View Profile

,
Steve Benford

University of Nottingham, Nottingham, United Kingdom

University of Nottingham, Nottingham, United Kingdom
View Profile

Thematic Workshops '17: Proceedings of the on Thematic Workshops of ACM Multimedia 2017October 2017Pages 210–218https://doi.org/10.1145/3126686.3126761

Published:23 October 2017Publication History

Thematic Workshops '17: Proceedings of the on Thematic Workshops of ACM Multimedia 2017

Pages 210–218

ABSTRACT

As the promise of Virtual and Augmented Reality (VR and AR) becomes more realistic, an interesting aspect of our enhanced living environment includes the availability ? indeed the potential ubiquity ? of scannable markers. Such markers could represent an initial step into the AR and VR worlds. In this paper, we address the important question of how to recognise the presence of visual markers in freeform digital photos. We use a particularly challenging marker format that is only minimally constrained in structure, called Artcodes. Artcodes are a type of topological marker system enabling people, by following very simple drawing rules, to design markers that are both aesthetically beautiful and machine readable. Artcodes can be used to decorate the surface of any objects, and yet can also contain a hidden digital meaning. Like some other more commonly used markers (such as Barcodes, QR codes), it is possible to use codes to link physical objects to digital data, augmenting everyday objects. Obviously, in order to trigger the behaviour of scanning and further decoding of such codes, it is first necessary for devices to be aware of the presence of Artcodes in the image.

Although considerable literature exists related to the detection of rigidly formatted structures and geometrical feature descriptors such as Harris, SIFT, and SURF, these approaches are not sufficient for describing freeform topological structures, such as Artcode images. In this paper, we propose a new topological feature descriptor that can be used in the detection of freeform topological markers, including Artcodes. This feature descriptor is called a Shape of Orientation Histogram (SOH). We construct this SOH feature vector by quantifying the level of symmetry and smoothness of the orientation histogram, and then use a Random Forest machine learning approach to classify images that contain Artcodes using the new feature vector. This system represents a potential first step for an eventual mobile device application that would detect where in an image such an unconstrained code appears. We also explain how the system handles imbalanced datasets ? important for rare, handcrafted codes such as Artcodes ? and how it is evaluated. Our experimental evaluation shows good performance of the proposed classification model in the detection of Artcodes: obtaining an overall accuracy of approx. 0.83, F₂ measure 0.83, MCC 0.68, AUC-ROC 0.93, and AUC-PR 0.91.

References

Herbert Bay, Andreas Ess, Tinne Tuytelaars, and Luc Van Gool. 2008. Speeded-up robust features (SURF). Computer vision and image understanding Vol. 110, 3 (2008), 346--359. Google ScholarDigital Library
Serge Belongie, Jitendra Malik, and Jan Puzicha. 2001. Matching shapes Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, Vol. 1. IEEE, 454--461.Google Scholar
Ross Bencina, Martin Kaltenbrunner, and Sergi Jorda. 2005. Improved topological fiducial tracking in the reactivision system Computer Vision and Pattern Recognition-Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on. IEEE, 99--99. Google ScholarDigital Library
Steve Benford, Adrian Hazzard, Alan Chamberlain, Kevin Glover, Chris Greenhalgh, Liming Xu, Michaela Hoare, and Dimitrios Darzentas. 2016. Accountable artefacts: the case of the Carolan guitar Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI'16). ACM, San Jose, CA, USA, 1163--1175. Google ScholarDigital Library
Steve Benford, Adrain Hazzard, Alan Chamberlain, and Liming Xu. 2015. Augmenting a Guitar with Its Digital Footprint. In Proceedings of International Conference on New Interfaces for Musical Expression (NIME'15). Louisiana, USA, 303--306. Google ScholarDigital Library
Leo Breiman. 2001. Random forests. Machine learning, Vol. 45, 1 (2001), 5--32. Google ScholarDigital Library
Michael Calonder, Vincent Lepetit, Christoph Strecha, and Pascal Fua. 2010. Brief: Binary robust independent elementary features. Proceedings of the 2010 European Conference on Computer Vision (ECCV'10) Vol. 6314 (2010), 778--792. Google ScholarDigital Library
Nitesh V. Chawla, Kevin W. Bowyer, Lawrence O. Hall, and W. Philip Kegelmeyer. 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research Vol. 16 (2002), 321--357. Google ScholarCross Ref
Nitesh V. Chawla, Aleksandar Lazarevic, Lawrence O. Hall, and Kevin W. Bowyer. 2003. SMOTEBoost: Improving prediction of the minority class in boosting European Conference on Principles of Data Mining and Knowledge Discovery. Springer, 107--119.Google Scholar
Corinna Cortes and Vladimir Vapnik. 1995. Support-vector networks. Machine learning, Vol. 20, 3 (1995), 273--297. Google ScholarDigital Library
Enrico Costanza and Jeffrey Huang. 2009. Designable visual markers. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 1879--1888. Google ScholarDigital Library
Navneet Dalal and Bill Triggs. 2005. Histograms of oriented gradients for human detection 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Vol. 1. IEEE, 886--893. Google ScholarDigital Library
Charles Elkan. 2001. The foundations of cost-sensitive learning. In International joint conference on artificial intelligence, Vol. 17. Lawrence Erlbaum Associates Ltd, 973--978. Google ScholarDigital Library
Wei Fan, Salvatore J. Stolfo, Junxin Zhang, and Philip K. Chan. 1999. AdaCost: misclassification cost-sensitive boosting Proceedings of the Sixteenth International Conference on Machine Learning (ICML'99). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 97--105. Google ScholarDigital Library
Mark Fiala. 2005. ARTag, a fiducial marker system using digital techniques Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), Vol. 2. IEEE, San Diego, CA, USA, 590--596. Google ScholarDigital Library
William T. Freeman and Michal Roth. 1995. Orientation histograms for hand gesture recognition International workshop on automatic face and gesture recognition, Vol. 12. 296--301.Google Scholar
Yoav Freund and Robert E Schapire. 1995. A desicion-theoretic generalization of on-line learning and an application to boosting European conference on computational learning theory. Springer, 23--37. Google ScholarDigital Library
Hui Han, Wen-Yuan Wang, and Bing-Huan Mao. 2005. Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning International Conference on Intelligent Computing. Springer, 878--887. Google ScholarDigital Library
Chris Harris and Mike Stephens. 1988. A combined corner and edge detector. In Alvey vision conference, Vol. 15. Manchester, UK, 50.Google Scholar
Haibo He and Edwardo A. Garcia. 2009. Learning from imbalanced data. IEEE Transactions on knowledge and data engineering, Vol. 21, 9 (2009), 1263--1284. Google ScholarDigital Library
Hirokazu Kato and Mark Billinghurst. 1999. Marker tracking and hmd calibration for a video-based augmented reality conferencing system. In Augmented Reality, 1999.(IWAR'99) Proceedings. 2nd IEEE and ACM International Workshop on. IEEE, 85--94. Google ScholarDigital Library
David G. Lowe. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision Vol. 60, 2 (2004), 91--110. Google ScholarDigital Library
Brian W. Matthews. 1975. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochimica et Biophysica Acta (BBA)-Protein Structure, Vol. 405, 2 (1975), 442--451.Google ScholarCross Ref
Michael W. Maynard. 1993. Classifying apparatus and method. US Patent 5,232,099. (3 Aug.. 1993).Google Scholar
Rupert Meese, Shakir Ali, Emily-Clare Thorne, Steve D. Benford, Anthony Quinn, Richard Mortier, Boriana N. Koleva, Tony Pridmore, and Sharon L. Baurley. 2013. From codes to patterns: designing interactive decoration for tableware Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI'13). ACM, Paris, France, 931--940. Google ScholarDigital Library
Krystian Mikolajczyk, Cordelia Schmid, and Andrew Zisserman. 2004. Human detection based on a probabilistic assembly of robust part detectors Proceedings of 2004 European Conference on Computer Vision (ECCV'04). Springer, 69--82.Google Scholar
Anuj Mohan, Constantine Papageorgiou, and Tomaso Poggio. 2001. Example-based object detection in images by components. IEEE transactions on pattern analysis and machine intelligence, Vol. 23, 4 (2001), 349--361. Google ScholarDigital Library
Kher Hui Ng and Shazia Paras Shaikh. 2016. Design of a mobile garden guide based on Artcodes. Proceedings of 2016 International Conference on User Science and Engineering (i-USEr'16), IEEE, 23--28.Google ScholarCross Ref
Constantine Papageorgiou and Tomaso Poggio. 2000. A trainable system for object detection. International Journal of Computer Vision Vol. 38, 1 (2000), 15--33. Google ScholarDigital Library
David M. W. Powers. 2011. Evaluation: From precision, recall and f-measure to roc., informedness, markedness & correlation. Journal of Machine Learning Technologies Vol. 2, 1 (2011), 37--63.Google ScholarCross Ref
Edward Rosten and Tom Drummond. 2006. Machine learning for high-speed corner detection. Proceedings of 2006 European Conference on Computer Vision (ECCV'06). Springer, 430--443. Google ScholarDigital Library
Ethan Rublee, Vincent Rabaud, Kurt Konolige, and Gary Bradski. 2011. ORB: an efficient alternative to SIFT or SURF. In Proceedings of 2011 IEEE International Conference on Computer Vision (ICCV'11). IEEE, 2564--2571. Google ScholarDigital Library
Chris Seiffert, Taghi M Khoshgoftaar, Jason Van Hulse, and Amri Napolitano. 2010. RUSBoost: A hybrid approach to alleviating class imbalance. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans, Vol. 40, 1 (2010), 185--197. Google ScholarDigital Library
ISO/IEC International Standard. 2000. QR Code. (2000).Google Scholar
Emily-Clare Thorn, Stefan Rennick-Egglestone, Boriana Koleva, William Preston, Steve Benford, Anthony Quinn, and Richard Mortier. 2016. Exploring large-scale interactive public illustrations Proceedings of the 2016 ACM Conference on Designing Interactive Systems. ACM, 17--27. Google ScholarDigital Library
Engin Tola, Vincent Lepetit, and Pascal Fua. 2008. A fast local descriptor for dense matching. In Proceedings of 2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08). IEEE, 1--8.Google ScholarCross Ref

Index Terms

Recognizing the Presence of Hidden Visual Markers in Digital Images
1. Computing methodologies
  1. Computer graphics
    1. Graphics systems and interfaces
      1. Mixed / augmented reality
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning algorithms
      1. Ensemble methods
        Bagging
2. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms
      1. Mixed / augmented reality
  2. Ubiquitous and mobile computing
    1. Ubiquitous and mobile computing theory, concepts and paradigms
      1. Mobile computing

Recommendations

Enabling Hand-Crafted Visual Markers at Scale
DIS '17: Proceedings of the 2017 Conference on Designing Interactive Systems

As locative media and augmented reality spread into the everyday world so it becomes important to create aesthetic visual markers at scale. We explore a designer-centred approach in which skilled designers handcraft seed designs that are automatically ...
Read More
Clothing integrated augmented reality markers
MUM '17: Proceedings of the 16th International Conference on Mobile and Ubiquitous Multimedia

The future vision of commonplace, everyday wear of augmented reality glasses, brings potential to the fashion world, where physical and virtual aspects can be blended to create an overall aesthetic. An enabler for this is the integration of computer-...
Read More
Clothes integrated visual markers as self-expression tool
MobileHCI '16: Proceedings of the 18th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct

In this demo, we present a concept where garment-integrated visual markers are used for self-expression. We present a wearable design, where clothing design style integrates with the visual design of AR markers, which are read with a mobile phone or ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
Thematic Workshops '17: Proceedings of the on Thematic Workshops of ACM Multimedia 2017
October 2017
558 pages
ISBN:9781450354165
DOI:10.1145/3126686
Program Chairs:
Wanmin Wu
Google, USA
,
Jianchao Yang
Snap Inc., USA
,
Qi Tian
The University of Texas at San Antonio, USA
,
Roger Zimmermann
National University of Singapore, Singapore
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 October 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
artcodes
classifier
topological feature descriptor
visual markers
Qualifiers
- research-article
Conference
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 172
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Recognizing the Presence of Hidden Visual Markers in Digital Images

Thematic Workshops '17: Proceedings of the on Thematic Workshops of ACM Multimedia 2017

ABSTRACT

References

Cited By

Index Terms

Recommendations

Enabling Hand-Crafted Visual Markers at Scale

Clothing integrated augmented reality markers

Clothes integrated visual markers as self-expression tool