ABSTRACT
We argue that the most desirable architecture for mobile image recognition runs the complete algorithm on the mobile device. Alternatives that run the recognizer on a remote server are less desirable because the delay between image capture and receipt of a result can cause users to abandon the technique. We present a method for mobile recognition of paper documents and apply it to newspapers, letting readers retrieve electronic data linked to articles, photos, and advertisements. We show that the index for a reasonable collection of daily newspapers can be downloaded in less than a minute and fits in the memory of today's mid-range smartphones. Experimental results show that the recognition system has an overall error rate of less than 1%. We achieved a run time of 1.2 seconds per image with a collection of 140 newspaper pages on an HTC-8282 Windows Mobile phone.
Index Terms
- Mobile image recognition: architectures and tradeoffs