ABSTRACT
We present RetroDepth, a new vision-based system for accurately sensing the 3D silhouettes of hands, styluses, and other objects, as they interact on and above physical surfaces. Our setup is simple, cheap, and easily reproducible, comprising of two infrared cameras, diffuse infrared LEDs, and any off-the-shelf retro-reflective material. The retro-reflector aids image segmentation, creating a strong contrast between the surface and any object in proximity. A new highly efficient stereo matching algorithm precisely estimates the 3D contours of interacting objects and the retro-reflective surfaces. A novel pipeline enables 3D finger, hand and object tracking, as well as gesture recognition, purely using these 3D contours. We demonstrate high-precision sensing, allowing robust disambiguation between a finger or stylus touching, pressing or interacting above the surface. This allows many interactive scenarios that seamlessly mix together freehand 3D interactions with touch, pressure and stylus input. As shown, these rich modalities of input are enabled on and above any retro-reflective surface, including custom "physical widgets" fabricated by users. We compare our system with Kinect and Leap Motion, and conclude with limitations and future work.
Supplemental Material
- 3Gear Systems Inc. http://threegear.com/, 2013.Google Scholar
- Agarwal, A., Izadi, S., Chandraker, M., and Blake, A. High precision multi-touch sensing on surfaces using cameras. In Tabletop'07, 197--200.Google Scholar
- Akaoka, E., Ginn, T., and Vertegaal, R. Displayobjects: prototyping functional physical interfaces on 3d styrofoam, paper or cardboard models. In TEI'10, ACM (2010), 49--56. Google ScholarDigital Library
- Annett, M., Grossman, T., Wigdor, D., and Fitzmaurice, G. Medusa: a proximity-aware multi-touch tabletop. In UIST'11, ACM (2011), 337--346. Google ScholarDigital Library
- Benko, H., Jota, R., and Wilson, A. Miragetable: freehand interaction on a projected augmented reality tabletop. In CHI'12, ACM (2012), 199--208. Google ScholarDigital Library
- Bleyer, M., Rhemann, C., and Rother, C. Patchmatch stereo - stereo matching with slanted support windows. In British Machine Vision Conference (2011).Google ScholarCross Ref
- Breiman, L. Random Forests. Machine Learning 45, 1 (Oct. 2001), 5--32. Google ScholarDigital Library
- Brown, M. Z., Burschka, D., and Hager, G. D. Advances in computational stereo. PAMI 25, 8 (2003), 993--1008. Google ScholarDigital Library
- Davis, J. W., and Bobick, A. F. Sideshow: A silhouettebased interactive dual-screen environment. Tech. rep., MIT, 1998.Google Scholar
- Dippon, A., and Klinker, G. Kinecttouch: accuracy test for a very low-cost 2.5 d multitouch tracking system. In ITS'11, ACM (2011), 49--52. Google ScholarDigital Library
- Haptix. http://www.haptixtouch.com/, 2013.Google Scholar
- Haubner, N., Schwanecke, U., Dörner, R., Lehmann, S., and Luderschmidt, J. Detecting interaction above digital tabletops using a single depth camera. Machine Vision and Applications (2013), 1--13. Google ScholarDigital Library
- Hilliges, O., Izadi, S., Wilson, A. D., Hodges, S., Garcia-Mendoza, A., and Butz, A. Interactions in the air: adding further depth to interactive tabletops. In UIST'09, ACM (2009), 139--148. Google ScholarDigital Library
- Hilliges, O., Kim, D., Izadi, S., Weiss, M., and Wilson, A. Holodesk: direct 3d interactions with a situated seethrough display. In CHI'12, ACM (2012), 2421--2430. Google ScholarDigital Library
- Hirsch, M., Lanman, D., Holtzman, H., and Raskar, R. Bidi screen: a thin, depth-sensing lcd for 3d interaction using light fields. In TOG, vol. 28, ACM (2009), 159. Google ScholarDigital Library
- Izadi, S., Agarwal, A., Criminisi, A., Winn, J., Blake, A., and Fitzgibbon, A. C-slate: a multi-touch and object recognition system for remote collaboration using horizontal surfaces. In Tabletop'07, IEEE (2007), 3--10.Google Scholar
- Izadi, S., Hodges, S., Butler, A., West, D., Rrustemi, A., Molloy, M., and Buxton,W. Thinsight: a thin form-factor interactive surface technology. CACM 52, 12, 90--98. Google ScholarDigital Library
- Izadi, S., Hodges, S., Taylor, S., Rosenfeld, D., Villar, N., Butler, A., and Westhues, J. Going beyond the display: a surface technology with an electronically switchable diffuser. In UIST'08, ACM (2008), 269--278. Google ScholarDigital Library
- Keskin, C., Kirac, F., Kara, Y. E., and Akarun, L. Hand Pose Estimation and Hand Shape Classification Using Multi-layered Randomized Decision Forests. In ECCV'12 (2012). Google ScholarDigital Library
- Koike, H., Sato, Y., and Kobayashi, Y. Integrating paper and digital information on enhanceddesk: a method for realtime finger tracking on an augmented desk system. TOCHI 8, 4 (2001), 307--322. Google ScholarDigital Library
- Krueger, M. W. Artificial reality II, vol. 10. Addison-Wesley Reading (Ma), 1991.Google Scholar
- Lanman, D., and Taubin, G. Build your own 3d scanner: 3d photography for beginners. In ACM SIGGRAPH 2009 Courses, ACM (2009), 8. Google ScholarDigital Library
- Lee, J., Park, K. S., and Hahn, M. The 3d sensor table for bare hand tracking and posture recognition. In Advances in Multimedia Modeling. Springer, 2006, 138--146. Google ScholarDigital Library
- Liu, Y., Weibel, N., and Hollan, J. Interactive space: A framework for prototyping multi-touch interaction on and above the desktop. In CHI'13, vol. 13 (2013).Google Scholar
- Malik, S., and Laszlo, J. Visual touchpad: a two-handed gestural input device. In ICMI'04, 289--296. Google ScholarDigital Library
- Marquardt, N., Jota, R., Greenberg, S., and Jorge, J. A. The continuous interaction space: interaction techniques unifying touch and gesture on and above a digital surface. In INTERACT'11. Springer, 2011, 461--476. Google ScholarDigital Library
- Marroquim, R., Kraus, M., and Cavalcanti, P. R. Efficient point-based rendering using image reconstruction. In SPBG (2007), 101--108.Google Scholar
- Matsushita, N., and Rekimoto, J. Holowall: designing a finger, hand, body, and object sensitive wall. In UIST'97, ACM (1997), 209--210. Google ScholarDigital Library
- Moeller, J., Kerne, A., and Moeller, J. Zerotouch: An optical multi-touch and free-air interaction architecture. In CH'12, ACM (2012), 2165--2174. Google ScholarDigital Library
- Paradiso, J. A. Several sensor approaches that retrofit large surfaces for interactivity. In UbiComp 2002 (2002).Google Scholar
- Remondino, F., and Stoppa, D. Tof range-imaging cameras. Springer, 2013. Google ScholarDigital Library
- Scharstein, D., and Szeliski, R. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. In IJCV (2002). Google ScholarDigital Library
- Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., and Blake, A. Realtime Human Pose Recognition in Parts from Single Depth Images. In CVPR (2011). Google ScholarDigital Library
- Strecha, C., and Van Gool, L. PDE-based multi-view depth estimation. In Proc. 3DIMPVT (2002), 416--425.Google ScholarCross Ref
- Subramanian, S., Aliakseyeu, D., and Lucero, A. Multilayer interaction for digital tables. In UIST'06, ACM (2006), 269--272. Google ScholarDigital Library
- Takeoka, Y., Miyaki, T., and Rekimoto, J. Z-touch: an infrastructure for 3d gesture interaction in the proximity of tabletop surfaces. In ITS'10, ACM (2010), 91--94. Google ScholarDigital Library
- Ullmer, B., and Ishii, H. The metadesk: models and prototypes for tangible user interfaces. In UIST'97, ACM (1997), 223--232. Google ScholarDigital Library
- Wang, R., Paris, S., and Popović, J. 6d hands: markerless hand-tracking for computer aided design. In UIST'11, ACM (2011), 549--558. Google ScholarDigital Library
- Wellner, P. Interacting with paper on the digitaldesk. CACM 36, 7 (1993), 87--96. Google ScholarDigital Library
- Wesche, G., and Seidel, H.-P. Freedrawer: a free-form sketching system on the responsive workbench. In VRST, ACM (2001), 167--174. Google ScholarDigital Library
- Wilson, A. D. Playanywhere: a compact interactive tabletop projection-vision system. In UIST'05, ACM (2005), 83--92. Google ScholarDigital Library
- Wilson, A. D. Robust computer vision-based detection of pinching for one and two-handed gesture input. In UIST'06, ACM (2006), 255--258. Google ScholarDigital Library
- Wilson, A. D. Depth-sensing video cameras for 3d tangible tabletop interaction. In Tabletop'07, IEEE (2007), 201--204.Google Scholar
- Wilson, A. D. Using a depth camera as a touch sensor. In ITS'10, ACM (2010), 69--72. Google ScholarDigital Library
Index Terms
- RetroDepth: 3D silhouette sensing for high-precision input on and above physical surfaces
Recommendations
Occlusion-aware Hand Posture Based Interaction on Tabletop Projector
UIST '19 Adjunct: Adjunct Proceedings of the 32nd Annual ACM Symposium on User Interface Software and TechnologyConventional camera-based hand interaction technique suffered from self-occlusion among fingers, which lowers the detection accuracy of fingertip positions, leading to uncomfortable UI controls. Based on observations, self-occlusion depends on hand ...
Easy vs. Tricky: The Shape Effect in Tracing, Selecting, and Steering With Mouse, Stylus, and Touch
AcademicMindTrek '13: Proceedings of International Conference on Making Sense of Converging MediaThis short paper is a work-in-progress report on an experimental comparison and evaluation of users' performance in four line-tracing tasks based on two shapes and performed with three input methods (mouse, stylus, and touch-input). The shapes' ...
Reflective haptics: enhancing stylus-based interactions on touch screens
EuroHaptics'10: Proceedings of the 2010 international conference on Haptics: generating and perceiving tangible sensations, Part IIn this paper, we introduce the prototype of a low cost haptically augmented stylus for pen computing on touch screens. The stylus supports human-computer interaction through a dynamic haptic feedback. This reflective feedback is generated by a ...
Comments