research-article

Floating-point Precision and Deformation Awareness for Scalable and Robust 3D Face Alignment

Authors:
Jacob Morton

POSTECH, Pohang

POSTECH, Pohang
View Profile

,
Seungyong Lee

POSTECH, Pohang, Republic of Korea

POSTECH, Pohang, Republic of Korea
View Profile

VRST '19: Proceedings of the 25th ACM Symposium on Virtual Reality Software and TechnologyNovember 2019Article No.: 25Pages 1–10https://doi.org/10.1145/3359996.3364260

Published:12 November 2019Publication History

VRST '19: Proceedings of the 25th ACM Symposium on Virtual Reality Software and Technology

Pages 1–10

ABSTRACT

This paper improves the accuracy of heatmap-based 3D face alignment neural networks. Many current approaches in face alignment are limited by two major problems, quantization and the lack of regularization of heatmaps. The first limitation is caused by the non-differentiable argmax function, which extracts landmark coordinates from heatmaps as integer indices. Heatmaps are generated at low-resolution to reduce the memory and computational costs, which results in heatmaps far lower than the input image’s resolution. We propose a heatmap generator network producing floating-point precision heatmaps that are scalable to higher-resolutions. To resolve the second limitation, we propose a novel deformation constraint on heatmaps. The constraint is based on graph-Laplacian and enables a heatmap generator to regularize overall shape of the output face landmarks using the global face structure. By eliminating quantization and including regularization, our method can vastly improve landmark localization accuracy, and achieves the state-of-the-art performance without adding complex network structures.

References

Adrian Bulat and Georgios Tzimiropoulos. 2017. How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks). In Proc. ICCV.Google ScholarCross Ref
Yao Feng, Fan Wu, Xiaohu Shao, Yanfeng Wang, and Xi Zhou. 2018. Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network. In Proc. ECCV.Google ScholarDigital Library
Shiming Ge, Jia Li, Qiting Ye, and Zhao Luo. 2017. Detecting Masked Faces in the Wild with LLE-CNNs. In Proc. CVPR.Google ScholarCross Ref
Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, and Mathieu Aubry. 2018. 3D-Coded: 3D Correspondences by Deep Deformation. In Proc. ECCV.Google ScholarDigital Library
Lie Gu and Takeo Kanade. 2006. 3D Alignment of Face in a Single Image. In Proc. CVPR.Google Scholar
Jia Guo, Jiankang Deng, Niannan Xue, and Stefanos Zafeiriou. 2018. Stacked Dense U-Nets with Dual Transformers for Robust Face Alignment. In Proc. BMVC.Google Scholar
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proc. CVPR.Google ScholarCross Ref
Patrik Huber, Philipp Kopp, William Christmas, Matthias Rätsch, and Josef Kittler. 2016. Real-Time 3D Face Fitting and Texture Fusion on in-the-Wild Videos. IEEE Signal Processing Letters 24, 4 (2016), 437–441.Google ScholarCross Ref
Vahid Kazemi and Josephine Sullivan. 2014. One Millisecond Face Alignment with an Ensemble of Regression Trees. In Proc. CVPR.Google ScholarDigital Library
Martin Köstinger, Paul Wohlhart, Peter M. Roth, and Horst Bischof. 2011. Annotated Facial Landmarks in the Wild: A Large-scale, Real-World Database for Facial Landmark Localization. In Proc. First IEEE International Workshop on Benchmarking Facial Image Analysis Technologies.Google ScholarCross Ref
Der-Tsai Lee and Bruce J Schachter. 1980. Two Algorithms for Constructing a Delaunay Triangulation. International Journal of Computer & Information Sciences 9, 3(1980), 219–242.Google ScholarCross Ref
Hao Li, Jihun Yu, Yuting Ye, and Chris Bregler. 2013. Realtime Facial Animation with On-the-fly Correctives. ACM Transactions on Graphics 32, 4, Article 42(2013), 42:1–42:10 pages.Google ScholarDigital Library
Yaojie Liu, Amin Jourabloo, William Ren, and Xiaoming Liu. 2017. Dense Face Alignment. In Proc. ICCV Workshops.Google ScholarCross Ref
Diogo C. Luvizon, David Picard, and Hedi Tabia. 2018. 2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning. In Proc. CVPR.Google ScholarCross Ref
Diogo C. Luvizon, Hedi Tabia, and David Picard. 2019. Human Pose Regression by Combining Indirect Part Detection and Contextual Information. Computers & Graphics(2019).Google Scholar
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. In Proc. ECCV.Google ScholarCross Ref
Aiden Nibali, Zhen He, Stuart Morgan, and Luke Prendergast. 2019. 3D Human Pose Estimation with 2D Marginal Heatmaps. In Proc. WACV.Google ScholarCross Ref
Kyle Olszewski, Joseph J. Lim, Shunsuke Saito, and Hao Li. 2016. High-fidelity Facial and Speech Animation for VR HMDs. ACM Transactions on Graphics 35, 6, Article 221(2016), 221:1–221:14 pages.Google ScholarDigital Library
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, and Kostas Daniilidis. 2017. Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose. In Proc. CVPR.Google ScholarCross Ref
Pascal Paysan, Reinhard Knothe, Brian Amberg, Sami Romdhani, and Thomas Vetter. 2009. A 3D Face Model for Pose and Illumination Invariant Face Recognition. In Proc. AVSS.Google ScholarDigital Library
Christos Sagonas, Epameinondas Antonakos, Georgios Tzimiropoulos, Stefanos P. Zafeiriou, and Maja Pantic. 2016. 300 Faces in-the-Wild Challenge: Database and Results. Image and Vision Computing 47 (2016), 3–18.Google ScholarDigital Library
Christos Sagonas, Georgios Tzimiropoulos, Stefanos Zafeiriou, and Maja Pantic. 2013a. A Semi-automatic Methodology for Facial Landmark Annotation. In Proc. CVPR, 5th Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2013).Google ScholarDigital Library
Christos Sagonas, Georgios Tzimiropoulos, Stefanos P. Zafeiriou, and Maja Pantic. 2013b. 300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge. In Proc. ICCV, 300 Faces in-the-Wild Challenge (300-W).Google ScholarDigital Library
Olga Sorkine, Daniel Cohen-Or, Yaron Lipman, Marc Alexa, Christian Rössl, and Hans-Peter Seidel. 2004. Laplacian Surface Editing. In Proc. SGP.Google ScholarDigital Library
Ying Tai, Yicong Liang, Xiaoming Liu, Lei Duan, Jilin Li, Chengjie Wang, Feiyue Huang, and Yu Chen. 2019. Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos. In Proc. AAAI.Google ScholarDigital Library
Zhiqiang Tang, Xi Peng, Shijie Geng, Yizhe Zhu, and Dimitris Metaxas. 2018. CU-Net: Coupled U-Nets. In Proc. BMVC.Google Scholar
Justus Thies, Michael Zollhofer, Marc Stamminger, Christian Theobalt, and Matthias Niessner. 2016. Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. In Proc. CVPR.Google ScholarDigital Library
Tijmen Tieleman and Geoffrey Hinton. 2012. Lecture 6.5-RMSProp: Divide the Gradient by a Running Average of its Recent Magnitude. COURSERA: Neural Networks for Machine Learning 4, 2 (2012), 26–31.Google Scholar
Wayne Wu, Chen Qian, Shuo Yang, Quan Wang, Yici Cai, and Qiang Zhou. 2018. Look at Boundary: A Boundary-Aware Face Alignment Algorithm. In Proc. CVPR.Google ScholarCross Ref
Yue Wu and Qiang Ji. 2019. Facial Landmark Detection: A Literature Survey. International Journal of Computer Vision 127, 2 (2019), 115–142.Google ScholarDigital Library
Sangdoo Yun, Dongyoon Han, Seong Joon Oh, Sanghyuk Chun, Junsuk Choe, and Youngjoon Yoo. 2019. CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features. In Proc. ICCV.Google ScholarCross Ref
Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, and Stan Z. Li. 2016. Face Alignment Across Large Poses: A 3D Solution. In Proc. CVPR. 146–155.Google ScholarCross Ref

Recommendations

Frontal face synthesis based on multiple pose-variant images for face recognition
ICB'07: Proceedings of the 2007 international conference on Advances in Biometrics

Pose variance remains a challenging problem for face recognition. In this paper, a stereoscopic synthesis method for generating a frontal face image is proposed to improve the performance of automatic face recognition system. Through this method, a ...
Read More
Deformation Modeling for Robust 3D Face Matching

Face recognition based on 3D surface matching is promising for overcoming some of the limitations of current 2D image-based face recognition systems. The 3D shape is generally invariant to the pose and lighting changes, but not invariant to the non-...
Read More
Nonlinear deformation learning for face alignment across expression and pose

As a fundamental work for the automatic emotional health analysis, we present a non-linear deformation learning approach to align face images and extract feature points undergoing a variety of expression and pose variations. To face the application in ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
VRST '19: Proceedings of the 25th ACM Symposium on Virtual Reality Software and Technology
November 2019
498 pages
ISBN:9781450370011
DOI:10.1145/3359996
Editors:
Tomas Trescak
Western Sydney University
,
Simeon Simoff
Western Sydney University
,
Deborah Richards
Macquarie University
,
Anton Bogdanovych
Western Sydney University
,
Thierry Duval
IMT Atlantique
,
Torsten Kuhlen
RWTH Aachen University
,
Huyen Nguyen
University of New South Wales
,
Shigeo Morishima
University of Waseda
,
Yuichi Itoh
University of Osaka
,
Richard Skarbez
La Trobe
,
Anton Bogdanovych
Western Sydney University
,
Martin Masek
Edith Cowan University
Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 November 2019
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
face alignment
graph Laplacian
heatmap
neural networks
soft-argmax
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate66of254submissions,26%
Upcoming Conference
VRST '24

Sponsor:

sigchi

sigchi

30th ACM Symposium on Virtual Reality Software and Technology

October 9 - 11, 2024

Trier , Germany
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 297
  Total Downloads
- Downloads (Last 12 months)16
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Floating-point Precision and Deformation Awareness for Scalable and Robust 3D Face Alignment

VRST '19: Proceedings of the 25th ACM Symposium on Virtual Reality Software and Technology

ABSTRACT

References

Cited By

Recommendations

Frontal face synthesis based on multiple pose-variant images for face recognition

Deformation Modeling for Robust 3D Face Matching

Nonlinear deformation learning for face alignment across expression and pose