How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)

被引:790
作者
Bulat, Adrian [1 ]
Tzimiropoulos, Georgios [1 ]
机构
[1] Univ Nottingham, Comp Vis Lab, Nottingham, England
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ICCV.2017.116
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates how far a very deep neural network is from attaining close to saturating performance on existing 2D and 3D face alignment datasets. To this end, we make the following 5 contributions: (a) we construct, for the first time, a very strong baseline by combining a state-of-the-art architecture for landmark localization with a state-of-the-art residual block, train it on a very large yet synthetically expanded 2D facial landmark dataset and finally evaluate it on all other 2D facial landmark datasets. (b) We create a guided by 2D landmarks network which converts 2D landmark annotations to 3D and unifies all existing datasets, leading to the creation of LS3D-W, the largest and most challenging 3D facial landmark dataset to date (similar to 230,000 images). (c) Following that, we train a neural network for 3D face alignment and evaluate it on the newly introduced LS3D-W. (d) We further look into the effect of all "traditional" factors affecting face alignment performance like large pose, initialization and resolution, and introduce a "new" one, namely the size of the network. (e) We show that both 2D and 3D face alignment networks achieve performance of remarkable accuracy which is probably close to saturating the datasets used. Training and testing code as well as the dataset can be downloaded from https://www.adrianbulat.com/face-alignment/
引用
收藏
页码:1021 / 1030
页数:10
相关论文
共 48 条
  • [1] 2D Human Pose Estimation: New Benchmark and State of the Art Analysis
    Andriluka, Mykhaylo
    Pishchulin, Leonid
    Gehler, Peter
    Schiele, Bernt
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3686 - 3693
  • [2] [Anonymous], 2010, UMCS2010009
  • [3] [Anonymous], 2016, P BRIT MACH VIS C
  • [4] [Anonymous], 2012, ECCV
  • [5] [Anonymous], 2013, TPAMI
  • [6] [Anonymous], 2013, CVPR
  • [7] [Anonymous], 2013, CVPR
  • [8] [Anonymous], 2012, CVPR
  • [9] [Anonymous], 2016, DEEPERCUT DEEPER STR
  • [10] [Anonymous], 2016, ECCV