Article

Fully Automatic Facial Deformation Transfer

Shaojun Bian, Anzong Zheng, Lin Gao, Greg Maguire, Willem Kokke, Jon Macey, Lihua You and Jian J. Zhang

1 The National Centre for Computer Animation, Bournemouth University, Poole BH12 5BB, UK
2 Humain Ltd., Belfast BT1 2LA, UK
3 Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
4 Belfast School of Art, Ulster University, York St, Belfast BT15 1ED, UK
* Author to whom correspondence should be addressed.
Symmetry 2020, 12(1), 27; https://doi.org/10.3390/sym12010027
Submission received: 21 November 2019 / Revised: 13 December 2019 / Accepted: 17 December 2019 / Published: 21 December 2019

Abstract

Facial animation is a serious and ongoing challenge for the computer graphics industry. Because diverse and complex emotions must be expressed through different facial deformations and animations, copying facial deformations from an existing character to another is widely needed in both industry and academia; it reduces the time-consuming and repetitive manual modeling work of creating the 3D shape sequences for every new character. However, transferring realistic facial animations between two 3D models remains limited and inconvenient for general use. Modern deformation transfer methods require a correspondence mapping that is, in most cases, tedious to obtain. In this paper, we present a fast and automatic approach to transfer deformations between facial mesh models by obtaining the 3D point-wise correspondences automatically. The key idea is that we can estimate the correspondences between different facial meshes using a robust facial landmark detection method by projecting the 3D model to a 2D image. Experiments show that, without any manual labelling effort, our method detects reliable correspondences faster and more simply than the state-of-the-art automatic deformation transfer method on facial models.

Graphical Abstract

1. Introduction

Character modeling, especially character facial modeling, plays a pivotal role in computer graphics and computer animation. Copying facial deformations from one character to another is widely needed in both industry and academia to replace the time-consuming modeling work of creating the shape sequences for every new character. The main limitation of current deformation transfer methods is their dependence on tedious work to obtain a reliable correspondence map between the source character and the new target character. The existing state-of-the-art method for geometry-based deformation transfer introduced by Sumner et al. [1] relies on point-wise correspondences between source and target models.
Therefore, research on automatically detecting the correspondences between source and target characters, and then employing the skin deformation method [1] to obtain high-quality deformation transfer results fully automatically, is very important in computer modeling and animation: it allows existing datasets to be reused to generate new shapes and reduces the tedious time cost for artists. To achieve this, Gao et al. [2] exploit the learning capability of deep neural networks to learn how shapes deform naturally from a given dataset, provide a differentiable metric measuring visual similarity, and build a reliable mapping between latent spaces with cycle-consistency. Their method proposes a mesh-based convolutional variational auto-encoder (VAE) to encode a shape set with flexible deformations in a compact latent space, a differentiable network to measure visual similarity, and a cycle-consistent GAN for reliable mapping between the latent spaces. This automatic deformation transfer method based on VC-GAN neural networks [2] is able to copy deformations from source shapes to a given target mesh, but it sometimes causes obvious artefacts and is less accurate in localized areas of detailed facial deformations. As shown in Figure 6c, the artefacts highlighted in boxes are caused by Gao et al. [2] when transferring the localized subtle face deformations from the source character.
To tackle the above problem of transferring subtle, detailed facial deformations, in this paper we propose a straightforward approach to automatically obtain the 3D point-wise correspondences on source and target faces without any manual work. Two mapping processes are employed to obtain the 3D point-wise correspondences: (1) we project the 3D source and target characters to 2D images produced by a front-view orthographic projection, and then apply a facial feature detection technique to extract the same set of 2D landmarks on both the source and target characters; (2) these 2D landmarks are inversely mapped back to 3D, with accurate corrections for the eye and mouth areas, to finally obtain the 3D point-wise correspondences.
For facial deformation transfer, experiments show that our method fully automatically creates high-quality correspondences between source and target faces, obtains believable transferred deformations much faster, and is simpler than the state-of-the-art automatic deformation transfer method presented in [2].

2. Related Work

Shape deformation plays a critical role in character modeling and animation, especially facial shape deformation; how to quickly create high-fidelity expressions for different characters is still an active research topic and in great demand from industry. Comprehensive reviews of different facial modeling and animation approaches are given by Noh et al. [3] and Li et al. [4]. Research to improve face modeling and animation generally aims to: (1) produce realistic face animation, (2) reach real-time performance, (3) reduce manual work through automatic approaches, and (4) allow transfer and adaptation to different faces.
Mastering facial animation is a long-standing challenge in computer animation and computer graphics. The face is a very important part of expressing a character's emotions and state of mind, and of indicating their future actions. Audiences are naturally and particularly trained to notice subtle characteristics of faces and to identify different hints from them (Cao et al. [5]). Research on facial modeling and animation falls into two major types: geometric-manipulation-based and image-manipulation-based. Geometric manipulations include key-framing and geometric interpolations, parameterizations (Cohen et al. [6]), finite element methods (Basu et al. [7]), muscle-based modeling (Pieper et al. [8]), visual simulation using pseudo muscles (Magnenat-Thalmann et al. [9]), spline models (Wang et al. [10]) and free-form deformations (Kalra et al. [11]). Image manipulations include image morphing between photographic images (Ucicr et al. [12]), image blending (Pighin et al. [13]), and vascular expressions (Kalra et al. [14]).
Presently, facial deformation and animation are often produced by feature tracking (Li et al. [15]) or are performance-driven. In the research on face modelling, analysis and reconstruction, the landmark papers on Eigenfaces (Turk et al. [16]) and Active Shape Models (Cootes et al. [17]) defined the research landscape for years to come. Romdhani et al. [18] present a novel algorithm that estimates the 3D shape and texture of a human face, along with the 3D pose and the light direction, from a single photograph by recovering the parameters of a 3D morphable model. Generally, algorithms tackling the problem of 3D shape estimation from image data use only pixel intensities as input to drive the estimation process. Bas et al. [19] explore the problem of fitting a 3D morphable model to single face images using only sparse geometric features (edges and landmark points). Hu et al. [20] present an efficient stepwise 3DMM-to-2D image-fitting procedure in which the optimisation involves all the pixels of the input image, rather than randomly selected subsets, which enhances the accuracy of the fitting.
Sumner et al. [1] performed pioneering work on deformation transfer between different meshes. Local shape deformation is represented using deformation gradients, which are transferred from the source to the target shapes. This approach provides a practical way to clone facial expressions onto new characters but requires point-wise correspondences between the source and target shapes. The semantic deformation transfer introduced by Baran et al. [21] preserves the semantic characteristics of the motion instead of its literal deformation, using a shape space that enables interpolation and projection with standard linear algebra. Given several example mesh pairs, semantic deformation transfer infers a correspondence between the shape spaces of the two characters, which enables the automatic transfer of new poses and animations; however, it needs paired source and target shapes. Chu et al. [22] present an example-based deformation transfer method to overcome this limitation: with the aid of a few target examples, the characteristic deformations of the transferred target models are recovered. To minimize manual effort on the correspondences, Yang et al. [23] propose a novel algorithm adapting a biharmonic weight deformation framework that can produce plausible deformation even with only a few key points, although the corresponding points on the target shape still need to be manually specified. Therefore, research on fully automatic skin deformation methods that obtain high-quality deformation transfer results is very important in computer modeling and animation, allowing existing datasets to be reused to generate new shapes and reducing the tedious time cost for artists. To achieve this, a state-of-the-art automatic algorithm that deforms target shapes with VC-GAN neural networks was introduced by Gao et al. [2] to obtain high-quality deformation transfer results for a given character.

3. Our Approach

Our approach consists of three steps to extract the point-wise correspondences from the source facial mesh to the target facial mesh: (1) project the 3D mesh to 2D space; (2) detect and refine the 2D facial landmarks to valid pixels; and (3) retrieve the 3D point-wise correspondences and correct the eye and mouth feature points, as shown in Figure 1a–e.
Figure 1a shows a girl's scanned 3D model. After non-rigid registration, we obtain a cleaned mesh, and scaled orthographic projection is employed to project this mesh to a front-view image, as shown in Figure 1b. Then, 68 face landmarks are detected by the OpenCV cascade facial landmark detector; according to our experiments, our approach uses only 46 of them, as shown in Figure 1c, to avoid noise and maintain transfer quality. In Figure 1c, the green landmarks are valid pixels, which carry corresponding vertex information and lie on the 3D model surface. The blue landmarks are invalid pixels, which need to be refined by replacing each with the nearest valid pixel; the refined landmarks are shown in Figure 1d, especially around the eye and mouth areas. Then, an inverse Z-buffer lookup is used to retrieve the feature points on the 3D model surface from each refined landmark in the 2D image. Some of the feature points obtained so far are still not reasonable, for example the eye corner points. In this case, principal component analysis (PCA) is used to obtain the principal axes of the eye feature points and correct the positions of the eye corner feature points, as shown in Figure 1d,e.
After that, the same set of feature points is obtained on a different character's face by the same process, and the point-wise correspondence between the different character faces is established for deformation transfer. The deformations copied from our existing expression dataset to this girl's face model are shown in Figure 1f. More experiments on deformations transferred from the existing dataset to different characters can be seen in Figure 4.
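As a rough illustration of the landmark detection step, the sketch below uses off-the-shelf OpenCV tooling. It is a minimal, hedged example assuming the opencv-contrib Facemark LBF implementation with a pre-trained model file (lbfmodel.yaml) and a hypothetical 46-point index subset; the paper itself refers to an OpenCV cascade detector here and to the Supervised Descent Method [25] in Section 3.2, so this stands in only to show the kind of 68-point detector the pipeline relies on.

```python
import cv2
import numpy as np

# Minimal sketch of 68-point landmark detection on the rendered front view.
# Assumes opencv-contrib-python and a pre-trained LBF model (lbfmodel.yaml);
# both are illustrative stand-ins, not the exact detector used in the paper.
img = cv2.imread("front_view_render.png")
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# Face detection with a Haar cascade shipped with OpenCV.
cascade = cv2.CascadeClassifier(cv2.data.haarcascades + "haarcascade_frontalface_default.xml")
faces = cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)

# Facemark LBF fits 68 landmarks inside each detected face rectangle.
facemark = cv2.face.createFacemarkLBF()
facemark.loadModel("lbfmodel.yaml")
ok, landmarks = facemark.fit(img, faces)

# Keep a 46-point subset (these indices are placeholders; the paper does not list them).
landmarks68 = landmarks[0][0]              # (68, 2) array for the first detected face
subset_idx = np.arange(17, 63)             # hypothetical choice of 46 indices
landmarks46 = landmarks68[subset_idx]
```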

3.1. Convert 3D Mesh to 2D Space

Several projection approaches can be employed to convert 3D facial shapes to 2D images; here we chose the scaled orthographic projection, because it constrains the space of possible transformations to the space of rigid motions in Euclidean space while additionally allowing for scaling [24].
We use $S$ to denote the source shape or the target shape, and let $F$ be the faces of the source or target shape; note that the source and target shapes can have different topology.
Then, we compute the 2D pixel information $z_{ij} = \{f_k, L_0, L_1, L_2\}$ with the Z-buffer algorithm, where $z_{ij}$ stores the face $f_k$ and the barycentric coordinates $L_0$, $L_1$ and $L_2$ on that face at which the depth $Z$ satisfies Equation (1).
$$ Z(L_0 x_0 + L_1 x_1 + L_2 x_2) \le Z(x), \quad \forall x \in S \qquad (1) $$
Here, $x_0$, $x_1$ and $x_2$ are the vertices of $f_k$, and $x$ is a point on the surface $S$. If a pixel is not covered by the projection of any point, we simply let $z_{ij} = \{-1, 0, 0, 0\}$, and such pixels are viewed as invalid pixels.
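To make Equation (1) concrete, the following minimal sketch (our own illustrative prototype, not the authors' code) rasterizes the front-view scaled orthographic projection with a per-pixel Z-buffer that stores the covering face index and its barycentric coordinates; pixels covered by no triangle keep the invalid marker.

```python
import numpy as np

def orthographic_zbuffer(verts, faces, size=512):
    """Front-view scaled orthographic projection with a per-pixel Z-buffer.

    Returns a (size, size, 4) array whose pixels hold (face index, L0, L1, L2),
    or (-1, 0, 0, 0) for invalid pixels that no triangle covers (Equation (1)).
    Minimal prototype: per-pixel barycentric tests inside each triangle's
    bounding box; a production rasterizer would be considerably faster.
    """
    xy = verts[:, :2]
    lo = xy.min(axis=0)
    scale = (size - 1) / (xy.max(axis=0) - lo).max()     # scaled orthographic projection
    pix = (xy - lo) * scale

    depth = np.full((size, size), np.inf)                # smallest Z wins, as in Eq. (1)
    info = np.zeros((size, size, 4))
    info[..., 0] = -1                                    # -1 marks invalid pixels

    for k, (a, b, c) in enumerate(faces):
        p0, p1, p2 = pix[a], pix[b], pix[c]
        z = verts[[a, b, c], 2]
        den = (p1[1]-p2[1])*(p0[0]-p2[0]) + (p2[0]-p1[0])*(p0[1]-p2[1])
        if abs(den) < 1e-12:
            continue                                     # degenerate projected triangle
        xmin, ymin = np.floor(np.minimum(np.minimum(p0, p1), p2)).astype(int)
        xmax, ymax = np.ceil(np.maximum(np.maximum(p0, p1), p2)).astype(int)
        for j in range(max(ymin, 0), min(ymax, size - 1) + 1):
            for i in range(max(xmin, 0), min(xmax, size - 1) + 1):
                # Barycentric coordinates (L0, L1, L2) of pixel centre (i, j).
                L0 = ((p1[1]-p2[1])*(i-p2[0]) + (p2[0]-p1[0])*(j-p2[1])) / den
                L1 = ((p2[1]-p0[1])*(i-p2[0]) + (p0[0]-p2[0])*(j-p2[1])) / den
                L2 = 1.0 - L0 - L1
                if L0 < 0 or L1 < 0 or L2 < 0:
                    continue                             # pixel lies outside this triangle
                d = L0*z[0] + L1*z[1] + L2*z[2]
                if d < depth[j, i]:
                    depth[j, i] = d
                    info[j, i] = (k, L0, L1, L2)
    return info
```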

3.2. Detect and Refine the 2D Facial Landmarks

In Section 3.1, we obtained the 2D orthographic-view images of the source and target shapes. Here, the Supervised Descent Method [25] is used to detect the 2D facial landmarks [26], as shown in Figure 1c. Some detected 2D landmarks are located at invalid pixels, shown as blue pixels in Figure 2a; in this case, we refine them to valid pixels by a nearest search, expanding from the 1-ring to the $n$-ring, until a valid pixel satisfying $f_k \neq -1$, $f_k \in z_{ij}$ is found, as shown in Figure 2b,c. Figure 2d shows that the initially detected invalid pixels (blue dots) are refined to valid facial landmarks (green dots).
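The nearest search can be prototyped as below; this is an illustrative sketch assuming the pixel buffer produced by the projection step, where a stored face index of -1 marks an invalid pixel.

```python
import numpy as np

def refine_landmark(landmark, info):
    """Move a detected 2D landmark to the nearest valid pixel.

    `info` is the (H, W, 4) buffer from the projection step; a face index
    of -1 marks an invalid pixel. The search widens ring by ring (1-ring,
    2-ring, ...) until a pixel covered by some triangle is found.
    """
    h, w = info.shape[:2]
    x = min(max(int(round(landmark[0])), 0), w - 1)
    y = min(max(int(round(landmark[1])), 0), h - 1)
    if info[y, x, 0] >= 0:
        return x, y                                   # already a valid pixel
    for r in range(1, max(h, w)):
        best, best_d2 = None, None
        for dy in range(-r, r + 1):
            for dx in range(-r, r + 1):
                if max(abs(dx), abs(dy)) != r:
                    continue                          # only visit the ring boundary
                i, j = x + dx, y + dy
                if 0 <= i < w and 0 <= j < h and info[j, i, 0] >= 0:
                    d2 = dx * dx + dy * dy
                    if best_d2 is None or d2 < best_d2:
                        best, best_d2 = (i, j), d2
        if best is not None:
            return best                               # nearest valid pixel on this ring
    return x, y                                       # fallback: no valid pixel found
```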

3.3. Retrieve 3D Point-Wise Correspondences

After moving every detected 2D landmark $l_i$ to a valid pixel $\hat{l}_i$, we use the inverse Z-buffer lookup to obtain the projection information of each pixel $p_{ij}$:
$$ \{f_k, L_0, L_1, L_2\} = Z^{-1}(p_{ij}), \quad p_{ij} \in \hat{L} \qquad (2) $$
The 3D point-wise feature $v_i$ is the vertex of $f_k$ with the largest barycentric coefficient, $\max(L_0, L_1, L_2)$. Therefore, $v_i$ is the 3D feature point index calculated from each valid 2D landmark in $\hat{L}$.
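A short sketch of this inverse lookup, again assuming the per-pixel buffer produced by the projection step:

```python
import numpy as np

def landmark_to_vertex(valid_pixel, info, faces):
    """Inverse Z-buffer lookup: map a valid 2D landmark pixel to a 3D vertex index.

    Reads back the stored face index and barycentric coordinates at the pixel
    and picks the triangle vertex with the largest coefficient, as in Equation (2).
    """
    x, y = valid_pixel
    k, L0, L1, L2 = info[y, x]
    corner = int(np.argmax([L0, L1, L2]))   # vertex with the largest barycentric weight
    return int(faces[int(k)][corner])       # index of the 3D feature point v_i
```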
Figure 3a shows that some of the obtained 3D feature points (green dots) are still not in reasonable locations, especially around the eyes and mouth. To correct these feature points, we first calculate the center of the current green feature points of each eye and of the mouth, and use it to search for the closest edge circle.
Then, principal component analysis (PCA) is utilized to obtain the principal axes of the circle vertices. As shown in Figure 3b, the first principal vector (red arrow) is regarded as the width axis of the eye and is used to determine the two eye-corner feature points (large red dots). Of the other two vectors, the one with the smaller angle to the circle plane is regarded as the height axis (yellow arrow) and determines the other interval feature points (red dots). After the correction, all 3D facial feature points $V$ are obtained correctly, moving from the green dots to the red dots shown in Figure 3b. The same correction process is applied to both eye circles and the mouth circle; the final retrieved 3D facial feature points can be seen in Figure 1e.
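The corner correction can be sketched as follows, under the assumption that the vertices of the closest edge circle have already been collected into an (N, 3) array; this is an illustrative prototype rather than the authors' exact implementation.

```python
import numpy as np

def correct_corners(circle_verts):
    """Estimate corner feature points of an eye (or mouth) contour via PCA.

    The first principal axis of the contour vertices is taken as the width
    axis; the vertices with extreme projections onto it are the two corner
    feature points. Returns the corner points and the width axis.
    """
    center = circle_verts.mean(axis=0)
    centered = circle_verts - center
    # Principal axes from the eigen-decomposition of the covariance matrix.
    cov = centered.T @ centered / len(circle_verts)
    eigvals, eigvecs = np.linalg.eigh(cov)
    width_axis = eigvecs[:, np.argmax(eigvals)]       # first principal vector (red arrow)
    proj = centered @ width_axis
    corner_a = circle_verts[np.argmin(proj)]          # one corner of the eye/mouth
    corner_b = circle_verts[np.argmax(proj)]          # the opposite corner
    return corner_a, corner_b, width_axis
```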
The same set of 3D facial feature points obtained on both the source and target face shapes is regarded as the point-wise correspondence for transferring the deformations of the existing source dataset to the target face.

3.4. Deformation Transfer

Our approach utilizes the effective mesh-based deformation transfer algorithm introduced by Sumner et al. [1] to copy deformations from one model onto another, regardless of topology or number of vertices. This algorithm treats the shape deformation as a set of affine transformations of the vertices of each triangle on the source model:
$$ Q v_i + d = \tilde{v}_i, \quad i \in \{1, \ldots, 4\} \qquad (3) $$
where $Q$ denotes the $3 \times 3$ matrix encoding the linear part of the affine transformation and $d$ denotes the translation vector; $v_i$ and $\tilde{v}_i$ represent the vertices of each triangle face on the original and deformed mesh, with a fourth vertex $v_4$ introduced in the direction perpendicular to the triangle. The fourth vertex $v_4$ is computed by
$$ v_4 = v_1 + \frac{(v_2 - v_1) \times (v_3 - v_1)}{\left| (v_2 - v_1) \times (v_3 - v_1) \right|} \qquad (4) $$
Here, Equation (3) can be written as $Q V = \tilde{V}$, where $V = [\, v_2 - v_1 \;\; v_3 - v_1 \;\; v_4 - v_1 \,]$ and $\tilde{V}$ is built in the same way from the deformed triangle; therefore, the affine transformation $Q$ can be computed by
$$ Q = \tilde{V} V^{-1} \qquad (5) $$
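As an illustration of Equations (3)–(5), the sketch below (our own prototype, not the authors' code) computes the per-triangle affine matrix $Q$ from a rest triangle and a deformed triangle; the full method of Sumner et al. [1] then solves the optimization in Equation (6) over all triangles.

```python
import numpy as np

def triangle_affine(tri_rest, tri_def):
    """Per-triangle affine transformation Q from Equations (3)-(5).

    `tri_rest` and `tri_def` are (3, 3) arrays holding the three triangle
    vertices (one per row) on the rest and deformed mesh. A fourth vertex
    is added along the unit normal (Equation (4)); then Q = V_tilde V^{-1}.
    """
    def frame(tri):
        e1 = tri[1] - tri[0]
        e2 = tri[2] - tri[0]
        n = np.cross(e1, e2)
        n = n / np.linalg.norm(n)           # (v4 - v1) direction from Equation (4)
        return np.column_stack([e1, e2, n]) # V = [v2-v1, v3-v1, v4-v1]

    V = frame(tri_rest)
    V_tilde = frame(tri_def)
    return V_tilde @ np.linalg.inv(V)       # Equation (5)
```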
Then $S_1, \ldots, S_{|S|}$ are used to encode the affine transformations of the triangle faces on the source mesh, and $T_1, \ldots, T_{|T|}$ encode the deformations of the target mesh. In order to transfer the deformations of the source mesh onto the target mesh, a constrained optimization problem over the target affine transformations needs to be solved:
$$ \min_{T_1 + d_1, \ldots, T_{|T|} + d_{|T|}} \; \sum_{j=1}^{|M|} \left\| S_{s_j} - T_{t_j} \right\|_F^2, \quad \text{subject to: } T_j v_i + d_j = T_k v_i + d_k, \;\; \forall i, \; \forall j, k \in p(v_i) \qquad (6) $$
This means that the target triangle face indexed $t_j$ will deform like the source triangle $s_j$, where the mapping between the source triangles $S_{s_j}$ and target triangles $T_{t_j}$ can be computed from the point-wise correspondences given by a small set of user-selected marker points on the source and target meshes. However, manually selecting marker points is very time-consuming, repetitive work, which makes the deformation transfer algorithm inconvenient for general use.
In our approach, we solve this problem using a robust facial landmark detection method that projects the 3D model to a 2D image to automatically estimate the correspondences between different facial meshes. Specifically, the same feature-point pattern is detected on both the source and target characters to obtain the 3D point-wise correspondences. Then, the deformation transfer algorithm introduced by Sumner et al. [1] is utilized to copy the deformations from the source mesh onto the target mesh.

4. Experiments and Comparisons

Our experiments to evaluate the quality of the 3D correspondence feature points obtained by our method are carried out on a computer with an i7 8650U CPU, 16 GB RAM, and Intel(R) UHD Graphics 620.
Figure 4 shows the facial movements copied to three target characters, whose numbers of vertices are 28,426, 11,780 and 11,676, from the source shape (9127 vertices). The first row in Figure 4 shows the existing character expression dataset used as source shapes; the second to fourth rows in Figure 4a illustrate the three target characters' neutral poses and the obtained point-wise feature correspondences (red dots); the second to fourth rows in Figure 4b demonstrate the deformations copied automatically from the source dataset. Without any multi-threading, our running times for detecting the 3D correspondences of the target faces in the second to fourth rows of Figure 4a are 0.343 s, 0.335 s and 0.327 s, respectively, and 2.228 s, 1.218 s and 1.159 s to transfer 9 deformations, as shown in Table 1.
We compare the deformation transfer results obtained from the 46 feature points determined by our algorithm, from manually corrected features (46 features), and from more features (72 features), as shown in Figure 5. Figure 5a shows the source character shapes. Figure 5b shows the deformation transfer results of our proposed approach with the 46 automatically detected corresponding feature points. The first row of Figure 5c shows the results created by supervised selection of more corresponding points (72 features); the color map for each expression in the second row quantitatively shows the differences between our approach and the supervised selection of 72 features, with Maximum Vertex Errors (MVEs) of 0.002431, 0.003451, 0.000422, 0.003075, 0.004760, 0.002156, 0.000167, 0.003513 and 0.003131, respectively. Figure 5d shows the results produced by manually labeling 46 corresponding feature points; the color map for each expression in the second row likewise shows the differences between our approach and the manual labeling of 46 features, with MVEs of 0.011165, 0.008808, 0.010611, 0.017191, 0.034246, 0.027239, 0.004292, 0.020490, and 0.016479, respectively. These experiments illustrate that our approach produces results competitive with manual effort, and that using many more feature points does not improve the result significantly; the differences between them can hardly be seen visually.
The state-of-the-art automatic algorithm of Gao et al. [2], which deforms target shapes with VC-GAN neural networks, can obtain high-quality global deformation transfer results for a given character's deformations, but it causes obvious artefacts and is less accurate for localized, subtle, detailed facial deformations, as shown in Figure 6. Figure 6a shows the source character shapes. Figure 6b shows the deformation transfer results of our proposed approach with the 46 automatically detected corresponding feature points. The first row of Figure 6c shows the results created by Gao et al. [2], where artefacts can be seen in several localized areas; the color map for each expression in the second row shows the differences between our approach and Gao et al. [2], with Maximum Vertex Errors (MVEs) of 0.031727, 0.022187, 0.021444, 0.049702, 0.067460, 0.043652, 0.018132, 0.048965 and 0.032280, respectively.
In this experiment, using the same source mesh (28,426 vertices) with 14 training examples and a target mesh (8150 vertices) with 12 training examples, Gao et al. [2] takes more than one hour to train the VAE, SimNet and CycleGAN. After this offline training, it can perform the deformation transfer for each mesh in 10 ms. In contrast, without any optimization, our method needs no hours-long training process, and the computing time to detect the 3D feature correspondences of the source (28,426 vertices) and target (8150 vertices) in Figure 6 is 0.670 s in total.

5. Conclusions

Facial animation is a serious and ongoing challenge for the computer graphics industry. Because diverse and complex emotions must be expressed through different facial deformations and animations, copying facial deformations from an existing character to another is widely needed in both industry and academia to reduce time-consuming and repetitive manual work.
In this paper, we focused on the specific topic of feasible facial deformation transfer. To reuse an existing face expression dataset and fully automatically copy its deformations to new characters, we employ a facial image landmark detection algorithm to find the 3D facial feature correspondences between source and target characters: the 3D face is projected orthographically to 2D, the correspondence is established in 2D first, and it is then mapped back to 3D.
The experiments demonstrate that our proposed method provides a very simple and effective way to achieve high-quality, believable and competitive deformation transfer results without any manual work. Several improvements can be made in the future. First, we will extend our method to more generic models, such as human bodies, hands or animal models. In addition, iterations will be introduced to better approximate the real deformations and reduce errors.

Author Contributions

All authors contributed to the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by Innovate UK (Knowledge Transfer Partnerships Ref: KTP010860) and the EU Horizon 2020 RISE 2017 funded project PDE-GIR (H2020-MSCA-RISE-2017-778035). Shaojun Bian is also supported by China Scholarship Council.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sumner, R.W.; Popović, J. Deformation transfer for triangle meshes. ACM Trans. Graph. (TOG) 2004, 23, 399–405. [Google Scholar] [CrossRef]
  2. Gao, L.; Yang, J.; Qiao, Y.L.; Lai, Y.K.; Rosin, P.L.; Xu, W.; Xia, S. Automatic unpaired shape deformation transfer. ACM Trans. Graph. (TOG) 2018, 37, 237. [Google Scholar] [CrossRef] [Green Version]
  3. Noh, J.Y.; Neumann, U. A Survey of Facial Modeling and Animation Techniques; Technical Report, USC Technical Report; University of Southern California: Los Angeles, CA, USA, 1998; pp. 99–705. [Google Scholar]
  4. Li, L.; Liu, Y.; Zhang, H. A survey of computer facial animation techniques. In Proceedings of the 2012 IEEE International Conference on Computer Science and Electronics Engineering, Hangzhou, China, 23–25 March 2012; Volume 3, pp. 434–438. [Google Scholar]
  5. Cao, C.; Bradley, D.; Zhou, K.; Beeler, T. Real-time high-fidelity facial performance capture. ACM Trans. Graph. (TOG) 2015, 34, 46. [Google Scholar] [CrossRef]
  6. Cohen, M.M.; Massaro, D.W. Modeling coarticulation in synthetic visual speech. In Models and Techniques in Computer Animation; Springer: Berlin, Germany, 1993; pp. 139–156. [Google Scholar]
  7. Basu, S.; Oliver, N.; Pentland, A. 3D modeling and tracking of human lip motions. In Proceedings of the Sixth International Conference on Computer Vision, Bombay, India, 4–7 January 1998; pp. 337–343. [Google Scholar]
  8. Pieper, S.; Rosen, J.; Zeltzer, D. Interactive graphics for plastic surgery: A task-level analysis and implementation. In Proceedings of the 1992 Symposium on Interactive 3D Graphics, Vienna, Austria, 29 June–1 July 1992; pp. 127–134. [Google Scholar]
  9. Magnenat-Thalmann, N.; Primeau, E.; Thalmann, D. Abstract muscle action procedures for human face animation. Vis. Comput. 1988, 3, 290–297. [Google Scholar] [CrossRef]
  10. Wang, C.L.; Forsey, D.R. Langwidere: A new facial animation system. In Proceedings of the IEEE Computer Animation’94, Geneva, Switzerland, 25–28 May 1994; pp. 59–68. [Google Scholar]
  11. Kalra, P.; Mangili, A.; Thalmann, N.M.; Thalmann, D. Simulation of facial muscle actions based on rational free form deformations. In Computer Graphics Forum; Wiley Online Library: Hoboken, NJ, USA, 1992; Volume 11, pp. 59–69. [Google Scholar]
  12. Ucicr, T. Feature-based image metamorphosis. Comput. Graph. 1992, 26, 2. [Google Scholar]
  13. Pighin, F.; Hecker, J.; Lischinski, D.; Szeliski, R.; Salesin, D.H. Synthesizing realistic facial expressions from photographs. In Proceedings of the ACM SIGGRAPH 2006 Courses, Boston, MA, USA, 30 July–3 August 2006; p. 19. [Google Scholar]
  14. Kalra, P.; Magnenat-Thalmann, N. Modeling of vascular expressions in facial animation. In Proceedings of the IEEE Computer Animation’94, Geneva, Switzerland, 25–28 May 1994; pp. 50–58. [Google Scholar]
  15. Li, H.; Trutoiu, L.; Olszewski, K.; Wei, L.; Trutna, T.; Hsieh, P.L.; Nicholls, A.; Ma, C. Facial performance sensing head-mounted display. ACM Trans. Graph. (TOG) 2015, 34, 47. [Google Scholar] [CrossRef] [Green Version]
  16. Turk, M.A.; Pentland, A.P. Face recognition using eigenfaces. In Proceedings of the 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Shanghai, China, 11–13 March 1991; pp. 586–591. [Google Scholar]
  17. Cootes, T.F.; Taylor, C.J.; Cooper, D.H.; Graham, J. Active shape models-their training and application. Comput. Vis. Image Underst. 1995, 61, 38–59. [Google Scholar] [CrossRef] [Green Version]
  18. Romdhani, S.; Vetter, T. Estimating 3D shape and texture using pixel intensity, edges, specular highlights, texture constraints and a prior. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–25 June 2005; Volume 2, pp. 986–993. [Google Scholar]
  19. Bas, A.; Smith, W.A.; Bolkart, T.; Wuhrer, S. Fitting a 3D morphable model to edges: A comparison between hard and soft correspondences. In Asian Conference on Computer Vision; Springer: Berlin, Germany, 2016; pp. 377–391. [Google Scholar]
  20. Hu, G.; Yan, F.; Kittler, J.; Christmas, W.; Chan, C.H.; Feng, Z.; Huber, P. Efficient 3D morphable face model fitting. Pattern Recognit. 2017, 67, 366–379. [Google Scholar] [CrossRef] [Green Version]
  21. Baran, I.; Vlasic, D.; Grinspun, E.; Popović, J. Semantic deformation transfer. ACM Trans. Graph. (TOG) 2009, 28, 36. [Google Scholar] [CrossRef]
  22. Chu, H.K.; Lin, C.H. Example-based Deformation Transfer for 3D Polygon Models. J. Inf. Sci. Eng. 2010, 26, 379–391. [Google Scholar]
  23. Yang, J.; Gao, L.; Lai, Y.K.; Rosin, P.L.; Xia, S. Biharmonic deformation transfer with automatic key point selection. Graph. Models 2018, 98, 1–13. [Google Scholar] [CrossRef] [Green Version]
  24. Huber, P. Real-time 3D Morphable Shape Model Fitting to Monocular In-the-Wild Videos. Ph.D. Thesis, University of Surrey, Guildford, UK, 2017. [Google Scholar]
  25. Xiong, X.; De la Torre, F. Supervised descent method and its applications to face alignment. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 532–539. [Google Scholar]
  26. Sagonas, C.; Antonakos, E.; Tzimiropoulos, G.; Zafeiriou, S.; Pantic, M. 300 faces in-the-wild challenge: Database and results. Image Vis. Comput. 2016, 47, 3–18. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Pipeline of our approach and facial movements copied from source character.
Figure 2. (a) The initial 2D landmarks detected; the blue dots denote invalid pixels. (b–d) The refining process that obtains the green valid pixels by the nearest search.
Figure 3. (a) After the refining process, all the green valid pixels have been obtained by the nearest search. (b) The final corrected features (red dots), obtained by determining the principal axes of the eye circle.
Figure 4. The first row is the source character; the second to fourth rows are the three target characters.
Figure 5. The results of deformation transfer from different input correspondences on the source and target faces. (a) The source neutral face and 8 expressions. (b) The deformations transferred by our approach. (c) The deformations transferred from more correspondences (72 features from supervised selection). (d) The deformations transferred from manual labelling (46 features). The color maps illustrate that the 46 feature points obtained by our approach produce results competitive with the supervised 72 features (c) and with manual labelling (d).
Figure 6. The comparison with Gao et al. [2]. (a) The source neutral face and 8 expressions. (b) The deformations transferred by our approach, without artefacts in the localized areas of subtle facial deformations. (c) The deformations transferred by Gao et al. [2], which cause artefacts in some cases of sensitive, detailed facial deformations.
Table 1. The running time of detecting 3D correspondence on different faces with different topology and the running time of deformation transfer for each expression.
Models                        Source Face   Girl Face   Female Face   Male Face
Polygon Verts                 9127          28,426      11,780        11,676
Detecting 3D correspondence   0.115 s       0.343 s     0.335 s       0.327 s
Deformation transfer          n/a           0.248 s     0.135 s       0.129 s
