@nrailgun 2016-06-13T12:06:21.000000Z Word count 1356 Views 2049

SDM and its Applications to Face Alignment

Paper notes

Derivation of SDM

Given an image $d \in \mathfrak R^{m \times 1}$ of $m$ pixels, $d(x) \in \mathfrak R^{p \times 1}$ indexes $p$ landmarks in the image. $h$ is a non-linear feature extraction function (e.g., SIFT), so $h(d(x)) \in \mathfrak R^{128p \times 1}$ in the case of SIFT.

During training, we assume that the correct $p$ landmarks (in our case $66$) are known and referred to as $x_*$. We run the face detector on the training images to provide an initial configuration of the landmarks $x_0$, which corresponds to an average shape.

In this setting, face alignment can be framed as minimizing the following function over $\Delta x$:

$f(x_0 + \Delta x) = \| h(d(x_0 + \Delta x)) - \phi_* \|_2^2$

where $\phi_* = h(d(x_*))$.

For derivation purposes, we will assume that $h$ is twice differentiable. We apply a second-order Taylor expansion:

$f(x_0 + \Delta x) \approx f(x_0) + J_f(x_0)^T \Delta x + \frac 1 2 \Delta x^T H(x_0) \Delta x$

where $J_f(x_0) \in \mathfrak R^{p \times 1}$ and $H(x_0) \in \mathfrak R^{p \times p}$ are the Jacobian and Hessian of $f$ evaluated at $x_0$.
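As a sketch of the step that connects this expansion to a linear update: setting the derivative of the quadratic approximation with respect to $\Delta x$ to zero gives a Newton step. The symbols $J_h$ (the Jacobian of $h$ evaluated at $x_0$) and $\phi_0 = h(d(x_0))$ are not defined in these notes and are introduced here as assumptions for the derivation.

$J_f(x_0) + H(x_0) \Delta x = 0 \quad \Rightarrow \quad \Delta x_1 = -H(x_0)^{-1} J_f(x_0)$

Applying the chain rule to $f$ gives $J_f(x_0) = 2 J_h^T (\phi_0 - \phi_*)$, so

$\Delta x_1 = -2 H(x_0)^{-1} J_h^T (\phi_0 - \phi_*) = R_0 \phi_0 + b_0$

with $R_0 = -2 H(x_0)^{-1} J_h^T$ and $b_0 = 2 H(x_0)^{-1} J_h^T \phi_*$. The step is therefore linear in the features $\phi_0$. Since $\phi_*$ is unknown at test time and $h$ (e.g., SIFT) is not differentiable in practice, it is natural to estimate $R_0$ and $b_0$ from training data rather than compute them from $H$ and $J_h$.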

SDM will learn a sequence of generic descent directions $\{ R_k \}$ and bias terms $\{ b_k \}$:

$x_k = x_{k-1} + R_{k-1} \phi_{k-1} + b_{k-1}$

where $\phi_{k-1} = h(d(x_{k-1}))$ is the feature vector extracted at the previous landmark estimate, such that the sequence $x_k$ converges to $x_*$.
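The update rule above can be sketched as a short loop. This is a minimal illustration under stated assumptions, not the paper's implementation: the names `sdm_predict`, `matvec`, and `features` are hypothetical, `features` stands in for $h(d(\cdot))$ (SIFT in the paper), and plain Python lists are used so the sketch runs without dependencies.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def matvec(R: Matrix, phi: Sequence[float]) -> Vector:
    """Return the matrix-vector product R * phi."""
    return [sum(r_ij * p_j for r_ij, p_j in zip(row, phi)) for row in R]


def sdm_predict(
    x0: Sequence[float],
    features: Callable[[Vector], Vector],
    stages: Sequence[Tuple[Matrix, Vector]],
) -> Vector:
    """Apply x_k = x_{k-1} + R_{k-1} * phi_{k-1} + b_{k-1} for each stage.

    `features` is a hypothetical stand-in for h(d(x)); `stages` holds the
    learned (R_k, b_k) pairs in the order they are applied.
    """
    x = list(x0)
    for R, b in stages:
        phi = features(x)        # phi_{k-1} = h(d(x_{k-1}))
        step = matvec(R, phi)    # R_{k-1} * phi_{k-1}
        x = [x_i + s_i + b_i for x_i, s_i, b_i in zip(x, step, b)]
    return x
```

As a toy check, take the identity as the feature function, $R = -0.5 I$, and $b = 0.5 x_*$: each stage then maps $x$ to $0.5 (x + x_*)$, so the distance to $x_*$ halves at every step.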

Learning for SDM

Learn $R_k$ and $b_k$ by minimizing

$\arg \min_{R_k, b_k} \sum_{d^i} \sum_{x^i_k} \| \Delta x^{ki}_* - R_k \phi_k^i - b_k \|^2$

where $\Delta x^{ki}_* = x_*^i - x_k^i$ and $\phi_k^i = h(d^i(x_k^i))$.
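Because the objective is linear in $R_k$ and $b_k$, one stage can be fitted by ordinary least squares. The sketch below is only an illustration under assumptions: `fit_stage` and `solve` are hypothetical names, the double sum over images and sampled shapes is flattened into one list of samples, and the normal equations are solved directly in plain Python. A practical implementation would more likely use a numerical library and add regularization.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def solve(A: Matrix, B: Matrix) -> Matrix:
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(A)
    # Work on an augmented copy [A | B] so the inputs are not modified.
    M = [list(a_row) + list(b_row) for a_row, b_row in zip(A, B)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("singular system: add samples or regularization")
        M[col], M[pivot] = M[pivot], M[col]
        inv = 1.0 / M[col][col]
        M[col] = [v * inv for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0.0:
                factor = M[r][col]
                M[r] = [v - factor * p for v, p in zip(M[r], M[col])]
    return [row[n:] for row in M]


def fit_stage(
    phis: Sequence[Sequence[float]],
    deltas: Sequence[Sequence[float]],
) -> Tuple[Matrix, Vector]:
    """Least-squares fit of delta = R * phi + b over all samples.

    phis[i] is the feature vector phi_k^i and deltas[i] is the target
    displacement x_*^i - x_k^i for the same sample.
    """
    # Append a constant 1 to each feature vector so the bias is fitted jointly.
    A = [list(phi) + [1.0] for phi in phis]
    q = len(A[0])        # feature dimension + 1
    p = len(deltas[0])   # shape dimension
    # Normal equations: (A^T A) W = A^T Y, with W of shape q x p.
    AtA = [[sum(a[i] * a[j] for a in A) for j in range(q)] for i in range(q)]
    AtY = [[sum(a[i] * y[j] for a, y in zip(A, deltas)) for j in range(p)]
           for i in range(q)]
    W = solve(AtA, AtY)
    # Row i of R holds the weights for output coordinate i;
    # the last row of W is the bias.
    R = [[W[j][i] for j in range(q - 1)] for i in range(p)]
    b = [W[q - 1][i] for i in range(p)]
    return R, b
```

After fitting stage $k$, the training shapes are advanced with the learned $R_k$ and $b_k$, features are re-extracted at the new positions, and the next stage is fitted on the updated displacements.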
