Besnerais and Champagnat, 2005). Other
computational reduction approaches use a sampled
set of pixels for parameter estimation and avoid
computing interpolation at each iteration (Keller and
Averbuch, 2003). In (Keller and Averbuch, 2004) a
bidirectional formulation is introduced that speeds
up the convergence properties for large motions.
Recently, Thomas et al. (Brox et al., 2004),
introduced a robust method to compute the optical
flow by adding to the brightness data constraint of
the energy functional another constraint: the gradient
constraint. The new energy functional produced one
of the best optical flow results in the current
literature. Therefore, a comparison is needed to
show the benefits of combining the gradient
constraint with the brightness constraint for
estimating the global motion.
Over-constraining the optical flow problem allows
more precise determination of a solution. The use of
redundant information enforces robustness with
respect to measurement noise. Constraints can be
obtained using several approaches by either applying
the same equation to multiple points or defining
multiple constraints for each image point. The later
can be obtained by applying a set of differential
equations (Bimbo et al., 1996) or applying the same
set of equations to different functions which are
related to image brightness. When the image motion
conforms to the model assumptions it produces
accurate flow estimates. However, the problem is
that parametric motion models applied over the
entire image are rarely valid due to varying depths,
transparency or independent motion. Therefore, It is
useful to use robust statistics to estimate a dominant
motion in the scene and then fit additional motions
to outlying measurements (Black and Jepson, 1996,
Irani et al., 1994). The outlying measurements which
are grouped together and segmented correspond to
independently moving objects and their motion is
estimated independently. It is also well-known that
the use of multiresolution methods improves the
estimation for large motions (Odobez and
Bouthemy, 1995). Spatiotemporal information gives
better results than spatial information (Barron et al.,
1994), and specifically, spatiotemporal
neighbourhood information assists in obtaining
better estimates for the motion vectors (Namuduri,
2004).
In this paper, we begin in section 2 by reviewing
the inverse compositional Lucas-Kanade algorithm
using only the brightness constancy. We proceed in
section 3 to elaborate on the constancy assumptions
by using the gradient constancy alone or combined
with the brightness constancy. In section 4 we
propose a new data constraint that combines the
brightness constancy with the gradient constancy
using multiple quadratic error functions. We
compare empirically the different data constraints in
section 5 both in terms of performance and speed.
We conclude in section 6.
2 INVERSE COMPOSITIONAL
IMAGE ALIGNMENT
Let W(x,p) denote a warping function that takes the
pixel x and maps it to subpixel location W(x,p)
where p=(p
1
,..,p
n
)
T
is a vector of motion parameters.
The goal of the inverse compositional (Baker and
Matthews, 2004) is to align a template image T(x) to
an input image I(x), where x=(x,y)
T
is a vector of
pixel coordinates. The inverse compositional
minimizes the sum of the squared differences (SSD)
between the current frame T and the motion
compensated frame I
2
() ( (; )) ( (;))
BC
x
Ep TWxpIWxp=Δ−
⎤
⎦
(1)
with respect to ∆p, where ∆p is the incremental
update to the motion parameters p by updating the
warp:
1
(;) (;) (; )Wxp Wxp Wx p
−
←Δo
(2)
Computing the backward warp of the image
I(W(x;p)) requires interpolating the image at
subpixel locations. Before deriving the solution, a
first order Taylor expansion is performed on (1):
2
((;0)) ((;))
x
W
TW x T p IW x p
p
⎤
∂
+∇ Δ −
⎥
∂
⎦
∑
(3)
where W(x;0) is the identity warp. Solving the least
squares equation for ∆p gives:
1
((;) ()
T
x
W
pH T IWxpTx
p
−
⎡⎤
∂
Δ= ∇ −
⎤
⎢⎥
⎦
∂
⎣⎦
∑
(4)
where H is the Hessian matrix:
T
x
WW
HT T
pp
⎤⎡ ⎤
∂∂
=∇ ∇
⎥⎢ ⎥
∂∂
⎦⎣ ⎦
∑
(5)
Assuming affine warp p=(p1,p2,p3,p4,p5,p6),
135
246
1
(;)
1
1
ppp
Wxp y
ppp
⎛⎞
+
⎛⎞
⎜⎟
=
⎜⎟
⎜⎟
+
⎝⎠
⎜⎟
⎝⎠
(6)
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
566