SIMULATING DYNAMICAL SYSTEMS FOR EARLY VISION
Babette Dellen
1,2
and Florentin W¨org¨otter
3
1
Bernstein Center for Computational Neuroscience,Max-Planck Institute for Dynamics and Self-Organization
Bunsenstrasse 10, G¨ottingen, Germany
2
Institut de Rob`otica i Inform`atica Industrial (CSIC-UPC), Llorens i Artigas 4-6, 08028 Barcelona, Spain
3
Bernstein Center for Computational Neuroscience, University G¨ottingen, Bunsenstrasse 10, G¨ottingen, Germany
Keywords:
Early vision, Stereo matching, Energy minimization, Dynamical systems.
Abstract:
We propose a novel algorithm for stereo matching using a dynamical systems approach. The stereo correspon-
dence problem is first formulated as an energy minimization problem. From the energy function, we derive a
system of differential equations describing the corresponding dynamical system of interacting elements, which
we solve using numerical integration. Optimization is introduced by means of a damping term and a noise
term, an idea similar to simulated annealing. The algorithm is tested on the Middlebury stereo benchmark.
1 INTRODUCTION
In stereo vision, 3D information is reconstructed from
stereo image pairs, i.e. two images of the same scene
taken from a different viewpoint. Algorithmic so-
lutions to this problem are not only of interest for
the field of computer vision [Scharstein and Szeliski,
2002], but also for related fields, such as computa-
tional neuroscience [Roe et al., 2007]. Different ap-
proacheshavebeen compared in a study by Scharstein
and Szeliski (2002). In general, we distinguish be-
tween local algorithms and methods based on global
optimization. Local methods are mainly character-
ized by their matching cost computation and cost ag-
gregation step, while global algorithms formulate a
global energy function which is then minimized. This
energy minimization problem is known to be NP hard.
The algorithms are distinguished based on the mini-
mization procedure used. Common methods are sim-
ulated annealing [Marroquin et al., 1987, Geman and
Geman, 1984, Barnard, 1989], graph cuts [Scharstein
and Szeliski, 2002, Boykov et al., 2001], and max
flow [Roy, 1999]. If global optimization is reduced
to independent scanlines, methods such as dynamic
programming or scanline optimization can be used to
compute a solution in polynomal time [Scharstein and
Szeliski, 2002].
In this paper, we propose a novel framework for
computing approximate solutions to the energy min-
imization problem on the example of early stereo vi-
sion. From the energy function, a system of ordi-
nary differential equations, determining the temporal
evolution of the system, can be derived. Each pixel
represents a “mass point”, moving along a single di-
mension with an amplitude encoding the disparity es-
timate (or label) of the pixel. Each mass is moving un-
der the influence of a data force, which is derivedfrom
the image data, and interacts with its neighbors via an
interaction force. The resulting system of differen-
tial equations is solved using a Runge Kutta method
of 4th order with fixed step size. A damping force
ensures that the dynamical systems settles at a stable
state.
2 THE MODEL SYSTEM
2.1 Stereo Vision as Energy
Minimization
The general framework we consider can be defined as
follows. Let P be the set of pixels in an image. The
goal is to find a disparity z
p
for each pixel p ∈ P which
minimize a gobal energy
E(z
p
) = E
data
(z
p
) +
∑
q∈N(p)
E
int
(z
p
,z
q
) , (1)
where N(p) is the neighborhood of pixel p. The data
term E
data
measures how well the disparity values are
in agreement with the input data. The interaction term
525
Dellen B. and Wörgötter F. (2009).
SIMULATING DYNAMICAL SYSTEMS FOR EARLY VISION.
In Proceedings of the Fourth International Conference on Computer Vision Theory and Applications, pages 525-528
DOI: 10.5220/0001800905250528
Copyright
c
SciTePress