object at corresponding points in the two images of a
stereo pair.
Such algorithms are known to fail when:
- there are repetitive objects
- the area has only a little texture
- disparities vary rapidly within the correlation
window
- an occlusion exists
- the image does not comply to the ordering
constraint (Gong and Yang, 2003).
Figure 1: An example of repetitive objects: the windows in
the building are repetitive (see red arrows).
Over the years several attempts have been made
to overcome these problems (Okutomi and Kanade
(1993), Szeliski and Scharstein (2002)). In many
cases, the algorithms ignore problematic locations
such as repetitive objects or occlusions in order to
avoid significant depth errors. However, removing
those locations from the calculations is problematic
since the distance to these objects is not calculated
and is missing in the results.
An example of this approach has been presented
by Fua (1993) who uses a consistency criterion to
reject invalid matches. The matching is performed
twice for each template/pixel. The first time, the
template is taken from the first image and matched
to the second. The second time, the template is taken
from the second image and matched to the first.
Only when both matchings result in the same
location is the matching considered valid. Otherwise
the templates/pixels are rejected. This method rejects
repetitive objects and the distance to those objects is
not calculated. The advantage in our approach is that
instead of rejecting the repetitive objects we find
those objects and remove the repetition by adding a
location that stops the repetition.
Szeliski and Scharstein (2002) presented an
algorithm for stereo matching that addresses two
factors - the uniqueness constraint and the stereo
occlusions. The algorithm uses the symmetric
matching of Fua (1993) to detect ambiguous
matching of repetitive objects. It resolves this
ambiguity using adaptive window approach that
enlarges the template size to include non-repetitive
objects (Kanade and Okutomi, 1994). In general, the
template should be large enough to include enough
texture for correlation matching. On the other hand,
it should be small enough to avoid unwanted
smoothing and the effects of projection distortion.
The probability of mismatching decreases as the size
of the template increases. Too small a template will
result in poor disparity estimation, since the signal-
to-noise ratio is low due to the lack of texture.
However, when the template is too large it leads to
loss of accuracy due to disparity changes within the
template. This causes different projection distortions
in both images. In addition, a large window
contributes to additional noise from regions without
texture (Kanade and Okutomi, 1994). In these cases,
the position of the maximum correlation may not
represent accurately the correct matching. Kanade
and Okutomi (1994) suggested a method for
adaptive window size selection. This approach
increases the template size iteratively and calculates
the uncertainty of matching. The template size
increases as long as the uncertainty of matching
decreases. The method presented in our paper finds
the regions that need to be added to the original
template directly without any iterations.
Additionally, instead of enlarging the whole
template size we add to the template only one region
that resolves the matching uncertainty.
In this paper, a new method for dealing with
repetitive objects in stereo images is proposed. The
proposed method creates a composed template based
on multiple small templates that contain relevant
information and removes regions that might yield
bad results such as regions without texture or
regions with large disparity changes. An instance of
the repetitive object in combination with the object
that breaks the repetition creates a unique composed
template. The method is computationally less
intensive than most other approaches.
2 METHOD AND
IMPLEMENTATION
Feature based stereo techniques match templates
from the left image to those in the right. Templates
were selected in regions with high intensity
variations (edges, corners, etc.). A flow chart of the
algorithm is shown in Figure 3. The main steps of
the algorithm are described below:
FAST TEMPLATE MATCHING OF REPETITIVE OBJECTS IN STEREOSCOPY
199