STEREO VISION MATCHING OVER SINGLE-CHANNEL
COLOR-BASED SEGMENTATION
Pablo Revuelta Sanz
1
, Belén Ruiz Mezcua
1
, José M. Sánchez Pena
1
and Jean-Phillippe Thiran
2
1
Carlos III University of Madrid, Spanish Center for Captioning and Audiodescription (CESyA)
Av. Peces Barba, 1, 28918 Leganés, Madrid, Spain
2
Signal Processing Laboratory (LTS5), École Polytechnique Fédéral de Lausanne (EPFL)
Station 11, CH1015 Lausanne, Switzerland
Keywords: Segmentation, Single-channel, Depth map, Stereovision.
Abstract: Stereo vision is one of the most important passive methods to extract depth maps. Among them, there are
several approaches with advantages and disadvantages. Computational load is especially important in both
the block matching and graphical cues approaches. In a previous work, we proposed a region growing
segmentation solution to the matching process. In that work, matching was carried out over statistical
descriptors of the image regions, commonly referred to as characteristic vectors, whose number is, by
definition, lower than the possible block matching possibilities. This first version was defined for gray scale
images. Although efficient, the gray scale algorithm presented some important disadvantages, mostly related
to the segmentation process. In this article, we present a pre-processing tool to compute gray scale images
that maintains the relevant color information, preserving both the advantages of gray scale segmentation and
those of color image processing. The results of this improved algorithm are shown and compared to those
obtained by the gray scale segmentation and matching algorithm, demonstrating a significant improvement
of the computed depth maps.
1 INTRODUCTION
Stereo vision is a common procedure used to obtain
a 3D representation of a scene where the information
is provided from two different image projections of
the same scene. This particular process is carried out
automatically by human vision. However,
implementing this technique in a computer vision
system presents many diverse problems which will
be discussed throughout this study.
All multi and stereo view vision based
approaches must take into consideration a process
known as Matching. This consists in identifying the
same physical points in different images (Pons and
Keriven, 2007). The difference between these
images is referred to as the disparity, from which the
depth information can be recovered.
We propose in this paper an improvement of a
previous work (Revuelta Sanz et al., 2010b). In this
previous work, stereo vision has been achieved by
matching the region descriptors instead of matching
blocks or edges located in both images. Regarding
color images matching, algorithms found in the
literature show to have important computational
load, whichever is the color space chosen to process
the image (see (Kuan et al., 2008; Mushrif and Ray,
2008; Ozden and Polat, 2007) for examples). The
segmentation and the descriptors extraction of each
region were obtained by means of a region growing
and indexing algorithm (Revuelta Sanz et al.,
2010a), gray scale based. The contribution to this
algorithm is the inclusion of color information in the
region growing process, and our results will only be
compared to those obtained with the gray scale
version. The goal of this complementary
functionality is to take advantage of the color
information in image segmentation, improving the
depth maps accuracy, while preserving the
simplicity of the gray scale approach.
This paper is organized as follows. After this
introduction, section 2 explains the details of the
proposed pre-processing tool. In the same section
the effects of this pre-processing are shown. In
section 3, the segmentation and matching process of
pairs of images is described. Results of the
application of the described algorithm are shown in
section 4, and discussed in section 5, comparing
126
Revuelta Sanz P., Ruiz Mezcua B., M. Sánchez Pena J. and Thiran J..
STEREO VISION MATCHING OVER SINGLE-CHANNEL COLOR-BASED SEGMENTATION.
DOI: 10.5220/0003473201260130
In Proceedings of the International Conference on Signal Processing and Multimedia Applications (SIGMAP-2011), pages 126-130
ISBN: 978-989-8425-72-0
Copyright
c
2011 SCITEPRESS (Science and Technology Publications, Lda.)