sults with some examples. For every example we
show the ROIS of the real images that fit best the man-
ual ones. In other words, we show the detected ROIs
that maximize the overlapping area of the ground
truth regions.
The COREST detector has selected the image re-
gions according to the stability measures in the color
and in the space features. Then, the contrasted re-
gions and and isolated regions are detected as mean-
ingful according to the human perception. As we have
introduced in the section 2 the analysis of the space
properties plays an important role in identifying ob-
jects in real images. Then, it makes possible to group
a set of disjoint regions that belong to an object that is
partially occluded. This phenomena can be illustrated
with the image a) of the Figure 5. Thus, if we look
at the palm picture we realize that the sand area is oc-
cluded by a shadow that the user has omitted. Being
the shadow so contrasted with the color of the sand, a
classical segmentation could never join the two parts.
The same occurs in the detection of the background
region in the fireman scene f). The region belonging
to the wall is occluded by a picture and it is broken
into a set of parts. The space scale analysis has also
effect in the presence of textures. Then, the system
is able to group similar color regions. Natural scenes
have plenty of textures such as trees a), clouds h),
waves e), or the leopard skin d). Moreover, images
that contain man made objects can also present repet-
itive patterns that a human perceive as belonging to
the same object. This is the case of the white and red
striped bars of the figure c) or clothing garments of
the image b).
4 CONCLUSIONS
We have presented a novel region detector on color
images that combines a classical color segmentation
approach with a space scale analysis. We have used
the mean shift algorithm to measure the stability of
the regions on the color and the space domains. The
detector gives a very high degree of freedom about
the shape of the output regions making it suitable to
describe any image content. Moreover, the multiscale
approach allows the system to detect ROIS composed
by disjoint regions that can come from partial over-
lapped elements or textured areas. We have made
some experiments to evaluate the regions of interest of
manual labelled image vs. the regions of interest of a
real scene. Using a human based benchmark we have
demonstrated that exist enough correlation to use this
region detector in applications where the information
has to be matched according to the human represen-
tation. One of the potentially applications could be
found int the content based retrieval systems that al-
low sketch based queries. dfafd
ACKNOWLEDGEMENTS
This work has been partially supported by the project
TIC2003-09291 and the grant 2002FI-00724.
REFERENCES
Cheng, H.-D., Jiang, X.-H., Sun, Y., and Wang, J. (2001).
Color image segmentation: advances and prospects.
Pattern Recognition, 12(34):2259–2281.
Christoudias, C., Georgescu, B., and Meer, P. (2002). Syn-
ergism in low level vision. pages IV: 150–155.
Comaniciu, D. and Meer, P. (1999). Mean Shift Analysis
and Applications. In Proceedings of the IEEE ICCV,
pages 1197–1203, Kerkyra, Greece.
Forss´en, P.-E. (2007). Maximally stable colour regions for
recognition and matching. In IEEE Conference on
Computer Vision and Pattern Recognition, Minneapo-
lis, USA. IEEE Computer Society, IEEE.
Lindeberg, T. (1993). Scale-Space Theory in Computer
Vision (The International Series in Engineering and
Computer Science). Springer.
Martin, D., Fowlkes, C., Tal, D., and Malik, J. (2001).
A database of human segmented natural images and
its application to evaluating segmentation algorithms
and measuring ecological statistics. Technical report,
EECS Department, University of California, Berkeley.
Matas, J., Chum, O., Martin, U., and Pajdla, T. (2002). Ro-
bust wide baseline stereo from maximally stable ex-
tremal regions. In Proceedings of the BMVC, vol-
ume 1, pages 384–393, London.
Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman,
A., Matas, J., Schaffalitzky, F., Kadir, T., and Gool,
L. V. (2005). A comparison of affine region detectors.
IJCV, 65(1/2):43–72.
Unnikrishnan, R., Pantofaru, C., and Hebert, M. (2007). To-
ward objective evaluation of image segmentation al-
gorithms. 29(6):929–944.
Veltkamp, R. and Tanase, M. (2000). Content-based image
retrieval systems: A survey. Technical Report UU-CS-
2000-34, Department of Information and Computing
Sciences, Utrecht University.
VISAPP 2009 - International Conference on Computer Vision Theory and Applications
208