mance under both image formats (cartesian and log-
polar). One of the most interesting considerations
refers to the concept of “scale”. Visual information
can be salient at some particular scale. However,
while in cartesian images, scale has a global meaning,
it varies across a log-polar image. This new concept
of scale and its effect on saliency computation is also
studied in this paper.
This paper is organized as follows. First, an
overview of the entropy-based saliency computation
is given in Sect. 2. Sect. 3 describes some neces-
sary adaptations of the approach to log-polar images.
Then, experimental work is described in Sect. 4. Fi-
nally, in Sect. 5 we emphasize the main conclusions
and mention ideas of further work.
2 ENTROPY-BASED SALIENCY
COMPUTATION
Saliency is a measure of the object distinctiveness
among its neighbours and it can be quantified in sev-
eral ways (Itti, 2003; Kadir and Brady, 2001). The
maximum local entropy is a measure that allows to
compute the saliency value. Particularly we explore
here the Scale Saliency (Kadir and Brady, 2001), a
detection method using the local entropy to report
salient regions.
2.1 Local Entropy Computation
The local entropy computation algorithm, or Scale
Saliency (Kadir and Brady, 2001) estimates salient
points showing unpredictable characteristics simul-
taneously in the space-scale of the point. The lo-
cal complexity or unpredictability in the space of the
point is measure by Shannon Entropy of local im-
age attributes. This metric depends on the Probability
Density Function (PDF) taken the grey level of the
image as the local descriptor. Although we only use
the grey level of image, other local image descriptors,
such as color, orientation or edge information could
also be employed.
On the other hand, to measure the scale unpre-
dictability, the local PDF is estimated at multiples
scales and the extremes in the entropy are used as
a base for scale selection. In this way, the statistics
of the local descriptor over a range of scales around
the peaks are used to estimate the inter-scale unpre-
dictability.
Scale Saliency measures the entropy for each pixel
location over a range of scales, choosing those scales
at which the entropy is maximun. Then for such max-
imum scales, the entropy value is weighted by the
metric of inter-scale unpredictability. The algorithm
yields a three-dimensional vector with the spatial lo-
cation (two dimensions) and scale for each salient
value.
2.2 Clustering using Local Probability
Density
The Scale Saliency algorithm mentioned above re-
ports a too great number of salient points, many of
which are neighbours and salient at a similar scale.
Therefore, a clustering procedure is needed to repre-
sent all these points more efficiently and less redun-
dant. However, since neither the number of salient
points nor the number of clusters are known a pri-
ori, the clustering algorithm used in (Kadir and Brady,
2001) seems not adequate. In contrast, we use an al-
ternative clustering approach (Pascual et al., 2006),
only requiring a radius r to define the density func-
tion from which data points are grouped. For the
user point of view, this radius is easier to set than the
number of clusters (how r is set will be explained in
Sect. 4). The input data to the clustering algorithm
are three-dimension vectors: the two spatial dimen-
tions and the scale of the detected salient points.
3 DEALING WITH
SPACE-VARIANT SAMPLING
Log-polar images can be obtained by different tech-
niques (Traver and Pla, 2003). In this work we use
a software-based transformation (Bolduc and Levine,
1998) by resampling a cartesian image using the
space-variant log-polar grid. This grid consists of R
concentric rings whose size grow exponentially from
the center (fixation point) to periphery, and of S uni-
formly spaced angular sectors. To refer to discrete
positions in a log-polar image, it is used the notation
(u,v), where 0 ≤ u < R and 0 ≤ v < S.
The loss of information content imposed by the
foveation process represents a problem to extract the
features from space-variant representations. In this
context, the scale is not a global concept but it takes
a local meaning, which calls for some adaptations of
the scale selection of the original algorithm. On the
other hand, the Scale Saliency does not analyse im-
age borders, thus the salient regions associated with
peripheral information are difficult to detect. The fol-
lowing subsections address both issues.
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
502