2 RELATED WORKS
Before describing in detail the proposed method for
image retargeting it is instructive to examine the cur-
rent state of the art.
A non-photorealistic algorithm was proposed
in (Setlur et al., 2005) to adapt automatically images
for small sized displays with different sizes and/or as-
pect ratios, while preserving the important features in
the image. This is achieved by first decomposing the
image in a background layer and foreground objects.
Then, all important zones are temporally cropped and
the other parts are removed from the image. The fi-
nal image is obtained by pasting the important zones,
with the rest of the image obtained by resizing some
parts using inpainting (Harrison, 2001).
In (Liu and Gleicher, 2005) and (Liu and Gleicher,
2006) the authors propose a method for image and
video retargeting, respectively. Their method presents
a tradeoff between image resizing and image crop-
ping. The method consists on finding the Region-of-
Interest (ROI) of the image and constructing a novel
Fisheye-View warp that applies a linear scaling func-
tion in each dimension of the image. Basically, in this
method the information on the ROI is preserved and
the rest of the image is warped. In the video applica-
tion they use a combination of the image and saliency
maps to find the ROI. After this operation, cropping,
virtual pan and shot cuts are used to resize the image.
In (Gal et al., 2006) is presented a method for in-
homogeneous 2D texture mapping by a feature mask
to the general problem of warping, that preserves
some parts of the image specified by the user. The
feature-aware texture warping is done by applying a
formulation based on Laplacian editing technique.
An automatic non-uniform global warping was
employed by (Wolf et al., 2007) in a problem of video
retargeting. The algorithm starts by analyzing the
frame to detect important zones and then shrinks less
important regions. The analysis of the frames is based
on local saliency, and object and motion detection.
The technique of cropping the important zones
and then resizing them to obtain the desired aspect
ratio was applied in (Tao et al., 2007) to a problem
of video retargeting. In (Chen and Sen, 2008) the au-
thors use graph cuts to find and remove low gradient
sheets, for video summarization applications.
An approach for the summarization of visual data
for images and videos is proposed in (Simakov et al.,
2008). The authors propose a measure to quantify
how “good” is a visual summary. This measure can
be applied to compare two images or two videos se-
quences with different sizes. This is useful to im-
prove some objective function within an optimiza-
tion process, to generate good visual summaries or to
compare quantitatively and evaluate visual summaries
produced by different methods.
An important work in image retrieving was re-
cently presented by Avidan (Avidan and Shamir,
2007), and improved by Rubinstein (Rubinstein et al.,
2008) for video retargeting. In (Avidan and Shamir,
2007) the authors create an operator called seam carv-
ing that permits the reduction and enlarging of im-
ages. A seam was defined as an optimal 8-connected
path of pixels between opposing margins of the im-
age, where optimally is determined using an image
energy function. The algorithm preserves the image
structure by removing more low energy pixels than
high energy pixels. The computation of the optimal
seam is based on an efficient dynamic programming
algorithm. In (Rubinstein et al., 2008) the authors
extend the work for video. There, the dynamic pro-
gramming had to be replaced by graph cuts, suitable
for 3D volumes. A novel energy function was also in-
troduced, improving the visual quality of both videos
and images.
The principal problem of the algorithm in (Avidan
and Shamir, 2007; Rubinstein et al., 2008) is related
with the lower performance when applied to images
with big sizes. The algorithm is slow because only
one line is discarded per iteration.
3 A STABLE PATH APPROACH
FOR IMAGE RESIZING
In the work to be detailed, the image grid is consid-
ered as a graph with pixels as nodes and arcs con-
necting neighbouring pixels. The weight of each arc,
w(p, q), is a function of pixels values and pixels rela-
tive positions. A path from vertex (pixel) v
1
to vertex
(pixel) v
n
is a list of unique vertices v
1
, v
2
, . . . , v
n
, with
v
i
and v
i+1
corresponding to neighbour pixels. The to-
tal cost of a path is the sum of each arc weight in the
path
∑
n
i=2
w(v
i−1
, v
i
).
A path from a source vertex v to a target vertex u
is said to be a shortest path if its total cost is minimum
among all v-to-u paths. The distance between a source
vertex v and a target vertex u on a graph, d(v, u), is the
total cost of a shortest path between v and u.
A path from a sub-graph Ω
1
to a sub-graph Ω
2
is
said to be a shortest path between Ω
1
and Ω
2
if its
total cost is minimum among all v ∈ Ω
1
-to-u ∈ Ω
2
paths. The distance from a sub-graph Ω
1
to a sub-
graph Ω
2
, d(Ω
1
, Ω
2
), is the total cost of a shortest
path between Ω
1
and Ω
2
:
d(Ω
1
, Ω
2
) = min
v∈Ω
1
,u∈Ω
2
d(v, u). (1)
IMAGE RETARGETING USING STABLE PATHS
41