
 
normalized cross correlation (Hsu et al., 2005). A 
real-time implementation that adopts the two 
images’ matching through the Fourier-Mellin 
transformation has been reported in (Martinez et al., 
2004). The use of fuzzy logic for the global motion 
vector computation can produce optimal results 
(Güllü and Ertürk, 2004). In order to enhance the 
compensated frame position Kalman filtering was 
utilized (Hsu et al., 2005, Ertürk, 2002). The 
estimation of the motion in a sequence is also 
realized by optical flow techniques. The 
approximation of the image flow field provides both 
the translational and rotational information. The 
undesired motion effects are calculated in (Suk et 
al., 2005) by estimating the rotational center and the 
angular frequency from the local translational 
motion definition by fine-to-coarse multi-resolution 
motion estimation. In (Pauwels et al., 2007) the 
stabilization is accomplished by fixating at the 
central image region, whilst optical flow estimation 
optimizes this approximation. In most of the cases 
the global motion vector is computed via a series of 
local motion vectors. These describe the movement 
in a particle of the image, which results to a better 
estimation of the indented camera movement and the 
undesired motion. 
In this paper, a novel fuzzy Kalman digital 
image stabilization technique in the log-polar plane 
is proposed. First a transformation from the 
Cartesian plane to the log-polar one takes place. The 
acquired log-polar image sequence provides lesser 
information in the background of the scenery than in 
the foreground. This is due to the proper attribute of 
the log-polar transformation to preserve high-
resolution at the center of the image, which 
diminishes logarithmiticaly towards the periphery. 
The motion estimation in the log-polar plane 
provides a space-variant distribution of the local 
motion vectors due to the aforementioned nature of 
the log-polar plane. Consequently, the extracted 
local motion vectors are imported into a recursive 
fuzzy system based to the one presented in (Güllü 
and Ertürk, 2004). However there are some distinct 
differences. One lies to the fact that in this paper, the 
fuzzy system utilizes the Kalman filter’s 
mathematical model to filter the inputs 
straightforwardly. Moreover, no mean operation 
filtering takes place to the measured fluctuations. 
Finally, the filtered vectors, define the global motion 
vector from which the compensation vector is 
calculated. The innovation of using log-polar images 
for the motion field extraction provided optimal 
results not only to the stabilization of each frame, 
but also to the visual quality of the video output. The 
advantages of the log-polar plane are well exploited, 
as (i) the processing time is lesser, (ii) a single 
motion estimation extraction provides information 
for both the rotational and translational irregularities 
and (iii) the center of attention has a higher impact 
to the whole process without further preprocessing. 
2 LOG-POLAR 
TRANSFORMATION 
The motion estimation process preserves high 
computational burden, so it is normally improper for 
real-time applications. One way to overcome the 
computational burden is to sub-sample the images. 
Yet, to estimate the motion field, all available 
information is needed. Thus, a resolution decrease is 
inappropriate as it causes loss of major information 
and the provided results are sparse and inaccurate. 
However, the volume of the image data can be 
reduced by a topological arrangement, without loss 
of information. Notably, a space-variant 
arrangement such as log-polar provides lesser image 
data without constraining the field of view, or the 
image resolution at the fixation point. The log-polar 
transformation is based on the human’s eyes 
projections of the retina plane to the visual cortex. It 
finds its origins into studies on the vision 
mechanisms of the primates. The adoption of this 
topology into artificial vision systems exhibits 
several advantages as in visual attention, throughput 
rate and real-time processing. Many applications of 
the log-polar transformation have been reported, 
such as the time-to-impact estimation (Tistarelli and 
Sandini, 1993), wavelet extraction based on log-
polar mapping (Pun and Lee, 2003), tracking (Metta 
et al., 2004) and disparity estimation and vergence 
control in (Manzotti et al., 2001). 
   
Figure 1: The log-polar transformation maps radial lines 
and concentric circles into lines parallel to the coordinate 
axes. 
The mathematical model of the log-polar 
mapping can be expressed as a transformation 
between the polar plane (ρ,  θ) (retinal plane), the 
log-polar plane (η,  ξ) (cortical plane) and the 
Cartesian plane (x, y) (image plane). 
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
470